Heat Equation

Background knowledge

A. The Heat Equation in 1D

1. The differential equation

The heat equation in one dimension is given by:

\frac{\partial u}{\partial t} - \frac{\partial}{\partial x} (α \frac{\partial u}{\partial x}) = f for x \in (0, L) and t > 0

where:

$x$ is the spatial coordinate,
$t$ is the time,
$α (x)$ is the spatially-varying thermal diffusivity of the material,
$u (x, t)$ is the temperature distribution function,
$f (x, t)$ is a time dependent and spatially varying heat source.

2. Initial and boundary conditions

To solve the heat equation, we must specify initial and boundary conditions:

Initial Condition: This refers to the temperature distribution at the initial time $t = 0$ . We need to know this temperature distribution in our entire domain $(0, L)$ .

u (x, 0) = u_{0} (x)

Boundary Conditions: We can specify different types of boundary conditions, i.e., conditions on the boundaries of our domain, $x = 0$ and $x = L$ .

Dirichlet Boundary Conditions: This boundary condition means that the temperature is specified at the boundaries of the domain.
$u (0, t) = u_{0} and u (L, t) = u_{L}$
Neumann Boundary Conditions: This boundary condition means that, instead of the temperature, the derivative of the temperature (which is called the heat flux) is specified at the boundaries of the domain.
$α (0) \frac{\partial u}{\partial x} (0, t) = g_{0} and α (L) \frac{\partial u}{\partial x} (L, t) = g_{L}$

3. Applications

A simplified example where the heat equation can be used is to find out how the temperature is distributed through the outside, insulating walls of your apartment. Look at the image below (source).

The insulation wall is made up of several materials, each with their own thermal diffusivities $α_{i} (x)$ . Imagine that the temperature outside is 0 degrees, and your heating system holds the temperature inside your house at 18 degrees. Then, these are the boundary conditions for the heat equation. Given an initial temperature distribution through the insulation wall, you could use the heat equation to find out how the temperature varies inside the insulation wall.

B. Solving the Heat Equation

1. Disadvantages of the above formulation

Now, if we want to compute the solution of the heat equation as stated above, we run into two difficulties:

Lack of exact solutions: For a general function $f$ , finding out the exact, analytical solution $u$ is not an easy task. Well, this is not entirely true: finding the solution can be easy enough in one dimension on a domain as simple as $(0, L)$ , but in higher dimensions and on more complicated geometries (e.g., imagine the insulation wall of the Guggenheim museum) it is not possible.
A restrictive set of solutions: For the above form of the heat equation to make sense, we must also assume that the second-derivatives of the solution $u (x)$ should exist, and that the first derivatives of the thermal diffusivity $α (x)$ should exist. It turns out that this is too strong of a requirement that it not satisfied by many physical systems.

For example, think of the insulation wall - each material in the insulation wall has its own thermal diffusivity which is completely unrelated to the diffusivities of the other materials. As a result, $α (x)$ is a discontinuous function and its first derivatives do not make sense.

2. Tackling the above disadvantages using a discrete & weaker formulation

The above disadvantages are the reason why, in practice, the above strong formulation of the heat equation is not useful. Instead, we formulate a discrete, weak version of the equation which is much more useful in practice. The motivation is:

Discrete approximation of unknown exact solutions: Since we don't know the exact solution in general, we try to approximate it. This is the process called discretization. In this process, we fix a finite-dimensional vector space of spatially-varying functions $V_{n}$ and say that, for any given time $t$ , we want to find a function $u_{n} (\cdot, t) \in V_{n}$ which approximates the exact solution $u (\cdot, t)$ . Here, $n$ denotes the dimension of the vector space $V_{n}$ . We expect that as $n \to \infty$ , the solution $u_{n} (\cdot, t) \to u (\cdot, t)$ .
Weak version of the equation: Since the original equation imposes too strong requirements on the smoothness of $u (x, t)$ and $α (x)$ , we instead work with an integral formulation where only the first derivatives of $u (x, t)$ should make sense, and where $α (x)$ is allowed to be discontinuous.

ASSUMPTION: For the sake of simplifying the discussion, from now on we will assume that we are imposing Dirichlet boundary conditions at $x = 0$ and $x = L$ .

This discrete, weak version of the problem at a fixed time $t$ is stated as: find $u_{n} (\cdot, t) \in S_{n}$ such that

\int_{0}^{L} w_{n} \frac{\partial u_{n}}{\partial t} d x + \int_{0}^{L} α \frac{\partial u_{n}}{\partial x} \frac{\partial w_{n}}{\partial x} d x = \int_{0}^{L} f w_{n} d x, \forall w_{n} \in W_{n},

where:

$S_{n} := {v_{n} (x) \in V_{n} : v_{n} (0) = u_{0}, v_{n} (L) = u_{L}}$ ,
$W_{n} := {v_{n} (x) \in V_{n} : v_{n} (0) = 0, v_{n} (L) = 0}$ .

Note the following important things:

The above problem tries to find the solution $u_{n} (\cdot, t)$ at the fixed time instant $t$ .
$S_{n}$ consists of spatially-varying functions in $V_{n}$ that satisfy the boundary conditions.
We want the integral equation above to be satisfied for all functions $w_{n} \in W_{n}$ , where the function space $W_{n}$ consists of spatially-varying functions in $V_{n}$ that satisfy homogeneous (or, equivalently, zero) boundary conditions.

The finite element method

Now we are at a stage where, if we make a choice for $V_{n}$ , we can convert the discrete weak problem into a system of ODEs. This section will explain how.

A. How to choose $V_{n}$ ?

1. Choosing $V_{n}$

In the finite element method, we choose $V_{n}$ as the space of piecewise-polynomial functions of degree $p$ on a mesh of the domain $(0, L)$ .

Meshing the domain

We choose a set of $N + 1$ points, $0 = x_{1} < x_{1} < x_{2} < \dots < x_{N + 1} = L$ , and these points divide the domain $(0, L)$ into smaller subdomains, $(x_{i}, x_{i + 1})$ with $x_{i} \in (0, L)$ , called elements. That is, we assume that there are $N$ elements in our mesh.

ASSUMPTION: In the following code, we will always assume that the mesh is uniform. In other words, $x_{i + 1} - x_{i}$ is equal to $L / N$ for all $i$ .

Defining $V_{n}$

The space $V_{n}$ is defined as the space of functions $v_{n}$ such that:

on any element (i.e., on the interval $(x_{i}, x_{i + 1})$ $w i t h i = 1, \dots, N$ ) it is a polynomial of degree $p$ ,
at each $x_{i}$ , $i = 2, \dots, N$ , the function $v_{n}$ is $C^{k}$ smooth for some $k \geq 0$ .

Once we do this, the vector-space dimension of $V_{n}$ can be related to the parameters $N, p, k$ as follows:

n = (p + 1) N - (k + 1) (N - 1) .

In other words, there are $n$ basis functions $ϕ_{i} (x)$ , $i = 1, \dots, n$ , such that any arbitrary $v_{n} \in V_{n}$ can be represented as:

v_{n} (x) = \sum_{i = 1}^{n} c_{i} ϕ_{i} (x),

for some numbers $c_{i} \in R$ .

EXAMPLE: $V_{n}$ with $(N, p, k) = (4, 1, 0)$ .

Consider the space of functions that are linear polynomials over each mesh element, and which are $C^{0}$ smooth (or, equivalently, continuous) at the interfaces $x_{i}$ between the elements. This space of functions has dimension:

n = (p + 1) N - (k + 1) (N - 1) = 2 \times 4 - 1 \times 3 = 5 .

So, we can find 5 basis functions, $ϕ_{1}, ϕ_{2}, \dots ϕ_{5}$ , that span the space $V_{n}$ . Run the code below to create such a $V_{n}$ and look at one such choice of the basis functions called hat functions. Convince yourself that linear combinations of these functions can be used to represent any piecewise-linear polynomial function on the mesh. (Each function is plotted in a different color.)

julia

using Mantis
using GLMakie

# The size of the domain where to solve our problem
L = 1.0

# The degree of the piecewise-polynomial basis functions
p = 1
# The number of elements in the mesh
N = 4
# The smoothness of the basis functions (must be smaller than the polynomial degree, and
# larger than -1)
k = 0

# The number of basis functions in the piecewise-polynomial function space
n = N * (p + 1) - (k + 1) * (N - 1)

# Create the mesh and the function space
breakpoints = LinRange(0.0, L, N+1)
line_geo = Geometry.CartesianGeometry((breakpoints,))
B = FunctionSpaces.BSplineSpace(line_geo, p, k)

# Create a Form Space.
BF = Forms.FormSpace(0, B, "b")

# Plot the basis functions.
n_plot_points_per_element = 25

fig = Figure()
ax = Axis(fig[1, 1],
    title = "Basis functions of V_n",
    xlabel = "x",
    ylabel = "b_i(x)",
)

n_elements = Geometry.get_num_elements(line_geo)
xi = Points.CartesianPoints((LinRange(0.0, 1.0, n_plot_points_per_element),))
BFF = Forms.FormField(BF, " ")

dim_V = Forms.get_num_basis(BF)
colors = [:blue, :green, :red, :purple, :orange]
for basis_idx in 1:dim_V

    BFF.coefficients[basis_idx] = 1.0
    if basis_idx > 1
        BFF.coefficients[basis_idx - 1] = 0.0
    end

    color_i = colors[basis_idx]

    for element_idx in 1:n_elements
        form_eval, _ = Forms.evaluate(BFF, element_idx, xi)
        x = Geometry.evaluate(Forms.get_geometry(BF), element_idx, xi)

        lines!(ax, x[:], form_eval[1], color=color_i, label=L"\phi_{%$basis_idx}")

        scatter!(ax, x[:][[1, end]], [0.0, 0.0], color=:tomato)
    end
end
fig[1, 2] = Legend(fig, ax, marge=true, unique=true)

fig = DisplayAs.Text(DisplayAs.PNG(fig))

EXAMPLE: $V_{n}$ with $(N, p, k) = (4, 2, 1)$ .

Consider now the space of functions that are quadratic polynomials over each mesh element, and which are $C^{1}$ smooth (or, equivalently, continuous and continuously differentiable) at the interfaces $x_{i}$ between the elements. This space of functions has dimension:

n = (p + 1) N - (k + 1) (N - 1) = 3 \times 4 - 2 \times 3 = 6 .

So, we can find 6 basis functions, $ϕ_{1}, ϕ_{2}, \dots ϕ_{6}$ , that span the space $V_{n}$ . Run the code below to create such a $V_{n}$ and look at one such choice of the basis functions called B-splines. (Each function is plotted in a different color.)

julia

p = 2
k = 1
n = N * (p + 1) - (k + 1) * (N - 1)

B = FunctionSpaces.BSplineSpace(line_geo, p, k)

# Create a Form Space.
BF = Forms.FormSpace(0, B, "b")

fig = Figure()
ax = Axis(fig[1, 1],
    title = "Basis functions of V_n",
    xlabel = "x",
    ylabel = "b_i(x)",
)

n_elements = Geometry.get_num_elements(line_geo)
xi = Points.CartesianPoints((LinRange(0.0, 1.0, n_plot_points_per_element),))
BFF = Forms.FormField(BF, " ")

dim_V = Forms.get_num_basis(BF)
colors = [:blue, :green, :red, :purple, :orange, :black]
for basis_idx in 1:dim_V

    BFF.coefficients[basis_idx] = 1.0
    if basis_idx > 1
        BFF.coefficients[basis_idx - 1] = 0.0
    end

    color_i = colors[basis_idx]

    for element_idx in 1:n_elements
        form_eval, _ = Forms.evaluate(BFF, element_idx, xi)
        x = Geometry.evaluate(Forms.get_geometry(BF), element_idx, xi)

        lines!(ax, x[:], form_eval[1], color=color_i, label=L"\phi_{%$basis_idx}")

        scatter!(ax, x[:][[1, end]], [0.0, 0.0], color=:tomato)
    end
end
fig[1, 2] = Legend(fig, ax, marge=true, unique=true)

fig = DisplayAs.Text(DisplayAs.PNG(fig))

Note: In both of the above examples, the only functions non-zero at $x = 0$ and $x = L$ are $ϕ_{1}$ and $ϕ_{n}$ . This means that, in particular, the functions $ϕ_{2}, \dots, ϕ_{n - 1}$ form a basis for $W_{n}$ . We will use this fact later on.

Since, at each time instant $t$ , our approximate solution $u_{n} (x, t)$ is represented as a linear combination of the basis functions $ϕ_{i}$ , $i = 1, \dots, n$ , that span $V_{n}$ , this means that our approximate solution has the following form:

u_{n} (x, t) = \sum_{i = 1}^{n} c_{i} (t) ϕ_{i} (x) .

In other words, the coefficients of the linear combination are time-dependent. But we can say more! Since $u_{n} (0, t) = u_{0}$ and $u_{n} (L, t) = u_{L}$ are the boundary conditions, then we must have:

u_{n} (x, t) = u_{0} ϕ_{1} (x) + \sum_{i = 2}^{n - 1} c_{i} (t) ϕ_{i} (x) + u_{L} ϕ_{n} (x) .

That is, the only unknown coefficients in the above expression are $c_{i} (t)$ , $i = 2, \dots, n - 1$ .

ASSUMPTION: For simplicity, we assume that $u_{0}$ and $u_{L}$ are constants.

B. Assembling the System of ODEs

Now that we have arrived at an explicit form of our approximate solution to the weak problem, let us see how the discrete weak problem leads to a system of ODEs for the coefficients $c_{i} (t)$ . This process is called assembly and it leads to a system of ODEs that looks like:

M \frac{d C}{d t} + K C = F - u_{0} F^{b, 0} - u_{L} F^{b, L},

where we have arranged the unknown coefficients $c_{i} (t)$ in a vector $C (t) := [c_{2} (t), c_{3} (t), \dots, c_{n - 1} (t)]$ .

Some terminology: in the above system of ODEs, $M$ is called the mass matrix, $K$ is called the stiffness matrix, $F$ is called the load vector, $F^{b, 0}$ and $F^{b, L}$ are the contributions of the known coefficients ( $c_{1}$ and $c_{n}$ ) to the loading, respectively, and $C$ is the vector of unknown coefficients that define the solution.

The idea behind assembly is simple. We substitute the assumed form of our discrete solution $u_{n}$ into the discrete weak problem. This gives us:

\int_{0}^{L} w_{n} \frac{\partial}{\partial t} (\sum_{j = 1}^{n} c_{j} ϕ_{j}) d x + \int_{0}^{L} α \frac{\partial w_{n}}{\partial x} \frac{\partial}{\partial x} (\sum_{j = 1}^{n} c_{j} ϕ_{j}) d x = \int_{0}^{L} f w_{n} d x, \forall w_{n} \in W_{n},

Since we need to satisfy the above equation for all $w_{n} \in W_{n}$ , and since the above equation is linear in $w_{n}$ , it is actually enough if we satisfy the above equation for the basis functions that span $W_{n}$ , i.e., $ϕ_{i}$ , $i = 2, \dots, n - 1$ . Then, choosing $w_{n} = ϕ_{i}$ gives us the following equation, and we get one such equation for each $i = 2, \dots, n - 1$ ,

\int_{0}^{L} ϕ_{i} \frac{\partial}{\partial t} (\sum_{j = 1}^{n} c_{j} ϕ_{j}) d x + \int_{0}^{L} α \frac{\partial ϕ_{i}}{\partial x} \frac{\partial}{\partial x} (\sum_{j = 1}^{n} c_{j} ϕ_{j}) d x = \int_{0}^{L} f ϕ_{i} d x .

We can rearrange this equation as:

\sum_{j = 2}^{n - 1} \frac{d c_{j}}{d t} \int_{0}^{L} ϕ_{i} ϕ_{j} d x + \sum_{j = 2}^{n - 1} c_{j} \int_{0}^{L} α \frac{d ϕ_{i}}{d x} \frac{d ϕ_{j}}{d x} d x = \int_{0}^{L} f ϕ_{i} - u_{0} \int_{0}^{L} α \frac{d ϕ_{i}}{d x} \frac{d ϕ_{1}}{d x} d x - u_{L} \int_{0}^{L} α \frac{d ϕ_{i}}{d x} \frac{d ϕ_{n}}{d x} d x .

Then, it is easy to see that this equation represents the ODE system at the beginning of this section by defining:

$M_{i j} = \int_{0}^{L} ϕ_{i} ϕ_{j} d x$ ,
$K_{i j} = \int_{0}^{L} \frac{d ϕ_{i}}{d x} \frac{d ϕ_{j}}{d x} d x$ ,
$F_{i} = \int_{0}^{L} ϕ_{i} f d x$ ,
$F_{i}^{b, 0} = \int_{0}^{L} α \frac{d ϕ_{i}}{d x} \frac{d ϕ_{1}}{d x} d x$ ,
$F_{i}^{b, L} = \int_{0}^{L} α \frac{d ϕ_{i}}{d x} \frac{d ϕ_{n}}{d x} d x .$

To assemble the matrices $M$ and $K$ and the vectors $F$ , $F^{b, 0}$ and $F^{b, L}$ , we first must define $α$ and $f$ .

Before choosing our forcing term, it is relevant to briefly analyse the behavior of our solution. We saw that our weak form of the equation is

\int_{0}^{L} w_{n} \frac{\partial u_{n}}{\partial t} d x + \int_{0}^{L} α \frac{\partial u_{n}}{\partial x} \frac{\partial w_{n}}{\partial x} d x = \int_{0}^{L} f w_{n} d x, \forall w_{n} \in W_{n},

Since we have Dirichlet boundary conditions (i.e., we enforce the value of the temperature on both sides of our interval), if we prescribe a stationary heat source, the solution will evolve to a stationary state. This stationary state, $u_{h}^{s}$ will be the one that satisfies

\int_{0}^{L} α \frac{\partial u_{n}}{\partial x} \frac{\partial w_{n}}{\partial x} d x = \int_{0}^{L} f w_{n} d x, \forall w_{n} \in W_{n},

or in matrix form

K C = F - u_{0} F^{b, 0} - u_{L} F^{b, L} .

Additionally, if $α$ is constant, and if the heat source $f$ is smooth, then we can easily construct analytical solutions to the stationary state. For example,

u^{s} (x, t) = 1 + \frac{1}{2} \cos (\frac{2 π}{L} x),

if the heat source is

f (x, t) = \frac{2 α π^{2}}{L^{2}} \cos (\frac{2 π}{L} x) .

We choose a finite element space with $(N, p, k) = (10, 2, 1)$ , i.e., with more elements compared to the last example. This is to ensure that we have sufficient accuracy for computing a decent solution.

This page was generated using Literate.jl.

Heat Equation ​

Background knowledge ​

A. The Heat Equation in 1D ​

1. The differential equation ​

2. Initial and boundary conditions ​

3. Applications ​

B. Solving the Heat Equation ​

1. Disadvantages of the above formulation ​

2. Tackling the above disadvantages using a discrete & weaker formulation ​

The finite element method ​

A. How to choose Vn? ​

1. Choosing Vn ​

Meshing the domain ​

Defining Vn ​

B. Assembling the System of ODEs ​