Jump to content

Linear system/Solution method/Introduction/Section

From Wikiversity

It is not clear a priori what solving a (linear) system of equation is supposed to mean. Anyway, the goal is to find a quite good description of the solution set. If there is only one solution, then we want to find it. If there is no solution at all, we want to detect this in a reasonable way. In general, the solution set of a system of equations is large. In this case, solving a system means to identify "free“ variables, for which arbitrary values are allowed, and to describe explicitly how the other "dependent“ variables can be expressed in terms of the free variables. This is called an explicit description of the solution set.

Linear systems of equations can be solved systematically with the elimination process. With this method, variables are eliminated step by step, until a very simple equivalent linear system in triangular form arises, which can be solved directly (or from where we can deduce that there is no solution). We consider a typical example in many variables.


We want to solve the inhomogeneous linear system

over (or over ). Firstly, we eliminate by keeping the first row , replacing the second row by , and replacing the third row by . This yields

Now, we can eliminate from the (new) third row, with the help of the second row. Because of the fractions, we rather eliminate (which eliminates also ). We leave the first and the second row as they are, and we replace the third row by . This yields the system, in a new ordering of the variables,[1]

Now we can choose an arbitrary (free) value for . The third row determines uniquely, we must have

In the second equation, we can choose arbitrarily, this determines via

The first row determines , namely

Hence, the solution set is

A particularly simple solution is obtained by equating the free variables and with . This yields the special solution

The general solution set can also be written as

Here,

is a description of the general solution of the corresponding homogeneous linear system.


Let denote a field, and let two (inhomogeneous) systems of linear equations,

with respect to the same set of variables, be given. The systems are called equivalent, if their solution sets are identical.


Let be a field, and let

be an inhomogeneous system of linear equations over . Then the following manipulations on this system yield an equivalent system.

  1. Swapping two equations.
  2. The multiplication of an equation by a scalar .
  3. The omitting of an equation, if it occurs twice.
  4. The duplication of an equation (in the sense to write down the equation again).
  5. The omitting or adding of a zero row (zero equation).
  6. The replacement of an equation by the equation that arises if we add to another equation of the system.

Most statements are immediately clear. (2) follows from the fact that if

holds, then also

holds for every . If , then this implication can be reversed by multiplication with .

(6). Let be the equation

and be the equation

If a tuple satisfies both equations, then it also satisfies the equation . And if the tuple satisfies the equations and , then it also satisfies the equation and .


For finding the solution set of a linear system, the manipulations (2) and (6) are most important. In general, these two steps are combined, and the equation is replaced by an equation of the form (with ). Here, has to be chosen in such a way that the new equation contains one variable less than the old equation. This process is called elimination of a variable. This elimination is not only applied to one equation, but for all equations except one (suitable chosen) "working row“ , and with a fixed "working variable“. The following elimination lemma describes this step.


Let denote a field, and let denote an (inhomogeneous) system of linear equations over in the variables . Suppose that is a variable which occurs in at least one equation with a coefficient . Then every equation , different from ,[2] can be replaced by an equation , in which does not occur any more, and such that the new system of equations that consists of and the equations , is equivalent with the system .

Changing the numbering, we may assume . Let be the equation

(with ), and let be the equation

Then the equation

has the form

and does not occur in it. Because of , the systems are equivalent.


The method of this lemma, called Gauß elimination process, can be applied successively in order to obtain a linear system in triangular form.


Every (inhomogeneous) system of linear equations over a field can be transformed, by the manipulations described in fact, to an equivalent linear system of the form

where, in each row, the first coefficient is different from . Here, either , and the last row can be omitted, or , and then the system has no solution at all.

With the help of renaming the variables, we get an equivalent system of the form

with diagonal elements

.

This follows directly from the elimination lemma, by eliminating successively variables. Elimination is applied firstly to the first variable (in the given ordering), say , which occurs in at least one equation with a coefficient (if it only occurs in one equation, then this elimination step is already done). This elimination process is applied as long as the new subsystem (without the working equation used in the elimination step before) has at least one equation with a coefficient for one variable different from . After this, we have in the end only equations without variables, and they are either only zero equations, or there is no solution.

When we set , and denote the other variables with , then we obtain the described system in triangular form.


It might happen that the variable does not appear in the system with a coefficient , and that, in the elimination process, more than one variable is eliminated at the same time. Then one gets a linear system in echelon form, which can be transformed to a triangular form by a change of variables.

  1. Such a reordering is safe as long as we keep the names of the variables. But if we write down the system in matrix notation without the variables, then one has to be careful and remember the reordering of the columns.
  2. It is enough that these equations have a different index in the system.