Orthogonal coordinates

From The Right Wiki
(Redirected from Orthogonal coordinate)
Jump to navigationJump to search

In mathematics, orthogonal coordinates are defined as a set of d coordinates q=(q1,q2,,qd) in which the coordinate hypersurfaces all meet at right angles (note that superscripts are indices, not exponents). A coordinate surface for a particular coordinate qk is the curve, surface, or hypersurface on which qk is a constant. For example, the three-dimensional Cartesian coordinates (x, y, z) is an orthogonal coordinate system, since its coordinate surfaces x = constant, y = constant, and z = constant are planes that meet at right angles to one another, i.e., are perpendicular. Orthogonal coordinates are a special but extremely common case of curvilinear coordinates.

Motivation

File:Conformal map.svg
A conformal map acting on a rectangular grid. Note that the orthogonality of the curved grid is retained.

While vector operations and physical laws are normally easiest to derive in Cartesian coordinates, non-Cartesian orthogonal coordinates are often used instead for the solution of various problems, especially boundary value problems, such as those arising in field theories of quantum mechanics, fluid flow, electrodynamics, plasma physics and the diffusion of chemical species or heat. The chief advantage of non-Cartesian coordinates is that they can be chosen to match the symmetry of the problem. For example, the pressure wave due to an explosion far from the ground (or other barriers) depends on 3D space in Cartesian coordinates, however the pressure predominantly moves away from the center, so that in spherical coordinates the problem becomes very nearly one-dimensional (since the pressure wave dominantly depends only on time and the distance from the center). Another example is (slow) fluid in a straight circular pipe: in Cartesian coordinates, one has to solve a (difficult) two dimensional boundary value problem involving a partial differential equation, but in cylindrical coordinates the problem becomes one-dimensional with an ordinary differential equation instead of a partial differential equation. The reason to prefer orthogonal coordinates instead of general curvilinear coordinates is simplicity: many complications arise when coordinates are not orthogonal. For example, in orthogonal coordinates many problems may be solved by separation of variables. Separation of variables is a mathematical technique that converts a complex d-dimensional problem into d one-dimensional problems that can be solved in terms of known functions. Many equations can be reduced to Laplace's equation or the Helmholtz equation. Laplace's equation is separable in 13 orthogonal coordinate systems (the 14 listed in the table below with the exception of toroidal), and the Helmholtz equation is separable in 11 orthogonal coordinate systems.[1][2] Orthogonal coordinates never have off-diagonal terms in their metric tensor. In other words, the infinitesimal squared distance ds2 can always be written as a scaled sum of the squared infinitesimal coordinate displacements

ds2=k=1d(hkdqk)2

where d is the dimension and the scaling functions (or scale factors)

hk(q)=defgkk(q)=|ek|

equal the square roots of the diagonal components of the metric tensor, or the lengths of the local basis vectors ek described below. These scaling functions hi are used to calculate differential operators in the new coordinates, e.g., the gradient, the Laplacian, the divergence and the curl. A simple method for generating orthogonal coordinates systems in two dimensions is by a conformal mapping of a standard two-dimensional grid of Cartesian coordinates (x, y). A complex number z = x + iy can be formed from the real coordinates x and y, where i represents the imaginary unit. Any holomorphic function w = f(z) with non-zero complex derivative will produce a conformal mapping; if the resulting complex number is written w = u + iv, then the curves of constant u and v intersect at right angles, just as the original lines of constant x and y did. Orthogonal coordinates in three and higher dimensions can be generated from an orthogonal two-dimensional coordinate system, either by projecting it into a new dimension (cylindrical coordinates) or by rotating the two-dimensional system about one of its symmetry axes. However, there are other orthogonal coordinate systems in three dimensions that cannot be obtained by projecting or rotating a two-dimensional system, such as the ellipsoidal coordinates. More general orthogonal coordinates may be obtained by starting with some necessary coordinate surfaces and considering their orthogonal trajectories.

Basis vectors

Covariant basis

In Cartesian coordinates, the basis vectors are fixed (constant). In the more general setting of curvilinear coordinates, a point in space is specified by the coordinates, and at every such point there is bound a set of basis vectors, which generally are not constant: this is the essence of curvilinear coordinates in general and is a very important concept. What distinguishes orthogonal coordinates is that, though the basis vectors vary, they are always orthogonal with respect to each other. In other words,

eiej=0ifij

These basis vectors are by definition the tangent vectors of the curves obtained by varying one coordinate, keeping the others fixed:

File:OrthogonalCoordinates.png
Visualization of 2D orthogonal coordinates. Curves obtained by holding all but one coordinate constant are shown, along with basis vectors. Note that the basis vectors aren't of equal length: they need not be, they only need to be orthogonal.
ei=rqi

where r is some point and qi is the coordinate for which the basis vector is extracted. In other words, a curve is obtained by fixing all but one coordinate; the unfixed coordinate is varied as in a parametric curve, and the derivative of the curve with respect to the parameter (the varying coordinate) is the basis vector for that coordinate. Note that the vectors are not necessarily of equal length. The useful functions known as scale factors of the coordinates are simply the lengths hi of the basis vectors ei (see table below). The scale factors are sometimes called Lamé coefficients, not to be confused with Lamé parameters (solid mechanics). The normalized basis vectors are notated with a hat and obtained by dividing by the length:

e^i=eihi=ei|ei|

A vector field may be specified by its components with respect to the basis vectors or the normalized basis vectors, and one must be sure which case is meant. Components in the normalized basis are most common in applications for clarity of the quantities (for example, one may want to deal with tangential velocity instead of tangential velocity times a scale factor); in derivations the normalized basis is less common since it is more complicated.

Contravariant basis

The basis vectors shown above are covariant basis vectors (because they "co-vary" with vectors). In the case of orthogonal coordinates, the contravariant basis vectors are easy to find since they will be in the same direction as the covariant vectors but reciprocal length (for this reason, the two sets of basis vectors are said to be reciprocal with respect to each other):

ei=e^ihi=eihi2

this follows from the fact that, by definition, eiej=δij, using the Kronecker delta. Note that:

e^i=eihi=hieie^i

We now face three different basis sets commonly used to describe vectors in orthogonal coordinates: the covariant basis ei, the contravariant basis ei, and the normalized basis êi. While a vector is an objective quantity, meaning its identity is independent of any coordinate system, the components of a vector depend on what basis the vector is represented in. To avoid confusion, the components of the vector x with respect to the ei basis are represented as xi, while the components with respect to the ei basis are represented as xi:

x=ixiei=ixiei

The position of the indices represent how the components are calculated (upper indices should not be confused with exponentiation). Note that the summation symbols Σ (capital Sigma) and the summation range, indicating summation over all basis vectors (i = 1, 2, ..., d), are often omitted. The components are related simply by:

hi2xi=xi

There is no distinguishing widespread notation in use for vector components with respect to the normalized basis; in this article we'll use subscripts for vector components and note that the components are calculated in the normalized basis.

Vector algebra

Vector addition and negation are done component-wise just as in Cartesian coordinates with no complication. Extra considerations may be necessary for other vector operations. Note however, that all of these operations assume that two vectors in a vector field are bound to the same point (in other words, the tails of vectors coincide). Since basis vectors generally vary in orthogonal coordinates, if two vectors are added whose components are calculated at different points in space, the different basis vectors require consideration.

Dot product

The dot product in Cartesian coordinates (Euclidean space with an orthonormal basis set) is simply the sum of the products of components. In orthogonal coordinates, the dot product of two vectors x and y takes this familiar form when the components of the vectors are calculated in the normalized basis:

xy=ixie^ijyje^j=ixiyi

This is an immediate consequence of the fact that the normalized basis at some point can form a Cartesian coordinate system: the basis set is orthonormal. For components in the covariant or contravariant bases,

xy=ihi2xiyi=ixiyihi2=ixiyi=ixiyi

This can be readily derived by writing out the vectors in component form, normalizing the basis vectors, and taking the dot product. For example, in 2D:

xy=(x1e1+x2e2)(y1e1+y2e2)=(x1h1e^1+x2h2e^2)(y1e^1h1+y2e^2h2)=x1y1+x2y2

where the fact that the normalized covariant and contravariant bases are equal has been used.

Cross product

The cross product in 3D Cartesian coordinates is:

x×y=(x2y3x3y2)e^1+(x3y1x1y3)e^2+(x1y2x2y1)e^3

The above formula then remains valid in orthogonal coordinates if the components are calculated in the normalized basis. To construct the cross product in orthogonal coordinates with covariant or contravariant bases we again must simply normalize the basis vectors, for example:

x×y=ixiei×jyjej=ixihie^i×jyjhje^j

which, written expanded out,

x×y=(x2y3x3y2)h2h3h1e1+(x3y1x1y3)h1h3h2e2+(x1y2x2y1)h1h2h3e3

Terse notation for the cross product, which simplifies generalization to non-orthogonal coordinates and higher dimensions, is possible with the Levi-Civita tensor, which will have components other than zeros and ones if the scale factors are not all equal to one.

Vector calculus

Differentiation

Looking at an infinitesimal displacement from some point, it's apparent that

dr=irqidqi=ieidqi

By definition, the gradient of a function must satisfy (this definition remains true if ƒ is any tensor)

df=fdrdf=fieidqi

It follows then that del operator must be:

=ieiqi

and this happens to remain true in general curvilinear coordinates. Quantities like the gradient and Laplacian follow through proper application of this operator.

Basis vector formulae

From dr and normalized basis vectors êi, the following can be constructed.[3][4]

Differential element Vectors Scalars
Line element Tangent vector to coordinate curve qi:

d=hidqie^i=rqidqi

Infinitesimal length

d=drdr=(h1dq1)2+(h2dq2)2+(h3dq3)2

Surface element Normal to coordinate surface qk = constant:

dS=(hidqie^i)×(hjdqje^j)=dqidqj(rqi×rqj)=hihjdqidqje^k

Infinitesimal surface

dSk=hihjdqidqj

Volume element N/A Infinitesimal volume

dV=|(h1dq1e^1)(h2dq2e^2)×(h3dq3e^3)|=|e^1e^2×e^3|h1h2h3dq1dq2dq3=h1h2h3dq1dq2dq3=Jdq1dq2dq3

where

J=|rq1(rq2×rq3)|=|(x,y,z)(q1,q2,q3)|=h1h2h3

is the Jacobian determinant, which has the geometric interpretation of the deformation in volume from the infinitesimal cube dxdydz to the infinitesimal curved volume in the orthogonal coordinates.

Integration

Using the line element shown above, the line integral along a path 𝒫 of a vector F is:

𝒫Fdr=𝒫iFieijejdqj=i𝒫Fidqi

An infinitesimal element of area for a surface described by holding one coordinate qk constant is:

dAk=ikdsi=ikhidqi

Similarly, the volume element is:

dV=idsi=ihidqi

where the large symbol Π (capital Pi) indicates a product the same way that a large Σ indicates summation. Note that the product of all the scale factors is the Jacobian determinant. As an example, the surface integral of a vector function F over a q1 = constant surface 𝒮 in 3D is:

𝒮FdA=𝒮Fn^dA=𝒮Fe^1dA=𝒮F1h2h3h1dq2dq3

Note that F1/h1 is the component of F normal to the surface.

Differential operators in three dimensions

Since these operations are common in application, all vector components in this section are presented with respect to the normalised basis: F^i=Fe^i.

Operator Expression
Gradient of a scalar field ϕ=e^1h1ϕq1+e^2h2ϕq2+e^3h3ϕq3
Divergence of a vector field F=1h1h2h3[q1(F^1h2h3)+q2(F^2h3h1)+q3(F^3h1h2)]
Curl of a vector field ×F=e^1h2h3[q2(h3F^3)q3(h2F^2)]+e^2h3h1[q3(h1F^1)q1(h3F^3)]+e^3h1h2[q1(h2F^2)q2(h1F^1)]=1h1h2h3|h1e^1h2e^2h3e^3q1q2q3h1F^1h2F^2h3F^3|
Laplacian of a scalar field 2ϕ=1h1h2h3[q1(h2h3h1ϕq1)+q2(h3h1h2ϕq2)+q3(h1h2h3ϕq3)]

The above expressions can be written in a more compact form using the Levi-Civita symbol ϵijk and the Jacobian determinant J=h1h2h3, assuming summation over repeated indices:

Operator Expression
Gradient of a scalar field ϕ=e^khkϕqk
Divergence of a vector field F=1Jqk(JhkF^k)
Curl of a vector field (3D only) ×F=hke^kJϵijkqi(hjF^j)
Laplacian of a scalar field 2ϕ=1Jqk(Jhk2ϕqk)

Also notice the gradient of a scalar field can be expressed in terms of the Jacobian matrix J containing canonical partial derivatives:

J=[ϕq1,ϕq2,ϕq3]

upon a change of basis:

ϕ=SRJT

where the rotation and scaling matrices are:

R=[e1,e2,e3]
S=diag([h11,h21,h31]).

Table of two-dimensional orthogonal coordinates

System Complex Transform

x+iy=f(u+iv)

Shape of u and v isolines Comment
Cartesian u+iv line, line
Log-polar exp(u+iv) circle, line for u=lnr becomes Polar
Parabolic 12(u+iv)2 parabola, parabola
Point dipole (u+iv)1 circle, circle
Elliptic cosh(u+iv) ellipse, hyperbola field of a needle, appears Log-polar for large distances
Bipolar coth(u+iv) circle, circle appears like point dipole for large distances
u+iv hyperbola, hyperbola field of an inner edge
u=x2+2y2,y=vx2 ellipse, parabola

Table of three-dimensional orthogonal coordinates

Besides the usual Cartesian coordinates, 13 others are tabulated below.[5] Interval notation is used for compactness in the curvilinear coordinates column, and the entries are grouped by their interval signatures, e.g. COxCCxCO for spherical coordinates, with the x in each signature indicating the Cartesian product, with a theoretical limit of 27 products. From symmetry we may conclude this is a complete listing. The entries are not sorted by their interval signatures in alphabetic order, nor are the signatures included. After the grouping of the entries by interval signature, the sort order here is alphabetic by the curvilinear coordinate system name.

Curvillinear coordinates (q1, q2, q3) Transformation from cartesian (x, y, z) Scale factors
Spherical coordinates

(r,θ,ϕ)[0,)×[0,π]×[0,2π)

x=rsinθcosϕy=rsinθsinϕz=rcosθ h1=1h2=rh3=rsinθ
Parabolic coordinates

(u,v,ϕ)[0,)×[0,)×[0,2π)

x=uvcosϕy=uvsinϕz=12(u2v2) h1=h2=u2+v2h3=uv
Bipolar cylindrical coordinates

(u,v,z)[0,2π)×(,)×(,)

x=asinhvcoshvcosuy=asinucoshvcosuz=z h1=h2=acoshvcosuh3=1
Ellipsoidal coordinates

(λ,μ,ν)[0,c2)×(c2,b2)×(b2,a2)λ<c2<b2<a2,c2<μ<b2<a2,c2<b2<ν<a2,

x2a2qi+y2b2qi+z2c2qi=1

where (q1,q2,q3)=(λ,μ,ν)

hi=12(qjqi)(qkqi)(a2qi)(b2qi)(c2qi)
Paraboloidal coordinates

(λ,μ,ν)[0,b2)×(b2,a2)×(a2,)b2<a2

x2qia2+y2qib2=2z+qi

where (q1,q2,q3)=(λ,μ,ν)

hi=12(qjqi)(qkqi)(a2qi)(b2qi)
Cylindrical polar coordinates

(r,ϕ,z)[0,)×[0,2π)×(,)

x=rcosϕy=rsinϕz=z h1=h3=1h2=r
Elliptic cylindrical coordinates

(u,v,z)[0,)×[0,2π)×(,)

x=acoshucosvy=asinhusinvz=z h1=h2=asinh2u+sin2vh3=1
Oblate spheroidal coordinates

(ξ,η,ϕ)[0,)×[π2,π2]×[0,2π)

x=acoshξcosηcosϕy=acoshξcosηsinϕz=asinhξsinη h1=h2=asinh2ξ+sin2ηh3=acoshξcosη
Prolate spheroidal coordinates

(ξ,η,ϕ)[0,)×[0,π]×[0,2π)

x=asinhξsinηcosϕy=asinhξsinηsinϕz=acoshξcosη h1=h2=asinh2ξ+sin2ηh3=asinhξsinη
Bispherical coordinates

(u,v,ϕ)(π,π]×[0,)×[0,2π)

x=asinucosϕcoshvcosuy=asinusinϕcoshvcosuz=asinhvcoshvcosu h1=h2=acoshvcosuh3=asinucoshvcosu
Toroidal coordinates

(u,v,ϕ)(π,π]×[0,)×[0,2π)

x=asinhvcosϕcoshvcosuy=asinhvsinϕcoshvcosuz=asinucoshvcosu h1=h2=acoshvcosuh3=asinhvcoshvcosu
Parabolic cylindrical coordinates

(u,v,z)(,)×[0,)×(,)

x=12(u2v2)y=uvz=z h1=h2=u2+v2h3=1
Conical coordinates

(λ,μ,ν)ν2<b2<μ2<a2λ[0,)

x=λμνaby=λa(μ2a2)(ν2a2)a2b2z=λb(μ2b2)(ν2b2)b2a2 h1=1h22=λ2(μ2ν2)(μ2a2)(b2μ2)h32=λ2(μ2ν2)(ν2a2)(ν2b2)

See also

Notes

  1. Eric W. Weisstein. "Orthogonal Coordinate System". MathWorld. Retrieved 10 July 2008.
  2. Morse and Feshbach 1953, Volume 1, pp. 494–523, 655–666.
  3. Mathematical Handbook of Formulas and Tables (3rd edition), S. Lipschutz, M.R. Spiegel, J. Liu, Schuam's Outline Series, 2009, ISBN 978-0-07-154855-7.
  4. Vector Analysis (2nd Edition), M.R. Spiegel, S. Lipschutz, D. Spellman, Schaum’s Outlines, McGraw Hill (USA), 2009, ISBN 978-0-07-161545-7
  5. Vector Analysis (2nd Edition), M.R. Spiegel, S. Lipschutz, D. Spellman, Schaum’s Outlines, McGraw Hill (USA), 2009, ISBN 978-0-07-161545-7

References

  • Korn GA and Korn TM. (1961) Mathematical Handbook for Scientists and Engineers, McGraw-Hill, pp. 164–182.
  • Morse and Feshbach (1953). Methods of Theoretical Physics, Volume 1. McGraw-Hill.
  • Margenau H. and Murphy GM. (1956) The Mathematics of Physics and Chemistry, 2nd. ed., Van Nostrand, pp. 172–192.
  • Leonid P. Lebedev and Michael J. Cloud (2003) Tensor Analysis, pp. 81 – 88.