Metric space

File:Manhattan distance.svg
The plane (a set of points) can be equipped with different metrics. In the taxicab metric the red, yellow and blue paths have the same length (12), and are all shortest paths. In the Euclidean metric, the green path has length $6\sqrt{2} \approx 8.49$, and is the unique shortest path, whereas the red, yellow, and blue paths still have length 12.

In mathematics, a metric space is a set together with a notion of distance between its elements, usually called points. The distance is measured by a function called a metric or distance function.[1] Metric spaces are the most general setting for studying many of the concepts of mathematical analysis and geometry. The most familiar example of a metric space is 3-dimensional Euclidean space with its usual notion of distance. Other well-known examples are a sphere equipped with the angular distance and the hyperbolic plane. A metric may correspond to a metaphorical, rather than physical, notion of distance: for example, the set of 100-character Unicode strings can be equipped with the Hamming distance, which measures the number of characters that need to be changed to get from one string to another. Since they are very general, metric spaces are a tool used in many different branches of mathematics. Many types of mathematical objects have a natural notion of distance and therefore admit the structure of a metric space, including Riemannian manifolds, normed vector spaces, and graphs. In abstract algebra, the p-adic numbers arise as elements of the completion of a metric structure on the rational numbers. Metric spaces are also studied in their own right in metric geometry[2] and analysis on metric spaces.[3] Many of the basic notions of mathematical analysis, including balls, completeness, as well as uniform, Lipschitz, and Hölder continuity, can be defined in the setting of metric spaces. Other notions, such as continuity, compactness, and open and closed sets, can be defined for metric spaces, but also in the even more general setting of topological spaces.

Definition and illustration

Motivation

File:Great-circle distance vs straight line distance.svg
A diagram illustrating the great-circle distance (in cyan) and the straight-line distance (in red) between two points P and Q on a sphere.

To see the utility of different notions of distance, consider the surface of the Earth as a set of points. We can measure the distance between two such points by the length of the shortest path along the surface, "as the crow flies"; this is particularly useful for shipping and aviation. We can also measure the straight-line distance between two points through the Earth's interior; this notion is, for example, natural in seismology, since it roughly corresponds to the length of time it takes for seismic waves to travel between those two points. The notion of distance encoded by the metric space axioms has relatively few requirements. This generality gives metric spaces a lot of flexibility. At the same time, the notion is strong enough to encode many intuitive facts about what distance means. This means that general results about metric spaces can be applied in many different contexts. Like many fundamental mathematical concepts, the metric on a metric space can be interpreted in many different ways. A particular metric may not be best thought of as measuring physical distance, but, instead, as the cost of changing from one state to another (as with Wasserstein metrics on spaces of measures) or the degree of difference between two objects (for example, the Hamming distance between two strings of characters, or the Gromov–Hausdorff distance between metric spaces themselves).

Definition

Formally, a metric space is an ordered pair (M, d) where M is a set and d is a metric on M, i.e., a function $d\colon M \times M \to \mathbb{R}$ satisfying the following axioms for all points $x, y, z \in M$:[4][5]

  1. The distance from a point to itself is zero: $d(x, x) = 0$.
  2. (Positivity) The distance between two distinct points is always positive: if $x \neq y$, then $d(x, y) > 0$.
  3. (Symmetry) The distance from x to y is always the same as the distance from y to x: $d(x, y) = d(y, x)$.
  4. The triangle inequality holds: $d(x, z) \leq d(x, y) + d(y, z)$. This is a natural property of both physical and metaphorical notions of distance: you can arrive at z from x by taking a detour through y, but this will not make your journey any shorter than the direct path.

If the metric d is unambiguous, one often refers by abuse of notation to "the metric space M". By taking all axioms except the second, one can show that distance is always non-negative: $0 = d(x, x) \leq d(x, y) + d(y, x) = 2 d(x, y)$. Therefore the second axiom can be weakened to "if $x \neq y$, then $d(x, y) \neq 0$" and combined with the first to make $d(x, y) = 0 \iff x = y$.[6]
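The four axioms can be tested mechanically on a finite sample of points. The following is a minimal sketch, not a proof technique: the helper name `is_metric` and the two sample functions are hypothetical, and a finite check can only refute the axioms, never establish them on an infinite set.

```python
from itertools import product

def is_metric(points, d, tol=1e-12):
    """Check the four metric axioms for d on a finite sample of points.

    Hypothetical helper for illustration; tol absorbs floating-point error.
    """
    pts = list(points)
    for x in pts:
        if d(x, x) != 0:                      # axiom 1: d(x, x) = 0
            return False
    for x, y in product(pts, repeat=2):
        if x != y and not d(x, y) > 0:        # axiom 2: positivity
            return False
        if abs(d(x, y) - d(y, x)) > tol:      # axiom 3: symmetry
            return False
    for x, y, z in product(pts, repeat=3):
        if d(x, z) > d(x, y) + d(y, z) + tol:  # axiom 4: triangle inequality
            return False
    return True

def absolute(x, y):
    return abs(y - x)

def squared(x, y):
    # Not a metric: d(0, 2) = 4 exceeds d(0, 1) + d(1, 2) = 2.
    return (y - x) ** 2

sample = [0, 1, 2, 5]
print(is_metric(sample, absolute))  # True
print(is_metric(sample, squared))   # False
```

The squared difference fails only the triangle inequality, which shows that the fourth axiom is a genuine restriction and not a consequence of the other three.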

Simple examples

The real numbers

The real numbers with the distance function $d(x, y) = |y - x|$ given by the absolute difference form a metric space. Many properties of metric spaces and functions between them are generalizations of concepts in real analysis and coincide with those concepts when applied to the real line.

Metrics on Euclidean spaces

File:Minkowski distance examples.svg
Comparison of Chebyshev, Euclidean and taxicab distances for the hypotenuse of a 3-4-5 triangle on a chessboard

The Euclidean plane $\mathbb{R}^2$ can be equipped with many different metrics. The Euclidean distance familiar from school mathematics can be defined by $d_2((x_1, y_1), (x_2, y_2)) = \sqrt{(x_2 - x_1)^2 + (y_2 - y_1)^2}$. The taxicab or Manhattan distance is defined by $d_1((x_1, y_1), (x_2, y_2)) = |x_2 - x_1| + |y_2 - y_1|$ and can be thought of as the distance you need to travel along horizontal and vertical lines to get from one point to the other, as illustrated at the top of the article. The maximum, $L^\infty$, or Chebyshev distance is defined by $d_\infty((x_1, y_1), (x_2, y_2)) = \max\{|x_2 - x_1|, |y_2 - y_1|\}$. This distance does not have an easy explanation in terms of paths in the plane, but it still satisfies the metric space axioms. It can be thought of as the number of moves a king would have to make on a chessboard to travel from one square to another. In fact, these three distances, while they have distinct properties, are similar in some ways. Informally, points that are close in one are close in the others, too. This observation can be quantified with the formula $d_\infty(p, q) \leq d_2(p, q) \leq d_1(p, q) \leq 2 d_\infty(p, q)$, which holds for every pair of points $p, q \in \mathbb{R}^2$. A radically different distance can be defined by setting $d(p, q) = \begin{cases} 0, & \text{if } p = q, \\ 1, & \text{otherwise.} \end{cases}$ Using Iverson brackets, $d(p, q) = [p \neq q]$. In this discrete metric, all distinct points are 1 unit apart: none of them are close to each other, and none of them are very far away from each other either. Intuitively, the discrete metric no longer remembers that the set is a plane, but treats it just as an undifferentiated set of points. All of these metrics make sense on $\mathbb{R}^n$ as well as $\mathbb{R}^2$.
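As a small illustration, the four distances on the plane can be written out directly. This is a sketch; the function names `d1`, `d2`, `dinf` and `discrete` are chosen here for readability and are not part of any library. The sample points are the endpoints of the hypotenuse of a 3-4-5 triangle.

```python
import math

def d1(p, q):
    """Taxicab (Manhattan) distance on the plane."""
    return abs(q[0] - p[0]) + abs(q[1] - p[1])

def d2(p, q):
    """Euclidean distance on the plane."""
    return math.hypot(q[0] - p[0], q[1] - p[1])

def dinf(p, q):
    """Chebyshev (maximum) distance on the plane."""
    return max(abs(q[0] - p[0]), abs(q[1] - p[1]))

def discrete(p, q):
    """Discrete metric: 0 for equal points, 1 otherwise."""
    return int(p != q)

p, q = (0, 0), (3, 4)
print(d1(p, q), d2(p, q), dinf(p, q), discrete(p, q))  # 7 5.0 4 1

# The chain of inequalities comparing the three norm-induced distances.
print(dinf(p, q) <= d2(p, q) <= d1(p, q) <= 2 * dinf(p, q))  # True
```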

Subspaces

Given a metric space (M, d) and a subset $A \subseteq M$, we can consider A to be a metric space by measuring distances the same way we would in M. Formally, the induced metric on A is a function $d_A\colon A \times A \to \mathbb{R}$ defined by $d_A(x, y) = d(x, y)$. For example, if we take the two-dimensional sphere $S^2$ as a subset of $\mathbb{R}^3$, the Euclidean metric on $\mathbb{R}^3$ induces the straight-line metric on $S^2$ described above. Two more useful examples are the open interval (0, 1) and the closed interval [0, 1] thought of as subspaces of the real line.

History

Arthur Cayley, in his article "On Distance", extended metric concepts beyond Euclidean geometry into domains bounded by a conic in a projective space. His distance was given by the logarithm of a cross ratio. Any projectivity leaving the conic stable also leaves the cross ratio constant, so isometries are implicit. This method provides models for elliptic geometry and hyperbolic geometry, and Felix Klein, in several publications, established the field of non-Euclidean geometry through the use of the Cayley–Klein metric. The idea of an abstract space with metric properties was addressed in 1906 by René Maurice Fréchet[7] and the term metric space was coined by Felix Hausdorff in 1914.[8][9][10] Fréchet's work laid the foundation for understanding convergence, continuity, and other key concepts in non-geometric spaces. This allowed mathematicians to study functions and sequences in a broader and more flexible way, which was important for the growing field of functional analysis. Mathematicians like Hausdorff and Stefan Banach further refined and expanded the framework of metric spaces. Hausdorff introduced topological spaces as a generalization of metric spaces. Banach's work in functional analysis relied heavily on the metric structure. Over time, metric spaces became a central part of modern mathematics. They have influenced various fields including topology, geometry, and applied mathematics, and continue to play a crucial role in the study of abstract mathematical concepts.

Basic notions

A distance function is enough to define notions of closeness and convergence that were first developed in real analysis. Properties that depend on the structure of a metric space are referred to as metric properties. Every metric space is also a topological space, and some metric properties can also be rephrased without reference to distance in the language of topology; that is, they are really topological properties.

The topology of a metric space

For any point x in a metric space M and any real number $r > 0$, the open ball of radius r around x is defined to be the set of points that are strictly less than distance r from x: $B_r(x) = \{y \in M : d(x, y) < r\}$. This is a natural way to define a set of points that are relatively close to x. Therefore, a set $N \subseteq M$ is a neighborhood of x (informally, it contains all points "close enough" to x) if it contains an open ball of radius r around x for some $r > 0$. An open set is a set which is a neighborhood of all its points. It follows that the open balls form a base for a topology on M. In other words, the open sets of M are exactly the unions of open balls. As in any topology, closed sets are the complements of open sets. Sets may be both open and closed as well as neither open nor closed. This topology does not carry all the information about the metric space. For example, the distances $d_1$, $d_2$, and $d_\infty$ defined above all induce the same topology on $\mathbb{R}^2$, although they behave differently in many respects. Similarly, $\mathbb{R}$ with the Euclidean metric and its subspace the interval (0, 1) with the induced metric are homeomorphic but have very different metric properties. Conversely, not every topological space can be given a metric. Topological spaces which are compatible with a metric are called metrizable and are particularly well-behaved in many ways: in particular, they are paracompact[11] Hausdorff spaces (hence normal) and first-countable.[lower-alpha 1] The Nagata–Smirnov metrization theorem gives a characterization of metrizability in terms of other topological properties, without reference to metrics.

Convergence

Convergence of sequences in Euclidean space is defined as follows:

A sequence $(x_n)$ converges to a point x if for every $\varepsilon > 0$ there is an integer N such that for all $n > N$, $d(x_n, x) < \varepsilon$.

Convergence of sequences in a topological space is defined as follows:

A sequence $(x_n)$ converges to a point x if for every open set U containing x there is an integer N such that for all $n > N$, $x_n \in U$.

In metric spaces, both of these definitions make sense and they are equivalent. This is a general pattern for topological properties of metric spaces: while they can be defined in a purely topological way, there is often a way that uses the metric which is easier to state or more familiar from real analysis.

Completeness

Informally, a metric space is complete if it has no "missing points": every sequence that looks like it should converge to something actually converges. To make this precise: a sequence $(x_n)$ in a metric space M is Cauchy if for every $\varepsilon > 0$ there is an integer N such that for all $m, n > N$, $d(x_m, x_n) < \varepsilon$. By the triangle inequality, any convergent sequence is Cauchy: if $x_m$ and $x_n$ are both less than $\varepsilon$ away from the limit, then they are less than $2\varepsilon$ away from each other. If the converse is true, that is, if every Cauchy sequence in M converges, then M is complete. Euclidean spaces are complete, as is $\mathbb{R}^2$ with the other metrics described above. Two examples of spaces which are not complete are (0, 1) and the rationals, each with the metric induced from $\mathbb{R}$. One can think of (0, 1) as "missing" its endpoints 0 and 1. The rationals are missing all the irrationals, since any irrational has a sequence of rationals converging to it in $\mathbb{R}$ (for example, its successive decimal approximations). These examples show that completeness is not a topological property, since $\mathbb{R}$ is complete but the homeomorphic space (0, 1) is not. This notion of "missing points" can be made precise. In fact, every metric space has a unique completion, which is a complete space that contains the given space as a dense subset. For example, [0, 1] is the completion of (0, 1), and the real numbers are the completion of the rationals. Since complete spaces are generally easier to work with, completions are important throughout mathematics. For example, in abstract algebra, the p-adic numbers are defined as the completion of the rationals under a different metric. Completion is particularly common as a tool in functional analysis. Often one has a set of nice functions and a way of measuring distances between them. Taking the completion of this metric space gives a new set of functions which may be less nice, but nevertheless useful because they behave similarly to the original nice functions in important ways. 
For example, weak solutions to differential equations typically live in a completion (a Sobolev space) rather than the original space of nice functions for which the differential equation actually makes sense.
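The incompleteness of the rationals can be illustrated with exact rational arithmetic. The sketch below uses Newton's iteration for the square root of 2 as one convenient choice of sequence (an assumption of this example, with the hypothetical name `newton_sqrt2`): every term is rational and successive terms get rapidly closer together, yet no term squares to 2, so the limit is missing from the rationals.

```python
from fractions import Fraction

def newton_sqrt2(steps):
    """Return the first steps + 1 terms of x -> (x + 2/x) / 2, starting at 1."""
    x = Fraction(1)
    terms = [x]
    for _ in range(steps):
        x = (x + 2 / x) / 2   # stays an exact rational number
        terms.append(x)
    return terms

terms = newton_sqrt2(5)

# Distances between consecutive terms, measured with d(x, y) = |y - x|.
gaps = [abs(b - a) for a, b in zip(terms, terms[1:])]

print(terms[:4])                          # 1, 3/2, 17/12, 577/408 as Fractions
print([float(g) for g in gaps])           # rapidly shrinking gaps
print(any(t * t == 2 for t in terms))     # False: no term is a square root of 2
```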

Bounded and totally bounded spaces

File:Diameter of a Set.svg
Diameter of a set.

A metric space M is bounded if there is an r such that no pair of points in M is more than distance r apart.[lower-alpha 2] The least such r is called the diameter of M. The space M is called precompact or totally bounded if for every $r > 0$ there is a finite cover of M by open balls of radius r. Every totally bounded space is bounded. To see this, start with a finite cover by r-balls for some arbitrary r. Since the subset of M consisting of the centers of these balls is finite, it has finite diameter, say D. By the triangle inequality, the diameter of the whole space is at most $D + 2r$. The converse does not hold: an example of a metric space that is bounded but not totally bounded is $\mathbb{R}^2$ (or any other infinite set) with the discrete metric.

Compactness

Compactness is a topological property which generalizes the properties of a closed and bounded subset of Euclidean space. There are several equivalent definitions of compactness in metric spaces:

  1. A metric space M is compact if every open cover has a finite subcover (the usual topological definition).
  2. A metric space M is compact if every sequence has a convergent subsequence. (For general topological spaces this is called sequential compactness and is not equivalent to compactness.)
  3. A metric space M is compact if it is complete and totally bounded. (This definition is written in terms of metric properties and does not make sense for a general topological space, but it is nevertheless topologically invariant since it is equivalent to compactness.)

One example of a compact space is the closed interval [0, 1]. Compactness is important for similar reasons to completeness: it makes it easy to find limits. Another important tool is Lebesgue's number lemma, which shows that for any open cover of a compact space, every point is relatively deep inside one of the sets of the cover.

Functions between metric spaces

File:Functions between metric spaces.svg
Euler diagram of types of functions between metric spaces.

Unlike in the case of topological spaces or algebraic structures such as groups or rings, there is no single "right" type of structure-preserving function between metric spaces. Instead, one works with different types of functions depending on one's goals. Throughout this section, suppose that $(M_1, d_1)$ and $(M_2, d_2)$ are two metric spaces. The words "function" and "map" are used interchangeably.

Isometries

One interpretation of a "structure-preserving" map is one that fully preserves the distance function:

A function $f\colon M_1 \to M_2$ is distance-preserving[12] if for every pair of points x and y in $M_1$, $d_2(f(x), f(y)) = d_1(x, y)$.

It follows from the metric space axioms that a distance-preserving function is injective. A bijective distance-preserving function is called an isometry.[13] One perhaps non-obvious example of an isometry between spaces described in this article is the map $f\colon (\mathbb{R}^2, d_1) \to (\mathbb{R}^2, d_\infty)$ defined by $f(x, y) = (x + y, x - y)$. If there is an isometry between the spaces $M_1$ and $M_2$, they are said to be isometric. Metric spaces that are isometric are essentially identical.
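The isometry between the taxicab and Chebyshev planes can be spot-checked numerically. The following sketch (with illustrative function names) tests the distance-preserving identity on random integer points, where the arithmetic is exact; it is a sanity check under these assumptions and does not replace the one-line proof that $\max\{|a + b|, |a - b|\} = |a| + |b|$.

```python
import random

def d1(p, q):
    """Taxicab distance on the plane."""
    return abs(q[0] - p[0]) + abs(q[1] - p[1])

def dinf(p, q):
    """Chebyshev distance on the plane."""
    return max(abs(q[0] - p[0]), abs(q[1] - p[1]))

def f(p):
    """The map (x, y) -> (x + y, x - y)."""
    x, y = p
    return (x + y, x - y)

random.seed(0)  # fixed seed so the sample is reproducible
points = [(random.randint(-50, 50), random.randint(-50, 50)) for _ in range(200)]

# Distance-preserving: the Chebyshev distance between images equals
# the taxicab distance between the original points.
preserved = all(dinf(f(p), f(q)) == d1(p, q) for p in points for q in points)
print(preserved)  # True
```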

Continuous maps

On the other end of the spectrum, one can forget entirely about the metric structure and study continuous maps, which only preserve topological structure. There are several equivalent definitions of continuity for metric spaces. The most important are:

  • Topological definition. A function $f\colon M_1 \to M_2$ is continuous if for every open set U in $M_2$, the preimage $f^{-1}(U)$ is open.
  • Sequential continuity. A function $f\colon M_1 \to M_2$ is continuous if whenever a sequence $(x_n)$ converges to a point x in $M_1$, the sequence $f(x_1), f(x_2), \ldots$ converges to the point $f(x)$ in $M_2$.
(These first two definitions are not equivalent for all topological spaces.)
  • ε–δ definition. A function $f\colon M_1 \to M_2$ is continuous if for every point x in $M_1$ and every $\varepsilon > 0$ there exists $\delta > 0$ such that for all y in $M_1$ we have $d_1(x, y) < \delta \implies d_2(f(x), f(y)) < \varepsilon$.

A homeomorphism is a continuous bijection whose inverse is also continuous; if there is a homeomorphism between $M_1$ and $M_2$, they are said to be homeomorphic. Homeomorphic spaces are the same from the point of view of topology, but may have very different metric properties. For example, $\mathbb{R}$ is unbounded and complete, while (0, 1) is bounded but not complete.

Uniformly continuous maps

A function $f\colon M_1 \to M_2$ is uniformly continuous if for every real number $\varepsilon > 0$ there exists $\delta > 0$ such that for all points x and y in $M_1$ such that $d_1(x, y) < \delta$, we have $d_2(f(x), f(y)) < \varepsilon$. The only difference between this definition and the ε–δ definition of continuity is the order of quantifiers: the choice of $\delta$ must depend only on $\varepsilon$ and not on the point x. However, this subtle change makes a big difference. For example, uniformly continuous maps take Cauchy sequences in $M_1$ to Cauchy sequences in $M_2$. In other words, uniform continuity preserves some metric properties which are not purely topological. On the other hand, the Heine–Cantor theorem states that if $M_1$ is compact, then every continuous map is uniformly continuous. In other words, uniform continuity cannot distinguish any non-topological features of compact metric spaces.

Lipschitz maps and contractions

A Lipschitz map is one that stretches distances by at most a bounded factor. Formally, given a real number $K > 0$, the map $f\colon M_1 \to M_2$ is K-Lipschitz if $d_2(f(x), f(y)) \leq K d_1(x, y)$ for all $x, y \in M_1$. Lipschitz maps are particularly important in metric geometry, since they provide more flexibility than distance-preserving maps, but still make essential use of the metric.[14] For example, a curve in a metric space is rectifiable (has finite length) if and only if it has a Lipschitz reparametrization. A 1-Lipschitz map is sometimes called a nonexpanding or metric map. Metric maps are commonly taken to be the morphisms of the category of metric spaces. A K-Lipschitz map for $K < 1$ is called a contraction. The Banach fixed-point theorem states that if M is a complete metric space, then every contraction $f\colon M \to M$ admits a unique fixed point. If the metric space M is compact, the result holds for a slightly weaker condition on f: a map $f\colon M \to M$ admits a unique fixed point if $d(f(x), f(y)) < d(x, y)$ for all $x \neq y$ in M.
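The Banach fixed-point theorem is constructive: iterating a contraction from any starting point converges to the fixed point. The sketch below is an illustration under stated assumptions, not a general-purpose solver. It uses the cosine function on the complete space [0, 1], which maps [0, 1] into itself and is K-Lipschitz there with $K = \sin 1 < 1$; the helper name `fixed_point` and its tolerance parameters are hypothetical.

```python
import math

def fixed_point(f, x0, tol=1e-12, max_iter=10_000):
    """Iterate x -> f(x) until successive iterates differ by less than tol."""
    x = x0
    for _ in range(max_iter):
        nxt = f(x)
        if abs(nxt - x) < tol:
            return nxt
        x = nxt
    raise RuntimeError("iteration did not converge within max_iter steps")

# Two different starting points in [0, 1] lead to the same fixed point,
# as the uniqueness part of the theorem predicts.
a = fixed_point(math.cos, 0.0)
b = fixed_point(math.cos, 1.0)
print(a, b)
print(abs(a - math.cos(a)) < 1e-11)  # True: a is (numerically) fixed by cos
```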

Quasi-isometries

A quasi-isometry is a map that preserves the "large-scale structure" of a metric space. Quasi-isometries need not be continuous. For example, $\mathbb{R}^2$ and its subspace $\mathbb{Z}^2$ are quasi-isometric, even though one is connected and the other is discrete. The equivalence relation of quasi-isometry is important in geometric group theory: the Švarc–Milnor lemma states that all spaces on which a group acts geometrically are quasi-isometric.[15] Formally, the map $f\colon M_1 \to M_2$ is a quasi-isometric embedding if there exist constants $A \geq 1$ and $B \geq 0$ such that $\frac{1}{A} d_2(f(x), f(y)) - B \leq d_1(x, y) \leq A d_2(f(x), f(y)) + B$ for all $x, y \in M_1$. It is a quasi-isometry if in addition it is quasi-surjective, i.e. there is a constant $C \geq 0$ such that every point in $M_2$ is at distance at most C from some point in the image $f(M_1)$.
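One concrete quasi-isometry between the plane and the integer lattice is the coordinatewise floor map, which is discontinuous but moves every point by less than $\sqrt{2}$ in the Euclidean metric. The sketch below checks the defining inequality on random samples with the constants $A = 1$ and $B = 2\sqrt{2}$; these constants and the names `floor_map` and `satisfies_bounds` are assumptions of this example. The map is onto the lattice, so it is quasi-surjective with $C = 0$.

```python
import math
import random

def euclid(p, q):
    """Euclidean distance on the plane."""
    return math.hypot(q[0] - p[0], q[1] - p[1])

def floor_map(p):
    """Send a point of the plane to the lattice point below and to its left."""
    return (math.floor(p[0]), math.floor(p[1]))

A = 1.0
B = 2 * math.sqrt(2)  # each point moves by less than sqrt(2)

def satisfies_bounds(p, q):
    """Check (1/A) d2(f(p), f(q)) - B <= d1(p, q) <= A d2(f(p), f(q)) + B."""
    image_dist = euclid(floor_map(p), floor_map(q))
    return image_dist / A - B <= euclid(p, q) <= A * image_dist + B

random.seed(1)  # fixed seed so the sample is reproducible
points = [(random.uniform(-100, 100), random.uniform(-100, 100)) for _ in range(300)]
print(all(satisfies_bounds(p, q) for p in points for q in points))  # True
```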

Notions of metric space equivalence

Given two metric spaces $(M_1, d_1)$ and $(M_2, d_2)$:

  • They are called homeomorphic (topologically isomorphic) if there is a homeomorphism between them (i.e., a continuous bijection with a continuous inverse). If $M_1 = M_2$ and the identity map is a homeomorphism, then $d_1$ and $d_2$ are said to be topologically equivalent.
  • They are called uniformic (uniformly isomorphic) if there is a uniform isomorphism between them (i.e., a uniformly continuous bijection with a uniformly continuous inverse).
  • They are called bilipschitz homeomorphic if there is a bilipschitz bijection between them (i.e., a Lipschitz bijection with a Lipschitz inverse).
  • They are called isometric if there is a (bijective) isometry between them. In this case, the two metric spaces are essentially identical.
  • They are called quasi-isometric if there is a quasi-isometry between them.

Metric spaces with additional structure

Normed vector spaces

A normed vector space is a vector space equipped with a norm, which is a function that measures the length of vectors. The norm of a vector v is typically denoted by $\lVert v \rVert$. Any normed vector space can be equipped with a metric in which the distance between two vectors x and y is given by $d(x, y) := \lVert x - y \rVert$. The metric d is said to be induced by the norm $\lVert \cdot \rVert$. Conversely,[16] if a metric d on a vector space X is

  • translation invariant: $d(x, y) = d(x + a, y + a)$ for every x, y, and a in X; and
  • absolutely homogeneous: $d(\alpha x, \alpha y) = |\alpha| d(x, y)$ for every x and y in X and real number $\alpha$;

then it is the metric induced by the norm $\lVert x \rVert := d(x, 0)$. A similar relationship holds between seminorms and pseudometrics. Among examples of metrics induced by a norm are the metrics $d_1$, $d_2$, and $d_\infty$ on $\mathbb{R}^2$, which are induced by the Manhattan norm, the Euclidean norm, and the maximum norm, respectively. More generally, the Kuratowski embedding allows one to see any metric space as a subspace of a normed vector space. Infinite-dimensional normed vector spaces, particularly spaces of functions, are studied in functional analysis. Completeness is particularly important in this context: a complete normed vector space is known as a Banach space. An unusual property of normed vector spaces is that linear transformations between them are continuous if and only if they are Lipschitz. Such transformations are known as bounded operators.

Length spaces

File:Approximate arc length.svg
One possible approximation for the arc length of a curve. The approximation is never longer than the arc length, justifying the definition of arc length as a supremum.

A curve in a metric space (M, d) is a continuous function $\gamma\colon [0, T] \to M$. The length of $\gamma$ is measured by $L(\gamma) = \sup_{0 = x_0 < x_1 < \cdots < x_n = T} \left\{ \sum_{k=1}^{n} d(\gamma(x_{k-1}), \gamma(x_k)) \right\}$. In general, this supremum may be infinite; a curve of finite length is called rectifiable.[17] Suppose that the length of the curve $\gamma$ is equal to the distance between its endpoints, so that it is the shortest possible path between its endpoints. After reparametrization by arc length, $\gamma$ becomes a geodesic: a curve which is a distance-preserving function.[15] A geodesic is a shortest possible path between any two of its points.[lower-alpha 3] A geodesic metric space is a metric space which admits a geodesic between any two of its points. The spaces $(\mathbb{R}^2, d_1)$ and $(\mathbb{R}^2, d_2)$ are both geodesic metric spaces. In $(\mathbb{R}^2, d_2)$, geodesics are unique, but in $(\mathbb{R}^2, d_1)$, there are often infinitely many geodesics between two points, as shown in the figure at the top of the article. The space M is a length space (or the metric d is intrinsic) if the distance between any two points x and y is the infimum of lengths of paths between them. Unlike in a geodesic metric space, the infimum does not have to be attained. An example of a length space which is not geodesic is the Euclidean plane minus the origin: the points (1, 0) and (-1, 0) can be joined by paths of length arbitrarily close to 2, but not by a path of length 2. An example of a metric space which is not a length space is given by the straight-line metric on the sphere: the straight line between two points through the center of the Earth is shorter than any path along the surface. Given any metric space (M, d), one can define a new, intrinsic distance function $d_{\text{intrinsic}}$ on M by setting the distance between points x and y to be the infimum of the d-lengths of paths between them. For instance, if d is the straight-line distance on the sphere, then $d_{\text{intrinsic}}$ is the great-circle distance. However, in some cases $d_{\text{intrinsic}}$ may have infinite values.
For example, if M is the Koch snowflake with the subspace metric d induced from $\mathbb{R}^2$, then the resulting intrinsic distance is infinite for any pair of distinct points.
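The supremum in the definition of length can be approached from below by polygonal sums over finer and finer partitions. The sketch below, with the hypothetical helper `polygonal_length`, does this for the upper half of the unit circle in the Euclidean plane, whose length is $\pi$. Uniform partitions with a doubling number of pieces are one convenient choice: each refines the previous one, so by the triangle inequality the sums cannot decrease.

```python
import math

def polygonal_length(gamma, T, n, d):
    """Sum of d-distances between consecutive points of gamma on a uniform
    partition of [0, T] into n pieces (a lower bound for the length)."""
    ts = [T * k / n for k in range(n + 1)]
    return sum(d(gamma(ts[k - 1]), gamma(ts[k])) for k in range(1, n + 1))

def euclid(p, q):
    """Euclidean distance on the plane."""
    return math.hypot(q[0] - p[0], q[1] - p[1])

def half_circle(t):
    """Upper unit semicircle, parametrized by angle t in [0, pi]."""
    return (math.cos(t), math.sin(t))

sizes = (1, 2, 4, 8, 1024)
approx = [polygonal_length(half_circle, math.pi, n, euclid) for n in sizes]
for n, length in zip(sizes, approx):
    print(n, length)  # increases toward pi without exceeding it
```

With a single piece the sum is just the straight-line distance 2 between the endpoints, which matches the observation that the chord is shorter than any path along the circle.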

Riemannian manifolds

A Riemannian manifold is a space equipped with a Riemannian metric tensor, which determines lengths of tangent vectors at every point. This can be thought of as defining a notion of distance infinitesimally. In particular, a differentiable path $\gamma\colon [0, T] \to M$ in a Riemannian manifold M has length defined as the integral of the length of the tangent vector to the path: $L(\gamma) = \int_0^T |\dot{\gamma}(t)| \, dt$. On a connected Riemannian manifold, one then defines the distance between two points as the infimum of lengths of smooth paths between them. This construction generalizes to other kinds of infinitesimal metrics on manifolds, such as sub-Riemannian and Finsler metrics. The Riemannian metric is uniquely determined by the distance function; this means that in principle, all information about a Riemannian manifold can be recovered from its distance function. One direction in metric geometry is finding purely metric ("synthetic") formulations of properties of Riemannian manifolds. For example, a Riemannian manifold is a CAT(k) space (a synthetic condition which depends purely on the metric) if and only if its sectional curvature is bounded above by k.[20] Thus CAT(k) spaces generalize upper curvature bounds to general metric spaces.

Metric measure spaces

Real analysis makes use of both the metric on $\mathbb{R}^n$ and the Lebesgue measure. Therefore, generalizations of many ideas from analysis naturally reside in metric measure spaces: spaces that have both a measure and a metric which are compatible with each other. Formally, a metric measure space is a metric space equipped with a Borel regular measure such that every ball has positive measure.[21] For example, Euclidean spaces of dimension n, and more generally n-dimensional Riemannian manifolds, naturally have the structure of a metric measure space, equipped with the Lebesgue measure. Certain fractal metric spaces such as the Sierpiński gasket can be equipped with the α-dimensional Hausdorff measure where α is the Hausdorff dimension. In general, however, a metric space may not have an "obvious" choice of measure. One application of metric measure spaces is generalizing the notion of Ricci curvature beyond Riemannian manifolds. Just as CAT(k) and Alexandrov spaces generalize sectional curvature bounds, RCD spaces are a class of metric measure spaces which generalize lower bounds on Ricci curvature.[22]

Further examples and applications

Graphs and finite metric spaces

A metric space is discrete if its induced topology is the discrete topology. Although many concepts, such as completeness and compactness, are not interesting for such spaces, they are nevertheless an object of study in several branches of mathematics. In particular, finite metric spaces (those having a finite number of points) are studied in combinatorics and theoretical computer science.[23] Embeddings in other metric spaces are particularly well-studied. For example, not every finite metric space can be isometrically embedded in a Euclidean space or in Hilbert space. On the other hand, in the worst case the required distortion (bilipschitz constant) is only logarithmic in the number of points.[24][25] For any undirected connected graph G, the set V of vertices of G can be turned into a metric space by defining the distance between vertices x and y to be the length of the shortest edge path connecting them. This is also called shortest-path distance or geodesic distance. In geometric group theory this construction is applied to the Cayley graph of a (typically infinite) finitely-generated group, yielding the word metric. Up to a bilipschitz homeomorphism, the word metric depends only on the group and not on the chosen finite generating set.[15]
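For an unweighted graph, the shortest-path distance from a given vertex can be computed with a breadth-first search. The sketch below assumes the graph is given as an adjacency dictionary (a representation chosen for this example) and uses a 6-cycle as the sample graph; the function name `graph_distances` is hypothetical.

```python
from collections import deque

def graph_distances(adj, source):
    """Breadth-first search: number of edges on a shortest path from source
    to every vertex reachable from it."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in dist:          # first visit is along a shortest path
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist

# A cycle on 6 vertices: vertex i is joined to i - 1 and i + 1 modulo 6.
cycle = {i: [(i - 1) % 6, (i + 1) % 6] for i in range(6)}
d = {u: graph_distances(cycle, u) for u in cycle}

print(d[0][3], d[0][5])  # 3 1
```

Because the graph is undirected and connected, the resulting table is symmetric and defined for every pair of vertices, and concatenating paths gives the triangle inequality.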

Metric embeddings and approximations

An important area of study in finite metric spaces is the embedding of complex metric spaces into simpler ones while controlling the distortion of distances. This is particularly useful in computer science and discrete mathematics, where algorithms often perform more efficiently on simpler structures like tree metrics. A significant result in this area is that any finite metric space can be probabilistically embedded into a tree metric with an expected distortion of $O(\log n)$, where n is the number of points in the metric space.[26] This embedding is notable because it achieves the best possible asymptotic bound on distortion, matching the lower bound of $\Omega(\log n)$. The tree metrics produced in this embedding dominate the original metrics, meaning that distances in the tree are greater than or equal to those in the original space. This property is particularly useful for designing approximation algorithms, as it allows for the preservation of distance-related properties while simplifying the underlying structure. The result has significant implications for various computational problems:

  • Network design: Improves approximation algorithms for problems like the Group Steiner tree problem (a generalization of the Steiner tree problem) and Buy-at-bulk network design (a problem in Network planning and design) by simplifying the metric space to a tree metric.
  • Clustering: Enhances algorithms for clustering problems where hierarchical clustering can be performed more efficiently on tree metrics.
  • Online algorithms: Benefits problems like the k-server problem and metrical task system by providing better competitive ratios through simplified metrics.

The technique involves constructing a hierarchical decomposition of the original metric space and converting it into a tree metric via a randomized algorithm. The $O(\log n)$ distortion bound has led to improved approximation ratios in several algorithmic problems, demonstrating the practical significance of this theoretical result.

Distances between mathematical objects

In modern mathematics, one often studies spaces whose points are themselves mathematical objects. A distance function on such a space generally aims to measure the dissimilarity between two objects. Here are some examples:

Hausdorff and Gromov–Hausdorff distance

The idea of spaces of mathematical objects can also be applied to subsets of a metric space, as well as metric spaces themselves. Hausdorff and Gromov–Hausdorff distance define metrics on the set of compact subsets of a metric space and the set of compact metric spaces, respectively. Suppose (M, d) is a metric space, and let S be a subset of M. The distance from S to a point x of M is, informally, the distance from x to the closest point of S. However, since there may not be a single closest point, it is defined via an infimum: $d(x, S) = \inf\{d(x, s) : s \in S\}$. In particular, $d(x, S) = 0$ if and only if x belongs to the closure of S. Furthermore, distances between points and sets satisfy a version of the triangle inequality: $d(x, S) \leq d(x, y) + d(y, S)$, and therefore the map $d_S\colon M \to \mathbb{R}$ defined by $d_S(x) = d(x, S)$ is continuous. Incidentally, this shows that metric spaces are completely regular. Given two subsets S and T of M, their Hausdorff distance is $d_H(S, T) = \max\{\sup\{d(s, T) : s \in S\}, \sup\{d(t, S) : t \in T\}\}$. Informally, two sets S and T are close to each other in the Hausdorff distance if no element of S is too far from T and vice versa. For example, if S is an open set in Euclidean space and T is an ε-net inside S, then $d_H(S, T) < \varepsilon$. In general, the Hausdorff distance $d_H(S, T)$ can be infinite or zero. However, the Hausdorff distance between two distinct compact sets is always positive and finite. Thus the Hausdorff distance defines a metric on the set of compact subsets of M. The Gromov–Hausdorff metric defines a distance between (isometry classes of) compact metric spaces. The Gromov–Hausdorff distance between compact spaces X and Y is the infimum of the Hausdorff distance over all metric spaces Z that contain X and Y as subspaces. While the exact value of the Gromov–Hausdorff distance is rarely useful to know, the resulting topology has found many applications.
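For finite sets the infimum and suprema in the definition are attained, so the Hausdorff distance reduces to nested minima and maxima. The sketch below is a direct transcription under that finiteness assumption; the names `point_to_set` and `hausdorff` are illustrative, and the quadratic running time is acceptable only for small inputs.

```python
import math

def euclid(p, q):
    """Euclidean distance on the plane."""
    return math.hypot(q[0] - p[0], q[1] - p[1])

def point_to_set(x, S, d):
    """d(x, S): distance from the point x to the nearest point of the finite set S."""
    return min(d(x, s) for s in S)

def hausdorff(S, T, d):
    """Hausdorff distance between two finite, non-empty sets S and T."""
    farthest_s = max(point_to_set(s, T, d) for s in S)  # sup of d(s, T)
    farthest_t = max(point_to_set(t, S, d) for t in T)  # sup of d(t, S)
    return max(farthest_s, farthest_t)

S = [(0, 0), (1, 0)]
T = [(0, 0), (1, 0), (4, 0)]
print(hausdorff(S, T, euclid))  # 3.0
print(hausdorff(S, S, euclid))  # 0.0
```

Every point of S lies in T, so the first supremum is 0; the distance is driven entirely by the point (4, 0) of T, which is 3 away from the nearest point of S. This is why both one-sided terms are needed.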

Miscellaneous examples

  • Given a metric space (X, d) and an increasing concave function $f\colon [0,\infty)\to[0,\infty)$ such that f(t) = 0 if and only if t = 0, the function $d_f(x,y)=f(d(x,y))$ is also a metric on X. If $f(t)=t^{\alpha}$ for some real number $0<\alpha<1$, such a metric is known as a snowflake of d.[28]
  • The tight span of a metric space is another metric space which can be thought of as an abstract version of the convex hull.
  • The knight's move metric, the minimal number of knight's moves to reach one point in $\mathbb{Z}^2$ from another, is a metric on $\mathbb{Z}^2$.
  • The British Rail metric (also called the "post office metric" or the "SNCF metric") on a normed vector space is given by $d(x,y)=\lVert x\rVert+\lVert y\rVert$ for distinct points x and y, and $d(x,x)=0$. More generally, $\lVert\cdot\rVert$ can be replaced with a function f taking an arbitrary set S to the non-negative reals and taking the value 0 at most once: then the metric is defined on S by $d(x,y)=f(x)+f(y)$ for distinct points x and y, and $d(x,x)=0$. The name alludes to the tendency of railway journeys to proceed via London (or Paris) irrespective of their final destination.
  • The Robinson–Foulds metric is used for calculating distances between phylogenetic trees in phylogenetics.[29]
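
Two of the constructions in this list, the snowflake of a metric and the British Rail metric, are easy to experiment with numerically. The sketch below spot-checks the triangle inequality on a handful of sample points in the plane; it is an illustration rather than a proof, and the function names and sample points are assumptions made for this example.

```python
import itertools
import math

def snowflake(d, alpha):
    """Return the snowflake metric d**alpha; intended for 0 < alpha < 1."""
    return lambda x, y: d(x, y) ** alpha

def british_rail(norm):
    """Return d(x, y) = ||x|| + ||y|| for distinct x and y, and 0 when x == y."""
    return lambda x, y: 0.0 if x == y else norm(x) + norm(y)

def satisfies_triangle(d, points, tol=1e-12):
    """Check d(x, z) <= d(x, y) + d(y, z) on every triple of sample points."""
    return all(
        d(x, z) <= d(x, y) + d(y, z) + tol
        for x, y, z in itertools.product(points, repeat=3)
    )

def euclid(p, q):
    return math.dist(p, q)

def norm(p):
    return math.hypot(*p)

points = [(0.0, 0.0), (1.0, 0.0), (0.0, 2.0), (-3.0, 1.0), (2.5, 2.5)]
print(satisfies_triangle(snowflake(euclid, 0.5), points))
print(satisfies_triangle(british_rail(norm), points))
```

Taking an exponent larger than 1 generally breaks the triangle inequality: with squared Euclidean distance, the collinear points (0, 0), (1, 0), (2, 0) give 4 on the left and 1 + 1 on the right.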

Constructions

Product metric spaces

If $(M_1,d_1),\ldots,(M_n,d_n)$ are metric spaces, and N is the Euclidean norm on $\mathbb{R}^n$, then $(M_1\times\cdots\times M_n,d_\times)$ is a metric space, where the product metric is defined by $d_\times((x_1,\ldots,x_n),(y_1,\ldots,y_n))=N(d_1(x_1,y_1),\ldots,d_n(x_n,y_n))$, and the induced topology agrees with the product topology. By the equivalence of norms in finite dimensions, a topologically equivalent metric is obtained if N is the taxicab norm, a p-norm, the maximum norm, or any other norm which is non-decreasing as the coordinates of a positive n-tuple increase (yielding the triangle inequality). Similarly, a metric on the topological product of countably many metric spaces can be obtained using the metric $d(x,y)=\sum_{i=1}^{\infty}\frac{1}{2^i}\,\frac{d_i(x_i,y_i)}{1+d_i(x_i,y_i)}$. The topological product of uncountably many metric spaces need not be metrizable. For example, an uncountable product of copies of $\mathbb{R}$ is not first-countable and thus is not metrizable.
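
As an illustration of the finite product construction, the sketch below combines the usual metric on the reals with the Hamming distance on equal-length strings, using three different norms N. The names `product_metric`, `abs_metric` and `hamming` are chosen for this example only.

```python
import math

def product_metric(metrics, norm):
    """Return d((x_1..x_n), (y_1..y_n)) = norm(d_1(x_1, y_1), ..., d_n(x_n, y_n))."""
    def d(x, y):
        return norm([d_i(x_i, y_i) for d_i, x_i, y_i in zip(metrics, x, y)])
    return d

def euclidean_norm(v):
    return math.sqrt(sum(t * t for t in v))

def taxicab_norm(v):
    return sum(abs(t) for t in v)

def max_norm(v):
    return max(abs(t) for t in v)

def abs_metric(a, b):
    """Usual metric on the real numbers."""
    return abs(a - b)

def hamming(s, t):
    """Hamming distance between two strings of equal length."""
    return sum(a != b for a, b in zip(s, t))

x = (0.0, "abcd")
y = (3.0, "wxyz")
factors = [abs_metric, hamming]  # coordinate distances are 3 and 4

for norm in (euclidean_norm, taxicab_norm, max_norm):
    print(norm.__name__, product_metric(factors, norm)(x, y))
```

The three resulting distances differ (5, 7 and 4 for this pair), but, as stated above, the metrics they come from induce the same topology on the product.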

Quotient metric spaces

If M is a metric space with metric d, and $\sim$ is an equivalence relation on M, then we can endow the quotient set $M/{\sim}$ with a pseudometric. The distance between two equivalence classes [x] and [y] is defined as $d'([x],[y])=\inf\{d(p_1,q_1)+d(p_2,q_2)+\dotsb+d(p_n,q_n)\}$, where the infimum is taken over all finite sequences $(p_1,p_2,\dots,p_n)$ and $(q_1,q_2,\dots,q_n)$ with $p_1\sim x$, $q_n\sim y$, and $q_i\sim p_{i+1}$ for $i=1,2,\dots,n-1$.[30] In general this will only define a pseudometric, i.e. $d'([x],[y])=0$ does not necessarily imply that $[x]=[y]$. However, for some equivalence relations (e.g., those given by gluing together polyhedra along faces), $d'$ is a metric. The quotient metric $d'$ is characterized by the following universal property. If $f\colon(M,d)\to(X,\delta)$ is a metric (i.e. 1-Lipschitz) map between metric spaces satisfying f(x) = f(y) whenever $x\sim y$, then the induced function $\overline{f}\colon M/{\sim}\to X$, given by $\overline{f}([x])=f(x)$, is a metric map $\overline{f}\colon(M/{\sim},d')\to(X,\delta)$. The quotient metric does not always induce the quotient topology. For example, the topological quotient of the metric space $\mathbb{N}\times[0,1]$ identifying all points of the form (n,0) is not metrizable since it is not first-countable, but the quotient metric is a well-defined metric on the same set which induces a coarser topology. Moreover, different metrics on the original topological space (a disjoint union of countably many intervals) lead to different topologies on the quotient.[31] A topological space is sequential if and only if it is a (topological) quotient of a metric space.[32]
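
For a finite metric space the infimum over chains can be computed as a shortest-path problem: each step of a chain jumps from one equivalence class to another at the cost of the smallest distance between them, and an all-pairs shortest-path pass then accounts for chains through intermediate classes. The sketch below uses the Floyd–Warshall algorithm for that pass; the function name and the example space are assumptions made for illustration.

```python
import itertools

def quotient_pseudometric(classes, d):
    """Quotient pseudometric between equivalence classes of a finite metric space.

    classes: list of lists of points, forming a partition of the space.
    Returns a matrix D with D[i][j] equal to the quotient distance
    between classes[i] and classes[j].
    """
    n = len(classes)
    # One-step cost: the cheapest single jump from class i to class j.
    D = [
        [min(d(p, q) for p in classes[i] for q in classes[j]) for j in range(n)]
        for i in range(n)
    ]
    # Floyd-Warshall (k varies slowest): allow chains through intermediate classes.
    for k, i, j in itertools.product(range(n), repeat=3):
        D[i][j] = min(D[i][j], D[i][k] + D[k][j])
    return D

# Points 0, 1, 5, 6 on the real line, with 1 and 5 glued together.
classes = [[0], [1, 5], [6]]
D = quotient_pseudometric(classes, lambda a, b: abs(a - b))
print(D[0][2])  # 0 -> 1, then 5 -> 6: total length 2 instead of 6
```

Gluing 1 to 5 creates a shortcut, so the distance between the classes of 0 and 6 drops from 6 to 2.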

Generalizations of metric spaces

There are several notions of spaces which have less structure than a metric space, but more than a topological space.

  • Uniform spaces are spaces in which distances are not defined, but uniform continuity is.
  • Approach spaces are spaces in which point-to-set distances are defined, instead of point-to-point distances. They have particularly good properties from the point of view of category theory.
  • Continuity spaces are a generalization of metric spaces and posets that can be used to unify the notions of metric spaces and domains.

There are also numerous ways of relaxing the axioms for a metric, giving rise to various notions of generalized metric spaces. These generalizations can also be combined. The terminology used to describe them is not completely standardized. Most notably, in functional analysis pseudometrics often come from seminorms on vector spaces, and so it is natural to call them "semimetrics". This conflicts with the use of the term in topology.

Extended metrics

Some authors define metrics so as to allow the distance function d to attain the value ∞, i.e. distances are non-negative numbers on the extended real number line.[4] Such a function is also called an extended metric or "∞-metric". Every extended metric can be replaced by a real-valued metric that is topologically equivalent. This can be done using a subadditive monotonically increasing bounded function which is zero at zero, e.g. $d'(x,y)=d(x,y)/(1+d(x,y))$ or $d''(x,y)=\min(1,d(x,y))$.
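
The two bounding transformations can be sketched as follows. The convention that an infinite distance maps to 1 under d/(1+d) is the natural limiting value and is stated here as an assumption; the example extended metric `component_metric` is hypothetical.

```python
import math

def bounded_ratio(d):
    """Return d / (1 + d), taking the limiting value 1 where d is infinite."""
    def d_prime(x, y):
        t = d(x, y)
        return 1.0 if math.isinf(t) else t / (1.0 + t)
    return d_prime

def truncated(d):
    """Return min(1, d)."""
    return lambda x, y: min(1.0, d(x, y))

def component_metric(x, y):
    """Hypothetical extended metric on pairs (component, position):
    points in different components are infinitely far apart."""
    (cx, px), (cy, py) = x, y
    return abs(px - py) if cx == cy else math.inf

a, b, c = (0, 0.0), (0, 3.0), (1, 0.0)
print(bounded_ratio(component_metric)(a, b), truncated(component_metric)(a, b))
print(bounded_ratio(component_metric)(a, c), truncated(component_metric)(a, c))
```

Both results are real-valued and bounded by 1, and small distances are changed only slightly, which is why the topology is preserved.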

Metrics valued in structures other than the real numbers

The requirement that the metric take values in $[0,\infty)$ can be relaxed to consider metrics with values in other structures, including:

These generalizations still induce a uniform structure on the space.

Pseudometrics

A pseudometric on X is a function $d\colon X\times X\to\mathbb{R}$ which satisfies the axioms for a metric, except that instead of the second (identity of indiscernibles) only $d(x,x)=0$ for all x is required.[34] In other words, the axioms for a pseudometric are:

  1. $d(x,y)\ge 0$
  2. $d(x,x)=0$
  3. $d(x,y)=d(y,x)$
  4. $d(x,z)\le d(x,y)+d(y,z)$.

In some contexts, pseudometrics are referred to as semimetrics[35] because of their relation to seminorms.

Quasimetrics

Occasionally, a quasimetric is defined as a function that satisfies all axioms for a metric with the possible exception of symmetry.[36] The name of this generalisation is not entirely standardized.[37]

  1. $d(x,y)\ge 0$
  2. $d(x,y)=0\iff x=y$
  3. $d(x,z)\le d(x,y)+d(y,z)$

Quasimetrics are common in real life. For example, given a set X of mountain villages, the typical walking times between elements of X form a quasimetric because travel uphill takes longer than travel downhill. Another example is the length of car rides in a city with one-way streets: here, a shortest path from point A to point B goes along a different set of streets than a shortest path from B to A and may have a different length. A quasimetric on the reals can be defined by setting $d(x,y)=\begin{cases}x-y & \text{if } x\ge y,\\ 1 & \text{otherwise.}\end{cases}$ The 1 may be replaced, for example, by infinity or by $1+\sqrt{y-x}$ or any other subadditive function of y − x. This quasimetric describes the cost of modifying a metal stick: it is easy to reduce its size by filing it down, but it is difficult or impossible to grow it. Given a quasimetric on X, one can define an R-ball around x to be the set $\{y\in X \mid d(x,y)\le R\}$. As in the case of a metric, such balls form a basis for a topology on X, but this topology need not be metrizable. For example, the topology induced by the quasimetric on the reals described above is the (reversed) Sorgenfrey line.
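
The metal-stick quasimetric is simple enough to check by hand or by machine. The sketch below implements it with the flat cost 1 for growing, shows that it is not symmetric, and spot-checks the triangle inequality on a few sample lengths; the function name `stick` and the sample values are illustrative.

```python
import itertools

def stick(x, y):
    """Cost of changing a stick of length x into one of length y:
    x - y when shrinking (x >= y), a flat cost of 1 when growing."""
    return x - y if x >= y else 1.0

lengths = [0.0, 0.5, 1.0, 2.0, 3.5]

# Asymmetry: shrinking from 3 to 1 costs 2, growing from 1 to 3 costs 1.
print(stick(3.0, 1.0), stick(1.0, 3.0))

# The triangle inequality holds on every triple of sample lengths.
triangle_ok = all(
    stick(x, z) <= stick(x, y) + stick(y, z)
    for x, y, z in itertools.product(lengths, repeat=3)
)
print(triangle_ok)
```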

Metametrics or partial metrics

In a metametric, all the axioms of a metric are satisfied except that the distance between identical points is not necessarily zero. In other words, the axioms for a metametric are:

  1. $d(x,y)\ge 0$
  2. $d(x,y)=0\implies x=y$
  3. $d(x,y)=d(y,x)$
  4. $d(x,z)\le d(x,y)+d(y,z)$.

Metametrics appear in the study of Gromov hyperbolic metric spaces and their boundaries. The visual metametric on such a space satisfies d(x,x)=0 for points x on the boundary, but otherwise d(x,x) is approximately the distance from x to the boundary. Metametrics were first defined by Jussi Väisälä.[38] In other work, a function satisfying these axioms is called a partial metric[39][40] or a dislocated metric.[34]

Semimetrics

A semimetric on X is a function $d\colon X\times X\to\mathbb{R}$ that satisfies the first three axioms, but not necessarily the triangle inequality:

  1. $d(x,y)\ge 0$
  2. $d(x,y)=0\iff x=y$
  3. $d(x,y)=d(y,x)$

Some authors work with a weaker form of the triangle inequality, such as:

$d(x,z)\le\rho\,(d(x,y)+d(y,z))$ (ρ-relaxed triangle inequality)
$d(x,z)\le\rho\,\max\{d(x,y),d(y,z)\}$ (ρ-inframetric inequality)

The ρ-inframetric inequality implies the ρ-relaxed triangle inequality (assuming the first axiom), and the ρ-relaxed triangle inequality implies the 2ρ-inframetric inequality. Semimetrics satisfying these equivalent conditions have sometimes been referred to as quasimetrics,[41] nearmetrics[42] or inframetrics.[43] The ρ-inframetric inequalities were introduced to model round-trip delay times in the internet.[43] The triangle inequality implies the 2-inframetric inequality, and the ultrametric inequality is exactly the 1-inframetric inequality.
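
A standard example of a semimetric that is not a metric is the squared distance on the real line. The sketch below spot-checks, on a few integer sample points, that it fails the ordinary triangle inequality but satisfies the 2-relaxed triangle inequality and the 4-inframetric inequality, in line with the implications stated above. The helper names are assumptions made for this example.

```python
import itertools

def sq_dist(x, y):
    """Squared distance on the real line: a semimetric, not a metric."""
    return (x - y) ** 2

def relaxed_triangle_holds(d, points, rho):
    """Check d(x, z) <= rho * (d(x, y) + d(y, z)) on all sample triples."""
    return all(
        d(x, z) <= rho * (d(x, y) + d(y, z))
        for x, y, z in itertools.product(points, repeat=3)
    )

def inframetric_holds(d, points, rho):
    """Check d(x, z) <= rho * max(d(x, y), d(y, z)) on all sample triples."""
    return all(
        d(x, z) <= rho * max(d(x, y), d(y, z))
        for x, y, z in itertools.product(points, repeat=3)
    )

points = [0, 1, 2, 3, 5]
print(relaxed_triangle_holds(sq_dist, points, 1))  # ordinary triangle inequality
print(relaxed_triangle_holds(sq_dist, points, 2))
print(inframetric_holds(sq_dist, points, 4))
```

The failure for ρ = 1 is already visible on the triple 0, 1, 2, where the squared distance from 0 to 2 is 4 while the two intermediate steps contribute 1 each.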

Premetrics

Relaxing the last three axioms leads to the notion of a premetric, i.e. a function satisfying the following conditions:

  1. $d(x,y)\ge 0$
  2. d(x,x)=0

This is not a standard term. Sometimes it is used to refer to other generalizations of metrics such as pseudosemimetrics[44] or pseudometrics;[45] in translations of Russian books it sometimes appears as "prametric".[46] A premetric that satisfies symmetry, i.e. a pseudosemimetric, is also called a distance.[47] Any premetric gives rise to a topology as follows. For a positive real r, the r-ball centered at a point p is defined as

$B_r(p)=\{x \mid d(x,p)<r\}$.

A set is called open if for any point p in the set there is an r-ball centered at p which is contained in the set. Every premetric space is a topological space, and in fact a sequential space. In general, the r-balls themselves need not be open sets with respect to this topology. As for metrics, the distance between two sets A and B is defined as

$d(A,B)=\inf_{x\in A,\,y\in B} d(x,y)$.

This defines a premetric on the power set of a premetric space. If we start with a (pseudosemi-)metric space, we get a pseudosemimetric, i.e. a symmetric premetric. Any premetric gives rise to a preclosure operator cl as follows:

$\operatorname{cl}(A)=\{x \mid d(x,A)=0\}$.

Pseudoquasimetrics

The prefixes pseudo-, quasi- and semi- can also be combined, e.g., a pseudoquasimetric (sometimes called hemimetric) relaxes both the indiscernibility axiom and the symmetry axiom and is simply a premetric satisfying the triangle inequality. For pseudoquasimetric spaces the open r-balls form a basis of open sets. A very basic example of a pseudoquasimetric space is the set {0,1} with the premetric given by d(0,1)=1 and d(1,0)=0. The associated topological space is the Sierpiński space. Sets equipped with an extended pseudoquasimetric were studied by William Lawvere as "generalized metric spaces".[48] From a categorical point of view, the extended pseudometric spaces and the extended pseudoquasimetric spaces, along with their corresponding nonexpansive maps, are the best behaved of the metric space categories. One can take arbitrary products and coproducts and form quotient objects within the given category. If one drops "extended", one can only take finite products and coproducts. If one drops "pseudo", one cannot take quotients. Lawvere also gave an alternate definition of such spaces as enriched categories. The ordered set $(\mathbb{R},\ge)$ can be seen as a category with one morphism $a\to b$ if $a\ge b$ and none otherwise. Using + as the tensor product and 0 as the identity makes this category into a monoidal category R*. Every (extended pseudoquasi-)metric space (M,d) can now be viewed as a category M* enriched over R*:

  • The objects of the category are the points of M.
  • For every pair of points x and y such that $d(x,y)<\infty$, there is a single morphism which is assigned the object d(x,y) of R*.
  • The triangle inequality and the fact that d(x,x)=0 for all points x derive from the properties of composition and identity in an enriched category.
  • Since R* is a poset, all diagrams that are required for an enriched category commute automatically.
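
The two-point pseudoquasimetric space mentioned above is small enough to enumerate its topology by brute force. The sketch below uses the ball convention from the premetric section, where the r-ball at p collects the points x with d(x,p) < r, and relies on the fact that on a finite space it is enough to test a single radius smaller than every positive distance. The variable and function names are illustrative.

```python
from itertools import combinations

points = [0, 1]
# The pseudoquasimetric from the text: d(0, 1) = 1 and d(1, 0) = 0.
dist = {(0, 0): 0, (1, 1): 0, (0, 1): 1, (1, 0): 0}

def ball(p, r):
    """Open r-ball at p: all x with d(x, p) < r."""
    return {x for x in points if dist[(x, p)] < r}

def is_open(U):
    """U is open if every point of U has some r-ball inside U.
    On a finite space, one radius below all positive distances suffices."""
    r = min(v for v in dist.values() if v > 0) / 2
    return all(ball(p, r) <= U for p in U)

subsets = [set(c) for k in range(len(points) + 1) for c in combinations(points, k)]
open_sets = [U for U in subsets if is_open(U)]
print(open_sets)
```

Exactly one of the two singletons turns out to be open, which is the defining feature of the Sierpiński space. Using the opposite ball convention (points x with d(p,x) < r) swaps which singleton is open.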

Metrics on multisets

The notion of a metric can be generalized from a distance between two elements to a number assigned to a multiset of elements. A multiset is a generalization of the notion of a set in which an element can occur more than once. Define the multiset union $U=X\uplus Y$ as follows: if an element x occurs m times in X and n times in Y then it occurs m + n times in U. A function d on the set of nonempty finite multisets of elements of a set M is a metric[49] if

  1. $d(X)=0$ if all elements of X are equal and $d(X)>0$ otherwise (positive definiteness)
  2. $d(X)$ depends only on the (unordered) multiset X (symmetry)
  3. $d(X\uplus Y)\le d(X\uplus Z)+d(Z\uplus Y)$ (triangle inequality)

By considering the cases of axioms 1 and 2 in which the multiset X has two elements and the case of axiom 3 in which the multisets X, Y, and Z have one element each, one recovers the usual axioms for a metric. That is, every multiset metric yields an ordinary metric when restricted to sets of two elements. A simple example is the set of all nonempty finite multisets X of integers with $d(X)=\max(X)-\min(X)$. More complex examples are information distance in multisets[49] and normalized compression distance (NCD) in multisets.[50]
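
The max-minus-min example can be modelled with Python's `collections.Counter`, whose `+` operator adds multiplicities and therefore plays the role of the multiset union defined above. This is a minimal sketch; the function name `spread` and the sample multisets are assumptions made for illustration.

```python
from collections import Counter

def spread(X):
    """d(X) = max(X) - min(X) for a nonempty finite multiset X of integers."""
    elems = list(X.elements())
    return max(elems) - min(elems)

X = Counter([1, 1, 4])
Y = Counter([2, 7])
Z = Counter([3])

# Counter addition sums multiplicities: 1 occurs twice in X + Y.
print(spread(X + Y))                   # left-hand side of the triangle inequality
print(spread(X + Z) + spread(Z + Y))   # right-hand side
print(spread(Counter([5, 5, 5])))      # all elements equal
```

Restricting to two-element multisets recovers the usual distance |a − b| on the integers, as described above.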

Notes

  1. Balls with rational radius around a point x form a neighborhood basis for that point.
  2. In the context of intervals in the real line, or more generally regions in Euclidean space, bounded sets are sometimes referred to as "finite intervals" or "finite regions". However, they do not typically have a finite number of elements, and while they all have finite volume, so do many unbounded sets. Therefore this terminology is imprecise.
  3. This differs from usage in Riemannian geometry, where geodesics are only locally shortest paths. Some authors define geodesics in metric spaces in the same way.[18][19]

Citations

  1. Čech 1969, p. 42.
  2. Burago, Burago & Ivanov 2001.
  3. Heinonen 2001.
  4. Burago, Burago & Ivanov 2001, p. 1.
  5. Gromov 2007, p. xv.
  6. Gleason, Andrew (1991). Fundamentals of Abstract Analysis (1st ed.). Taylor & Francis. p. 223. doi:10.1201/9781315275444. ISBN 9781315275444. S2CID 62222843.
  7. Fréchet, M. (December 1906). "Sur quelques points du calcul fonctionnel". Rendiconti del Circolo Matematico di Palermo. 22 (1): 1–72. doi:10.1007/BF03018603. S2CID 123251660.
  8. F. Hausdorff (1914) Grundzüge der Mengenlehre
  9. Blumberg, Henry (1927). "Hausdorff's Grundzüge der Mengenlehre". Bulletin of the American Mathematical Society. 6: 778–781. doi:10.1090/S0002-9904-1920-03378-1.
  10. Mohamed A. Khamsi & William A. Kirk (2001) Introduction to Metric Spaces and Fixed Point Theory, page 14, John Wiley & Sons
  11. Rudin, Mary Ellen. A new proof that metric spaces are paracompact Archived 2016-04-12 at the Wayback Machine. Proceedings of the American Mathematical Society, Vol. 20, No. 2. (Feb., 1969), p. 603.
  12. Burago, Burago & Ivanov 2001, p. 2.
  13. Burago, Burago & Ivanov 2001, p. 2.
    Some authors refer to any distance-preserving function as an isometry, e.g. Munkres 2000, p. 181.
  14. Gromov 2007, p. xvii.
  15. Margalit & Thomas 2017.
  16. Narici & Beckenstein 2011, pp. 47–66.
  17. Burago, Burago & Ivanov 2001, Definition 2.3.1.
  18. Burago, Burago & Ivanov 2001, Definition 2.5.27.
  19. Gromov 2007, Definition 1.9.
  20. Burago, Burago & Ivanov 2001, p. 127.
  21. Heinonen 2007, p. 191.
  22. Gigli, Nicola (2018-10-18). "Lecture notes on differential calculus on RCD spaces". Publications of the Research Institute for Mathematical Sciences. 54 (4): 855–918. arXiv:1703.06829. doi:10.4171/PRIMS/54-4-4. S2CID 119129867.
  23. Linial, Nathan (2003). "Finite metric-spaces—combinatorics, geometry and algorithms". Proceedings of the ICM, Beijing 2002. Vol. 3. pp. 573–586. arXiv:math/0304466.
  24. Bourgain, J. (1985). "On lipschitz embedding of finite metric spaces in Hilbert space". Israel Journal of Mathematics. 52 (1–2): 46–52. doi:10.1007/BF02776078. S2CID 121649019.
  25. Jiří Matoušek and Assaf Naor, ed. "Open problems on embeddings of finite metric spaces". Archived 2010-12-26 at the Wayback Machine.
  26. Fakcharoenphol, J.; Rao, S.; Talwar, K. (2004). "A tight bound on approximating arbitrary metrics by tree metrics". Journal of Computer and System Sciences. 69 (3): 485–497. doi:10.1016/j.jcss.2004.04.011.
  27. Ó Searcóid 2006, p. 107.
  28. Gottlieb, Lee-Ad; Solomon, Shay (2014-06-08). Light spanners for snowflake metrics. SOCG '14: Proceedings of the thirtieth annual symposium on Computational geometry. pp. 387–395. arXiv:1401.5014. doi:10.1145/2582112.2582140.
  29. Robinson, D.F.; Foulds, L.R. (February 1981). "Comparison of phylogenetic trees". Mathematical Biosciences. 53 (1–2): 131–147. doi:10.1016/0025-5564(81)90043-2. S2CID 121156920.
  30. Burago, Burago & Ivanov 2001, Definition 3.1.12.
  31. See Burago, Burago & Ivanov 2001, Example 3.1.17, although in this book the quotient $\mathbb{N}\times[0,1]/\mathbb{N}\times\{0\}$ is incorrectly claimed to be homeomorphic to the topological quotient.
  32. Goreham, Anthony. Sequential convergence in Topological Spaces Archived 2011-06-04 at the Wayback Machine. Honours' Dissertation, Queen's College, Oxford (April, 2001), p. 14
  33. Hitzler & Seda 2016, Definition 4.3.1.
  34. Hitzler & Seda 2016, Definition 4.2.1.
  35. Burago, Burago & Ivanov 2001, Definition 1.1.4.
  36. Steen & Seebach (1995); Smyth (1988)
  37. Rolewicz (1987) calls them "semimetrics". That same term is also frequently used for two other generalizations of metrics.
  38. Väisälä 2005.
  39. "Partial metrics: welcome". www.dcs.warwick.ac.uk. Archived from the original on 2017-07-27. Retrieved 2018-05-02.
  40. Bukatin, Michael; Kopperman, Ralph; Matthews, Steve; Pajoohesh, Homeira (2009-10-01). "Partial Metric Spaces" (PDF). American Mathematical Monthly. 116 (8): 708–718. doi:10.4169/193009709X460831. S2CID 13969183.
  41. Xia 2009.
  42. Xia 2008.
  43. Fraigniaud, Lebhar & Viennot 2008.
  44. Buldygin & Kozachenko 2000.
  45. Helemskii 2006.
  46. Arkhangel'skii & Pontryagin (1990); Aldrovandi & Pereira (2017)
  47. Deza & Laurent 1997.
  48. Lawvere (1973); Vickers (2005)
  49. Vitányi 2011.
  50. Cohen & Vitányi 2012.
