Covering number

From The Right Wiki
Jump to navigationJump to search

In mathematics, a covering number is the number of balls of a given size needed to completely cover a given space, with possible overlaps between the balls. The covering number quantifies the size of a set and can be applied to general metric spaces. Two related concepts are the packing number, the number of disjoint balls that fit in a space, and the metric entropy, the number of points that fit in a space when constrained to lie at some fixed minimum distance apart.

Definition

Let (M, d) be a metric space, let K be a subset of M, and let r be a positive real number. Let Br(x) denote the ball of radius r centered at x. A subset C of M is an r-external covering of K if:

KxCBr(x).

In other words, for every yK there exists xC such that d(x,y)r. If furthermore C is a subset of K, then it is an r-internal covering. The external covering number of K, denoted Nrext(K), is the minimum cardinality of any external covering of K. The internal covering number, denoted Nrint(K), is the minimum cardinality of any internal covering. A subset P of K is a packing if PK and the set {Br(x)}xP is pairwise disjoint. The packing number of K, denoted Nrpack(K), is the maximum cardinality of any packing of K. A subset S of K is r-separated if each pair of points x and y in S satisfies d(x, y) ≥ r. The metric entropy of K, denoted Nrmet(K), is the maximum cardinality of any r-separated subset of K.

Examples

  1. The metric space is the real line . K is a set of real numbers whose absolute value is at most k. Then, there is an external covering of 2kr intervals of length r, covering the interval [k,k]. Hence:
    Nrext(K)2kr
  2. The metric space is the Euclidean space m with the Euclidean metric. Km is a set of vectors whose length (norm) is at most k. If K lies in a d-dimensional subspace of m, then:[1]: 337 
    Nrext(K)(2kdr)d.
  3. The metric space is the space of real-valued functions, with the l-infinity metric. The covering number Nrint(K) is the smallest number k such that, there exist h1,,hkK such that, for all hK there exists i{1,,k} such that the supremum distance between h and hi is at most r. The above bound is not relevant since the space is -dimensional. However, when K is a compact set, every covering of it has a finite sub-covering, so Nrint(K) is finite.[2]: 61 

Properties

  1. The internal and external covering numbers, the packing number, and the metric entropy are all closely related. The following chain of inequalities holds for any subset K of a metric space and any positive real number r.[3]
    N2rmet(K)Nrpack(K)Nrext(K)Nrint(K)Nrmet(K)
  2. Each function except the internal covering number is non-increasing in r and non-decreasing in K. The internal covering number is monotone in r but not necessarily in K.

The following properties relate to covering numbers in the standard Euclidean space, m:[1]: 338 

  1. If all vectors in K are translated by a constant vector k0m, then the covering number does not change.
  2. If all vectors in K are multiplied by a scalar k, then:
    for all r: N|k|rext(kK)=Nrext(K)
  3. If all vectors in K are operated by a Lipschitz function ϕ with Lipschitz constant k, then:
    for all r: N|k|rext(ϕK)Nrext(K)

Application to machine learning

Let K be a space of real-valued functions, with the l-infinity metric (see example 3 above). Suppose all functions in K are bounded by a real constant M. Then, the covering number can be used to bound the generalization error of learning functions from K, relative to the squared loss:[2]: 61 

Prob[suphK|GeneralizationError(h)EmpiricalError(h)|ϵ]Nrint(K)2expmϵ22M4

where r=ϵ8M and m is the number of samples.

See also

References

  1. 1.0 1.1 Shalev-Shwartz, Shai; Ben-David, Shai (2014). Understanding Machine Learning – from Theory to Algorithms. Cambridge University Press. ISBN 9781107057135.
  2. 2.0 2.1 Mohri, Mehryar; Rostamizadeh, Afshin; Talwalkar, Ameet (2012). Foundations of Machine Learning. US, Massachusetts: MIT Press. ISBN 9780262018258.
  3. Tao, Terence. "Metric entropy analogues of sum set theory". Retrieved 2 June 2014.