Decision-theoretic rough sets

From The Right Wiki
Jump to navigationJump to search

In the mathematical theory of decisions, decision-theoretic rough sets (DTRS) is a probabilistic extension of rough set classification. First created in 1990 by Dr. Yiyu Yao,[1] the extension makes use of loss functions to derive α and β region parameters. Like rough sets, the lower and upper approximations of a set are used.

Definitions

The following contains the basic principles of decision-theoretic rough sets.

Conditional risk

Using the Bayesian decision procedure, the decision-theoretic rough set (DTRS) approach allows for minimum-risk decision making based on observed evidence. Let A={a1,,am} be a finite set of m possible actions and let Ω={w1,,ws} be a finite set of s states. P(wj[x]) is calculated as the conditional probability of an object x being in state wj given the object description [x]. λ(aiwj) denotes the loss, or cost, for performing action ai when the state is wj. The expected loss (conditional risk) associated with taking action ai is given by:

R(ai[x])=j=1sλ(aiwj)P(wj[x]).

Object classification with the approximation operators can be fitted into the Bayesian decision framework. The set of actions is given by A={aP,aN,aB}, where aP, aN, and aB represent the three actions in classifying an object into POS(A), NEG(A), and BND(A) respectively. To indicate whether an element is in A or not in A, the set of states is given by Ω={A,Ac}. Let λ(aA) denote the loss incurred by taking action a when an object belongs to A, and let λ(aAc) denote the loss incurred by take the same action when the object belongs to Ac.

Loss functions

Let λPP denote the loss function for classifying an object in A into the POS region, λBP denote the loss function for classifying an object in A into the BND region, and let λNP denote the loss function for classifying an object in A into the NEG region. A loss function λN denotes the loss of classifying an object that does not belong to A into the regions specified by . Taking individual can be associated with the expected loss R(a[x])actions and can be expressed as:

R(aP[x])=λPPP(A[x])+λPNP(Ac[x]),
R(aN[x])=λNPP(A[x])+λNNP(Ac[x]),
R(aB[x])=λBPP(A[x])+λBNP(Ac[x]),

where λP=λ(aA), λN=λ(aAc), and =P, N, or B.

Minimum-risk decision rules

If we consider the loss functions λPPλBP<λNP and λNNλBN<λPN, the following decision rules are formulated (P, N, B):

  • P: If P(A[x])γ and P(A[x])α, decide POS(A);
  • N: If P(A[x])β and P(A[x])γ, decide NEG(A);
  • B: If βP(A[x])α, decide BND(A);

where,

α=λPNλBN(λBPλBN)(λPPλPN),
γ=λPNλNN(λNPλNN)(λPPλPN),
β=λBNλNN(λNPλNN)(λBPλBN).

The α, β, and γ values define the three different regions, giving us an associated risk for classifying an object. When α>β, we get α>γ>β and can simplify (P, N, B) into (P1, N1, B1):

  • P1: If P(A[x])α, decide POS(A);
  • N1: If P(A[x])β, decide NEG(A);
  • B1: If β<P(A[x])<α, decide BND(A).

When α=β=γ, we can simplify the rules (P-B) into (P2-B2), which divide the regions based solely on α:

  • P2: If P(A[x])>α, decide POS(A);
  • N2: If P(A[x])<α, decide NEG(A);
  • B2: If P(A[x])=α, decide BND(A).

Data mining, feature selection, information retrieval, and classifications are just some of the applications in which the DTRS approach has been successfully used.

See also

References

  1. Yao, Y.Y.; Wong, S.K.M.; Lingras, P. (1990). "A decision-theoretic rough set model". Methodologies for Intelligent Systems, 5, Proceedings of the 5th International Symposium on Methodologies for Intelligent Systems. Knoxville, Tennessee, USA: North-Holland: 17–25.

External links