Fuzzy retrieval methods are a radical new technique which was introduced to compensate the limitation of traditional boolean and vector methods which could not take user's preference into account. Many times the user query is fuzzy as he himself doesn't know much about the subject. At such times the traditional 1-0 logic of boolean methods or vector logics don't work out and the documents returned might either be too many or too few. To offset this problem, fuzzy techniques were introduced.
In this blog, I will try to introduce the logic of fuzzy sets and then we will see how fuzzy information retrieval methods work.
Fuzzy sets:
Traditional sets theory follow boolean logic with elements either being or not being part of the sets. But with fuzzy sets an element can belong to a set partially. Here, an element belongs to a set with varying degree between 0 and 1.
Thus we can define membership function in fuzzy sets as:
Let ,
set U={x1,x2,x3,....,xn}=universe of discourse (set of all elements)
function m=membership function
then for each element x in U,
x is not included if m(x)=0
x is included if m(x)=1
x is partially included if 0<m(x)<1
and function :
d(a and b)=min(da,db)
or function:
d(a or b)=max(da,db)
Fuzzy Information retrieval methods:
The two famous fuzzy retrieval models are:
SlM(Qor, D) = Cor1 * max(dA1, dA2, ..., dAn) + Cor2 * min(dA1, dA2, ..., dAn)
SlM(Qand, D) = Cand1 * min(dA1, dA2, ..., dAn) + Cand2 * max(dA1, dA2 ..., dAn)
where Cor1,Cor2,Cand1 and are constants such that:
In this blog, I will try to introduce the logic of fuzzy sets and then we will see how fuzzy information retrieval methods work.
Fuzzy sets:
Traditional sets theory follow boolean logic with elements either being or not being part of the sets. But with fuzzy sets an element can belong to a set partially. Here, an element belongs to a set with varying degree between 0 and 1.
Thus we can define membership function in fuzzy sets as:
Let ,
set U={x1,x2,x3,....,xn}=universe of discourse (set of all elements)
function m=membership function
then for each element x in U,
x is not included if m(x)=0
x is included if m(x)=1
x is partially included if 0<m(x)<1
and function :
d(a and b)=min(da,db)
or function:
d(a or b)=max(da,db)
Fuzzy Information retrieval methods:
The two famous fuzzy retrieval models are:
- The MMM model
- The Piace model
- The MMM (Mixed Min and Max) model:
SlM(Qor, D) = Cor1 * max(dA1, dA2, ..., dAn) + Cor2 * min(dA1, dA2, ..., dAn)
SlM(Qand, D) = Cand1 * min(dA1, dA2, ..., dAn) + Cand2 * max(dA1, dA2 ..., dAn)
where Cor1,Cor2,Cand1 and are constants such that:
Cor2=1-Cor1
Cand2=1-Cand1
Cand2=1-Cand1
- Piace Model
Here, r is a constant usually set to 1 for and and 0.7 for or queries and w's are the document weights arranged in descending order for or queries and ascending for and queries.
Thus the fuzzy techniques are a relatively new method used to cater to user requirement implemented in the above methods.
References:
(1)https://en.wikipedia.org/wiki/Fuzzy_set
(2)https://en.wikipedia.org/wiki/Fuzzy_retrieval
(3)http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=1411824
Comments
Post a Comment