International Research Journal of Commerce , Arts and Science

 ( Online- ISSN 2319 - 9202 )     New DOI : 10.32804/CASIRJ

Impact Factor* - 6.2311


**Need Help in Content editing, Data Analysis.

Research Gateway

Adv For Editing Content

   No of Download : 153    Submit Your Rating     Cite This   Download        Certificate

TRUSTRANK: THE FUZZY LOGIC APPROACH TO MITIGATE WEB SPAM

    1 Author(s):  MR. AGNIBHA DE

Vol -  3, Issue- 2 ,         Page(s) : 655 - 668  (2012 ) DOI : https://doi.org/10.32804/CASIRJ

Abstract

By the word web spam, we mean the techniques through which manipulations can be done to the web page ranking algorithm of the web search engines and by doing so compelling the engines to show certain search results in the result page having higher rank than what they deserve. There are various sort of spam pages – They might appear as if providing the assistance about some product or some particular subject of interest but most often such information is deliberately created to only manipulate the search engine algorithm and thus lacks any authentic and reliable information that it is purported to furnish. Recently, the number of such web pages and web sites has increased much in volume causing a degradation of the quality of the search results that are thrown by the popular search engines. Though the search engines nowadays use some sort of measure to provide resistance to such manipulations in form of spam resilience, but they are not at par with the extent of “spamming” that are taking place. Pagerank is considered as one of such measures, which while counting the number of hyperlinks referring to a concerned web page, also reckons the Pageranks of those pages in turn. The primary problem with such a measure is that it is easily manipulable which is happening nowadays [12]. TrustRank is the concept which overcomes the drawbacks and vulnerabilities of Pagerank and thus stands out as a superior measure when it comes to combating web spam. The TrustRank concept involves a human operator who is supposed to assign the TrustRank while judging a page and deciding if it is spam or not. The cases where a human mediator is unable to assign any value as TrustRank, human sentiment involved is considered while obtaining the TrustRank of the page. This work is based on the human sentiment involved in the judgement of seed set and thus a model is also proposed that reduces the human mediations and involvement of human sentiment while employing the concept of Fuzzy Logic in seed selection process.

[1]Gyongyi, Z. and Garcia-Molina, H., Web spam taxonomy. First International Workshop on Adversarial Information Retrieval on the Web (AIRWeb), 2005.
[2]Becchetti,L.,Castillo,C., Donato, D., Baeza, R., Leonardi, L., Link analysis for Web spam detection., AIRWeb 2006 workshop, 2006.
[3]Gyongyi, Z., Garcia-Molina, H. and Pedersen, J., Combating web spam with TrustRank, In VLDB Proceedings of the Thirtieth international conference on
Very large data bases, VLDB Endowment, 2004.
[4]Levene, M., An introduction to search engines and web navigation, Wiley & Sons, Inc., Hoboken, New Jersey, pages 209-272, 2010.
[5] Page, L., Brin, S., Motwani, R., Winograd, T., The PageRank citation ranking: Bringing order to the web, Stanford Digital Library Technologies Project,
1998.
[6] Brin, S., Page, L, The Anatomy of a Large-Scale Hyper textual Web Search Engine, Computer Network and ISDN Systems, Vol. 30, pages 107-117, 1998.
[7] Krishnan, V. and Raj, R., Web spam detection with antitrust rank, In AIRWeb’06, 2006.
[8] Timothy J. Ross., Fuzzy Logic with Engineering Applications, Third Edition, Wiley & Sons, Inc., Hoboken, New Jersey, pages 15-168, 2010.
[9]Gyongyi, Z. and Garcia-Molina, H., Seed selection in TrustRank, Tech. report. Stanford University, 2004.
[10] Jiang, Q., Zhang, L., Zhu, Y., Zhang, Y., Larger is better: Seed Selection in Link-based Anti-spamming Algorithms, WWW 2008, Beijing, China, 2008.
[11] Spamdexing, http://en.wikipedia.org/wiki/Spamdexing.
[12] The Ranking of pages via search engines: http://en.wikipedia.org/wiki/PageRank.
[13] Bai, Y., Zhuang, H., Wang, D., Advance fuzzy logic technologies in industrial applications, Springer, 2006.

*Contents are provided by Authors of articles. Please contact us if you having any query.






Bank Details