Journal of Computers, Vol 7, No 7 (2012), 1599-1606, Jul 2012
doi:10.4304/jcp.7.7.1599-1606

A Personalization Recommendation Method Based on Deep Web Data Query

Tao Tan, Hongjun Chen

Abstract


Deep Web is becoming a hot research topic in the area of database. Most of the existing researches mainly focus on Deep Web data integration technology. Deep Web data integration can partly satisfy people's needs of Deep Web information search, but it cannot learn users’ interest, and people search the same content online repeatedly would cause much unnecessary waste. According to this kind of demand, this paper introduced personalization recommendation to the Deep Web data query, proposed a user interest model based on fine-grained management of structured data and a similarity matching algorithm based on attribute eigenvector in allusion to personalization recommendation. Secondly, As for Deep Web information crawl, a crawl technology based on the tree structure is presented, with the traversal method of tree to solve the information crawl problems in the personalization service distributed in various web databases.  Finally, developed a prototype recommendation system based on recruitment information, verified the efficiency and effectiveness of the personalization recommendation and the coverage and cost of Deep Web crawl through the experiment.


Keywords


Deep Web; Personalization Recommendation; Similarity Matching; User Interest Model; data crawl

References


 

[1] Chang KCC, He B, Li CK, Patel M, Zhang Z, “Structured databases on the Web: Observations and implications,” SIGMOD Record, Vol.33, 2004, pp.61−70.
http://dx.doi.org/10.1145/1031570.1031584

[2] BrightPlanet.com, “The deep Web: Surfacing hidden value,” 2000, http://brightplanet.com.

[3] Wang Jingfan, “Two-Step Job Information Retrieval based on Document Similarity,” 2007.

[4] Ma Jun, Song Ling, Han Xiaohui and Yan Po, “Classification of Deep Web Databases Based on the Context of Web Pages,” Journal of Software,Vol.19,No.2, pp.267-274,February 2008.
http://dx.doi.org/10.3724/SP.J.1001.2008.00267

[5] He B, Tao T, Chang KCC, “Clustering structured Web sources: A schema-based, model-differentiation approach,” Springer-Verlag. Heraklion, pp. 536−546, 2004 [the 9th Int’l Conf. on Extending Database Technology].

[6] Peng Q, Meng WY, He H, Yu C, “WISE-Cluster: Clustering e-commerce search engines automatically,” ACM Press. Washington, pp.104−111, 2004 [the 6th ACM Int’l Workshop Conf. on Web Information and Data Management].

[7] Wu WS, Yu C, Doan AH, Meng WY, “An interactive clustering-based approach to integrating source query interfaces on the deep Web,” ACM Press. Paris, pp.95−106, 2004 [the 24th ACM SIGMOD Int’l Conf. on Management of Data].

[8] He H, Meng WY, Yu C, Wu ZH, “WISE-Integrator: An automatic integrator of Web search interfaces for e-commerce,” Morgan Kaufmann Publishers. San Fransisco, pp.357−368, 2003 [the 29th Int’l Conf. on Very Large Data Bases].

[9] Zhai YH, Liu B, “Web data extraction based on partial tree alignment,” ACM Press. Chiba, pp. 76−85, 2005 [Proc. of the 14th Int’l World Wide Web Conf.].

[10] Zhao HK, Meng WY, Wu ZH, Raghavan V, Yu C, “Fully automatic wrapper generation for search engines,” ACM Press. Chiba, pp. 66−75, 2005 [the 14th Int’l World Wide Web Conf.] .

[11] Li Xueqing,Liu Ruihua, “An approach for User Interest Model Based on Ontology,” Information Research.

[12] Li Shan, “The Representation and Update for User Profile in Personalized Service,” Journal of the China society for scientific and technical information, Vol.29, No.1, 2010, pp.67-71.

[13] Xu Hexiang, “Research on Deep Web Integration and Its Related Several Technologies [Dr. Dissertation],” 2008.

[14] T.Joachims, T.Mitchell, D.Freitag, and R.Armstrong, “Webwatcher: machine learning and hypertext,” GI Fachgruppentreffen Maschinelles Lernen, Universtiy of Dortmund, August 1995.

[15] H.Lieberman, “Letizia: an agent that assists web browsing,” 1995 [the 14th International Joint Conf. on Artificial Intelligence].

[16] M.Balabanovic and Y. Shoham, “Learning information retrieval agents: experiments with automated web browsing,” Stanford, CA, March 1995 [AAAI Spring Symposium on Information Gathering].

[17] Liu Wei, Meng Xiao-Feng and Meng Wei-Yi, “A Survey of Deep Web Data Integration,” Chinese Journal of Computers, Vol.30, No.9, 2007, pp.1475-1489.

[18] Liu Wei, Meng Xiaofeng and Ling Yanyan, “A Gragh-Based Approach for Web Database Sampling,” Journal of Software, Vol.19, No.2, 2008.pp.179-193.
http://dx.doi.org/10.3724/SP.J.1001.2008.00179

[19] Tian Jianwei and Li Shijun, “Retrieving Deep Web Data Based on Hierarchy Tree Model,” Journal of Computer Research and Development, Vol.48, No.1, 2011, pp.94-102.

[20] Dong Yongquan, “Research on Key Issues in Deep Web Data Integration [Dr. Dissertation],” 2010.


Full Text: PDF


Journal of Computers (JCP, ISSN 1796-203X)

Copyright @ 2006-2013 by ACADEMY PUBLISHER – All rights reserved.