Journal of Software, Vol 7, No 6 (2012), 1385-1392, Jun 2012
doi:10.4304/jsw.7.6.1385-1392

A Novel PIM System and its Effective Storage Compression Scheme

Liang Huai Yang, Jian Zhou, Jiacheng Wang, Mong Li Lee

Abstract


The rapidly growing volume of personal information poses a critical problem for users. Traditional file organization in hierarchical directories is poorly suited to managing personal information effectively. New tools are needed to overcome the shortcomings of current hierarchical file systems and to organize and maintain personal information efficiently. In this paper, we propose a novel scheme called concept space, a network of concepts and their associations, and use the topic map as its underlying data model. We present a materialized view scheme that gives users a flexible view of the file system matching their own cognition. We also reduce the system's storage requirements by borrowing ideas from XML data management and devising a novel and efficient data compression scheme. To demonstrate the effectiveness of these ideas, we have implemented a prototype personal information management system called NovaPIM and present its system architecture. Extensive experiments show that the proposed scheme is both efficient and effective.
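
As a rough illustration of the concept space idea described in the abstract, the sketch below models concepts linked by typed associations, attaches files to concepts, and derives a folder-like view by following one association type. It is not NovaPIM's implementation: the class name ConceptSpace, the method materialize_view, the association type "has-subtopic", and the file paths are all hypothetical, chosen only for this example. The compression scheme is not sketched, since the abstract gives no details of it.

```python
from collections import defaultdict


class ConceptSpace:
    """Hypothetical in-memory concept space: concepts linked by typed associations."""

    def __init__(self):
        self.occurrences = defaultdict(set)   # concept -> file paths attached to it
        self.associations = defaultdict(set)  # concept -> {(association type, target)}

    def add_concept(self, name):
        # Touch both maps so the concept exists even with no files or links.
        self.occurrences[name]
        self.associations[name]

    def attach(self, concept, path):
        """Attach a file to a concept (an 'occurrence' in topic map terms)."""
        self.add_concept(concept)
        self.occurrences[concept].add(path)

    def associate(self, source, kind, target):
        """Record a directed, typed association between two concepts."""
        self.add_concept(source)
        self.add_concept(target)
        self.associations[source].add((kind, target))

    def materialize_view(self, root, kind):
        """Build a folder-like tree by following one association type from root."""
        def build(concept, seen):
            # Skip concepts already on the current path to avoid cycles.
            children = sorted(
                t for k, t in self.associations[concept]
                if k == kind and t not in seen
            )
            return {
                "files": sorted(self.occurrences[concept]),
                "children": {c: build(c, seen | {c}) for c in children},
            }
        return build(root, {root})


# Example: two concepts, one association, one file attached to each concept.
space = ConceptSpace()
space.associate("Research", "has-subtopic", "PIM")
space.attach("Research", "notes/ideas.txt")
space.attach("PIM", "papers/novapim.pdf")
view = space.materialize_view("Research", "has-subtopic")
```

Because the view is computed from associations and is not tied to a single directory layout, the same file can appear under several concepts, and a different association type yields a different view of the same data.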


Keywords


Personal Information Management; concept space; data compression




Journal of Software (JSW, ISSN 1796-217X)

Copyright © 2006-2013 by ACADEMY PUBLISHER. All rights reserved.