A Novel Chinese Domain Ontology Construction Method for Petroleum Exploration Information
Abstract
Ontology is playing an important role in knowledge management and sharing, both users and system can communicate with each other using a common understanding knowledge of a domain. This study proposes a context-based ontology construction method for extracting petroleum exploration domain information from unstructured Chinese text documents. The proposed mechanism of domain ontology construction includes four steps. First, domain documents preprocessing aims to separates the text into sentences, including a Chinese Part-of-Speech (POS) Tag and a Chinese corpus extract from the HowNet. Next, the concept clustering based on the fuzzy c-means aims to cluster concepts and instances from documents. In third step, context extraction aims to obtain the contexts. Finally, domain ontology construction aims to generate a petroleum exploration Chinese domain ontology. Experimental results show that the proposed approach can effectively construct Chinese domain ontology from unstructured text documents. This study implements a context-based ontology construction mechanism that can automatically mine domain concepts out of domain document, thereby reducing cost and burden that would be incurred in a manual construction process.
Keywords
References
[1] S. L. Nimmagadda, H. Dreher, and A. Rudra, “Data Warehouse Structuring Methodologies for Efficient Mining of Western Australian Petroleum Data Sources,” The 3rd IEEE International Conference on Industrial Informatics, pp. 611-616, 2005.
[2] N. Lammari, and E. Metais, “Building and maintaining ontologies: A set of algorithms,” Data and Knowledge Engineering, vol. 48, pp. 155-176, 2004.
http://dx.doi.org/10.1016/S0169-023X(03)00103-4
[3] C. Brewster, and K. O’Hara, “Knowledge representation with ontologies: Present challenges–Future possibilities,” International Journal of Human-Computer Studies, vol. 65, pp. 563-568, 2007.
http://dx.doi.org/10.1016/j.ijhcs.2007.04.003
[4] V. Sugumaran, and V. C. Storey, “The role of domain ontologies in database design: An ontology management and conceptual modeling environment,” ACM Transactions on Database Systems, vol. 31, pp. 1064-1094, 2006.
http://dx.doi.org/10.1145/1166074.1166083
[5] L. Wang, and M. Li, “Information retrieval model for the semantic search,” Journal of Computational Information Systems, vol. 3, pp. 1359-1366, 2007.
[6] P. C. Smits, and A. Friis-Christensen, “Resource discovery in a european spatial data infrastructure,” IEEE Transactions on Knowledge and Data Engineering, vol. 19, pp. 85-95, 2007.
http://dx.doi.org/10.1109/TKDE.2007.250587
[7] S. K. Bechhofer, R. D. Stevens, and P. W. Lord, “GOHSE: Ontology driven linking of biology resources,” Journal of Web Semantics, vol. 4, pp. 155-163, 2006.
http://dx.doi.org/10.1016/j.websem.2005.09.003
[8] Y. Liu, Z. Sui, Y. Hu, and Q. Zhao, “Research on automatic construction of medical ontology,” International Conference on Biomedical Engineering and Computer Science, pp. 1-4, 2010.
http://dx.doi.org/10.1109/ICBECS.2010.5462404
[9] W. Zhou, Z. Liu, and Y. Zhao, “Ontology learning by clustering based on fuzzy formal concept analysis,” In Proceedings of the 31st Annual International Computer Software and Applications Conference, pp. 204-210, 2007.
[10] C. S. Lee, Y. F. Kao, Y. H. Kuo, and M. H. Wang, “Automated ontology construction for unstructured text documents,” Data & Knowledge Engineering, vol. 60, pp. 547-566, 2007.
http://dx.doi.org/10.1016/j.datak.2006.04.001
[11] D. Zhang, L. Zhang, and H. Jiang, “Research on Semi-automatic Domain Ontology Construction,” Seventh Web Information Systems and Applications Conference, pp. 115-118, 2010.
http://dx.doi.org/10.1109/WISA.2010.41
[12] X. Zhou, G. Xu, and L. Liu, “An Approach for Ontology Construction Based on Relational Databases,” International Journal of Research and Reviews in Artificial Intelligence, vol. 1, pp. 16-19, 2011.
[13] X. Hou, S.K. Ong, A.Y.C. Nee, X.T. Zhang, and W.J. Liu, “GRAONTO: A graph-based approach for automatic construction of domain ontology,” Expert Systems with Applications, vol. 38, pp. 11958-11975, 2011.
http://dx.doi.org/10.1016/j.eswa.2011.03.090
[14] J. Kim, P. Kim, and H. Chung, “Ontology construction using online ontologies based on selection, mapping and merging,” International Journal of Web and Grid Services, vol. 7 pp. 170-189, 2011.
http://dx.doi.org/10.1504/IJWGS.2011.040447
[15] S. L. Nimmagadda, and H. Dreher, “Petroleum Ontology: an effective data integration and mining methodology aiding exploration of commercial petroleum plays,” The IEEE International Conference on Industrial Informatics, pp. 1289-1295, 2008.
http://dx.doi.org/10.1109/INDIN.2008.4618302
[16] H. Li, and W. Ko, “Automated food ontology construction mechanism for diabetes diet care,” International conference on machine learning and cybernetics, vol. 5, pp. 2953-2958, 2007.
[17] C. W. Shih, M. Y. Chen, H. C Chu, and Y. M. Chen, “Enhancement of domain ontology construction using a crystallizing approach,” Expert Systems with Applications, vol. 38, pp. 7544-7557, 2011.
http://dx.doi.org/10.1016/j.eswa.2010.12.112
[18] R. Chen, J. Liang, and R. Pan, “Using recursive ART network to construction domain ontology based on term frequency and inverse document frequency,” Expert Systems with Applications, vol. 34, pp. 488-501, 2008.
http://dx.doi.org/10.1016/j.eswa.2006.09.019
[19] M. Y. Dahab, H. A. Hassan, and A. Rafea, “TextOntoEx: Automatic ontology construction from natural English text,” Expert Systems with Applications, vol. 34, pp. 1474-1480, 2008.
http://dx.doi.org/10.1016/j.eswa.2007.01.043
[20] M. Fernández-López, and A. Gómez-Pérez, “Overview and analysis of methodologies for building ontologies,” The Knowledge Engineering Review, vol. 17, pp. 129-156, 2002.
[21] O. Corcho, M. Fernandez-Lopez, and A. Gomez-Perez, “Methodologies, tools and languages for building ontologies: where is their meeting point?,” Data Knowledge Engineering, vol. 46, pp. 41-64, 2003.
http://dx.doi.org/10.1016/S0169-023X(02)00195-7
[22] T. R. Gruber, “A translation approach to portable ontology specification,” Knowledge Acquisition, vol. 5, pp. 199-220, 1993.
http://dx.doi.org/10.1006/knac.1993.1008
[23] M. Gruninger, and M. S. Fox, “Methodology for the design and evaluation of ontologies,” Workshop on Basic Ontological Issues in Knowledge Sharing in International Joint Conference on Artificial Intelligence, pp. 1-10, 1995.
[24] M. Uschold, and M. King, “Towards a methodology for building ontologies,” Workshop on Basic Ontological Issues in Knowledge Sharing in International Joint Conference on Artificial Intelligence, 1995.
[25] A. De Nicola, M. Missikoff, and R. Navigli, “A software engineering approach to ontology building,” Information Systems, vol. 34, pp. 258-275, 2009.
http://dx.doi.org/10.1016/j.is.2008.07.002
[26] F. Ensan, and W. Du, “Towards domain-centric ontology development and maintenance frameworks,” Nineteenth International Conference on Software Engineering & Knowledge Engineering, pp. 622-627, 2007.
[27] C. S. Lee, C. C. Jiang, and T. C. Hsieh, “A genetic fuzzy agent using ontology model for meeting scheduling system,” Information Sciences, vol. 176, pp. 1131-1155, 2006.
http://dx.doi.org/10.1016/j.ins.2005.07.012
[28] A. Gómez-Pérez, and D. Manzano-Macho, “An overview of methods and tools for ontology learning from texts,” The Knowledge Engineering Review, vol. 19, pp. 187-212, 2004.
[29] D. Sánchez, and A. Moreno, “Learning non-taxonomic relationships from web documents for domain ontology construction,” Data and Knowledge Engineering, vol. 64, pp. 600-623, 2008.
http://dx.doi.org/10.1016/j.datak.2007.10.001
[30] G. A. Miller, “WordNet: A lexical database for English,” Communications of the ACM, vol. 38, pp. 39-41, 1995.
http://dx.doi.org/10.1145/219717.219748
[31] Z. Dong, “Bigger Context and Better Understanding -- Expectation on Future MT Technology,” Proceedings of the International Conference on Machine Translation & Computer Language Information Processing, pp.17-25, 1999.
[32] K. Tan, H. Han, and R. Elmasri, “Web data cleansing and preparation for ontology extraction using WordNet,” In Proceedings of the First International Conference on Web Information Systems Engineering, pp. 11-18, 2000.
[33] J. Park, J. Nam, Q. Hu, and H. Suh, “Product ontology construction from engineering documents,” International conference on smart manufacturing application, pp. 305-310, 2008.
[34] J. C. Bezdek, and R. Ehrlich, “FCM: the fuzzy c-means clustering algorithm,” Computers & Geosciences, vol. 10, pp.191–203, 1984.
http://dx.doi.org/10.1016/0098-3004(84)90020-7
[35] A. Shiri, Introduction to Modern Information Retrieval, 2nd ed., Emerald Group Publishing Limited, CA, 2004.
[36] J. Kang, L. Min, Q. Luan, and X. Li, “Novel modified fuzzy c-means algorithm with applications,” Digital Signal Processing. Vol.19, pp. 309-319, 2009.
http://dx.doi.org/10.1016/j.dsp.2007.11.005
Full Text: PDF


