Skip to main content
WorldCist'18 - 6th World Conference on Information Systems and Technologies

Full Program »

An approach for Knowledge Extraction from Source Code (KNESC) of Typed Programming Languages

Knowledge extraction is the discovery of knowledge from structured and/or unstructured sources. This knowledge can be used to build or enrich a domain ontology. Source code is rarely used. But implementation platforms evolve faster than business logic and these evolutions are usually integrated directly into source code without updating the conceptual model. In this paper, we present a generic approach for knowledge extraction from source code of typed programming languages using Hidden Markov Model. This approach consist of the definition of the HMM so that it can be used to extract any type of knowledge from the source code. The method is experimented on EPICAM and GeoServer developed in Java and on MapServer developed in C/C++. Structural evaluation shows that source code contains a structure that permit to build a domain ontology and functional evaluation shows that source code contains more knowledge than those contained in both databases and meta-models.

Azanzi Jiomekong
University of Yaoundé I, UMMISCO, Faculty of Science, Yaoundé, Cameroon

Gaoussou Camara
Université Alioune Diop de Bambey, Bambey, Sénégal


Powered by OpenConf®
Copyright ©2002-2017 Zakon Group LLC