An automatic ontology-driven approach for the semantic annotation of documents
SHIRI is an automatic ontology-driven and unsupervised approach for the semantic annotation of documents which contain more or less structured parts. The aim of this project is to build an integration system which allows the user access to documents related to a specific domain. In this system, the querying process is guided by an ontology of the domain and the answers are only made of the pertinent parts of the documents unlike keywords-based search engines. The ontology describing the domain of interest is defined using a set of concepts, their properties, their relations and the associated cardinalities and it is described using RDFS (Resource Description Framework Schema) language.
The annotation step exploits
a set of metadata and a set of logical rule patterns which are automatically instanciated from the domain description. Some of these metadata provide from the ontology and others are defined specifically for the annotation task. The resulting annotations are represented in RDF (Resource Description Framework) language.
Research activities
Information integration Semantic Web
Participants
More information : http://wwwdi.supelec.fr/~bennacer/SHIRI.htm