Possibility to operate with various document formats is a feature of the approach.
It is necessary to interpret terms describing format and structure of particular document to match ontologies and semistructured documents). Such terms are “table”, ”tuple”, ”date”, ”number” and so on. So special components which provide unificated access to documents with different format should be included to system architecture.
Such functionality can be realized by using “installable format driver”. It is components implementing predetermined interface, which allows access to document of a specific type. Implementation of such driver can be based on using patterns, samples, which make possible document structure recognition.
Conclusion The features of the developed approach are universality of its using, capability to integrate with existing document search systems, powerful intelligent capabilities. Mechanism of ontological clusterization together with functionality of particular information system makes possible effective document management both for documents which are generated in the system and documents which are imported from outer heterogeneous sources.
CASE-technology METAS developed by “Computing institute” makes possible effective using of described tools both during information system development process and when users operate with information system. Also these tools can be used in process of document interpretation to adapt a system dynamically.
The technology gives to user not only customization tools. It also provides components which are used to navigate through information objects, representing entities of data domain, and its relations. Object explorer displays object tree, which can be customized by users due to their information necessaries. Object tree is used not only for object preview, appropriate business operations can be run from the node too. Each document created in the system is represented in the database as an entity. So it is displayed in a tree node, which corresponds to the entity, and user can see it. A document can be reached in tree-walk by different ways according to user’s tasks.
User can include to the tree nodes intended for document clusterization. Such node can be associated with an ontology. So the same mechanism is used for working with documents imported from outer sources and clusterizated in the system.
Adding to CASE-system tools for analysis of semistructured documents, which are categorized on the base of ontologies created by developers and users, essentially reduce labor intensiveness of maintenance and customization.
Bibliography [Лядова, 2003] Л.Н. Лядова, С.А. Рыжков. CASE-технология METAS. В кн.: Математика программных систем.
Пермский государственный университет, Пермь, 2003. С. 4-18.
[Chuprina, 2004] S. Chuprina, V. Lanin, D. Borisova, S. Khaeva. Internet Intelligent Search System SmartFinder. In: Proc.
of the European Workshop on the Integration of Knowledge, Semantics and Digital Media Technology. KnowledgeBased Media Analysis for Self-Adaptive and Agile Multimedia Technology / The Royal Statistical Society, November 25-26, 2004, London, U.K. P. 151-156.
Authors' Information Vyacheslav Lanin – Perm state university, student of computer since department; Russia, 614990, Perm city, 15, Bukireva st.; e-mail: email@example.com Ludmila Lyadova – Institute of Computing, Deputy Director; 19/2-38, Podlesnaya St., Perm, Russia; e-mail: