New Generation Computing, 23(2005)291-313
Ohmsha, Ltd. and Springer
Received 29 January 2004
Revised manuscript received 25 August 2004
The need for efficient and flexible information retrieval over multi-structural data stored in databases and on networks is growing significantly. Flexibility in particular plays a key role in acquiring the relevant information that users want during retrieval. However, most existing approaches are each dedicated to a single type of content and data structure, e.g., relational databases or natural language text. In this work, we propose the "Multi-Structure Information Retrieval" (MSIR) approach, which is applicable to various types of content and data structures by adapting only a small part of the approach to each data structure. The power of this approach comes from its use of invariant feature information obtained from byte patterns in files through a mathematical transformation. An experimental evaluation of the proposed approach on both artificial and real data indicates its high feasibility.
Keywords: Information Retrieval, Flexibility, Mathematical Transformation, Invariance, Similarity.