New Generation Computing, 23(2005)291-313
Ohmsha, Ltd. and Springer

Multi-structure Information Retrieval Method Based on Transformation Invariance

Fuminori ADACHI, Takashi WASHIO, Atsushi FUJIMOTO and Hiroshi MOTODA
ISIR, Osaka University
8-1 Mihogaoka, Ibaraki-shi, Osaka 567-0047 Japan

{adachi,washio,fujimoto,motoda}@ar.sanken.osaka-u.jp
Hidemitsu HANAFUSA
The Kansai Electric Power Co., Inc.
3-6-16, Nakano-shima, Kita-ku, Osaka-shi, Osaka, 530-8270 Japan

hanafusa.hidemitsu@b2.kepco.co.jp

Received 29 January 2004
Revised manuscript received 25 August 2004

Abstract

The needs of efficient and flexible information retrieval on multi-structural data stored in database and network are significantly growing. Especially, its flexibility plays one of the key roles to acquire relevant information desired by users in retrieval process. However, most of the existing approaches are dedicated to a single content and data structure respectively, e.g., relational database and natural text. In this work, we propose "Multi-Structure Information Retrieval" (MSIR) approach applicable to various types of contents and data structures by adapting a small part of the approach to data structures. The power of this approach comes from the use of the invariant feature information obtained from byte patterns in the files through some mathematical transformation. The experimental evaluation of the proposed approach for both artificial and real data indicates its high feasibility.

Keywords:Information Retrieval, Flexibility, Mathematical Transformation, Invariance, Similarity.

[Back]