登录    注册    忘记密码

详细信息

An Efficient Approach for Building Compressed Full-text Index for Structured Data  ( CPCI-S收录 EI收录)  

文献类型:会议论文

英文题名:An Efficient Approach for Building Compressed Full-text Index for Structured Data

作者:Liang, Jun[1];Xiao, Lin[2];Zhang, Di[3]

通讯作者:Liang, J[1]

机构:[1]Beijing Union Univ, Training Ctr Elect Informat, Beijing, Peoples R China;[2]Beijing Union Univ, Coll Appl Sci & Technol, Beijing, Peoples R China;[3]Chinese Acad Sci, Inst Software, Beijing, Peoples R China

第一机构:北京联合大学工科综合实验教学示范中心

通讯机构:[1]corresponding author), Beijing Union Univ, Training Ctr Elect Informat, Beijing, Peoples R China.|[1141791]北京联合大学工科综合实验教学示范中心;[11417]北京联合大学;

会议论文集:4th International Conference on Computer Sciences and Convergence Information Technology

会议日期:NOV 24-26, 2009

会议地点:Seoul, SOUTH KOREA

语种:英文

外文关键词:self-index; compressed full-text index; data compression; structured data

摘要:The self-index is a kind of highly compressed, self-contained full-text index. It is designed for indexing plain texts in order to reduce its permanent storage, as well as to enhance searching performance. Apart from being a sequence of characters, usually the text has specific internal structure. The data record, as a basic model of structured data, is therefore employed to represent and organize such form of data widespread. In this paper, we design and implement an approach to building the self-index for data records via text medium. Our approach indexes the data records through an intermediate text which accommodates aligned record fields by stuffing delimiters among them. By theoretical analysis, we give the upper bounds of permanent space of our approach in a worst case. In addition, we report a series of experimental results to validate the correctness and efficiency of the proposed approach.

参考文献:

正在载入数据...

版权所有©北京联合大学 重庆维普资讯有限公司 渝B2-20050021-8 
渝公网安备 50019002500408号 违法和不良信息举报中心