Filtered Indexing: An Alternate Indexing Mechanism for De-Duplication
Main Article Content
Abstract
Years ago internet was merely used to retrieve information. Later on paradigm shifted and internet came a long way of providing different types of services to the users. Today cloud computing has become the buzz word. Cloud describes a new supplement, consumption and a delivery model for internet based services. Cloud further offers virtualized services over the internet. When the data is spread wide across the cloud, duplication becomes inevitable. It is virtually impossible to eliminate duplication. At present, there is a vast amount of duplicated data or redundant data in storage systems. Data de-duplication can eliminate multiple copies of the same file and duplicated segments or chunks of data within those files. Instead we can avoid redundancy through data de-duplication. Data de-duplication is a technique where in the redundant data is deleted keeping only the unique copy of the data. Current issue for data de duplication is to avoid full-chunk indexing to identify the incoming data is new, which is time consuming process.Thereby improving storage utilization. In current scenario Full chunk indexing is a major issue over the cloud. In this paper we propose an efficient indexing mechanism using the filtered index databases. In this paper first we divide the variable length chunks using the sliding window. Then each chunk is given a chunk ID using a hash function. The disk storage is much less than that required by a table and search time is much reduced with the use of filtered index databases.
Â
Â
Keywords: Cloud computing, Data De-duplication, Full Chunk Indexing, Filtered index.
Downloads
Article Details
COPYRIGHT
Submission of a manuscript implies: that the work described has not been published before, that it is not under consideration for publication elsewhere; that if and when the manuscript is accepted for publication, the authors agree to automatic transfer of the copyright to the publisher.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work
- The journal allows the author(s) to retain publishing rights without restrictions.
- The journal allows the author(s) to hold the copyright without restrictions.