Document Type
Article
Author ORCID Identifier
Curtis Dyreson https://orcid.org/0000-0003-0236-1515
Amani Shatnawi https://orcid.org/0000-0002-5096-9405
Sourav S. Bhowmick https://orcid.org/0000-0003-1957-8016
Vishal Sharma https://orcid.org/0000-0002-5054-5522
Journal/Book Title/Conference
Proceedings of the ACM on Management of Data
Volume
2
Issue
3
Publisher
Association for Computing Machinery
Publication Date
5-30-2024
Journal Article Version
Version of Record
First Page
1
Last Page
27
Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.
Abstract
JSON keyword search searches the current versions of documents in a collection. However, JSON documents change over time due to edits. Some applications, such as data forensics and auditing, need to search past versions of documents and for changes to documents. This paper introduces a system called Temporal JSON Keyword Search (Tjks) for search in a collection of JSON documents that vary over time. Tjks lets users control which temporal slice, or part of the history, can be searched using a temporal search semantics; we support both of the major temporal semantics: sequenced and nonsequenced search. This paper presents the semantics of temporal JSON keyword search, discusses an efficient implementation, and evaluates the implementation. Our extensions are largely orthogonal to specific keyword search techniques, so this research provides a blueprint for extending keyword search to include time and potentially other kinds of metadata.
Recommended Citation
Curtis Dyreson, Amani Shatnawi, Sourav S. Bhowmick, and Vishal Sharma. 2024. Temporal JSON Keyword Search. Proc. ACM Manag. Data 2, 3, Article 177 (June 2024), 27 pages. https://doi.org/10.1145/3654980