Vis enkel innførsel

dc.contributor.authorBjørklund, Truls A.nb_NO
dc.date.accessioned2014-12-19T13:38:06Z
dc.date.available2014-12-19T13:38:06Z
dc.date.created2011-12-06nb_NO
dc.date.issued2011nb_NO
dc.identifier462035nb_NO
dc.identifier.isbn978-82-471-2838-1 (printed version)nb_NO
dc.identifier.isbn978-82-471-2840-4 (electronic version)nb_NO
dc.identifier.urihttp://hdl.handle.net/11250/252735
dc.description.abstractSearch engines and database systems both play important roles as we store and organize ever increasing amounts of information and still require the information to be easily accessible. Research on these two types of systems has traditionally been partitioned into two fields, information retrieval and databases, and the integration of these two fields has been a popular research topic. Rather than attempting to integrate the two fields, this thesis begins with a comparison of the technical similarities between search engines and a specific type of database system often used in decision support systems: column stores. Based on an initial assessment of the technical similarities, which includes an evaluation of the feasibility of creating a hybrid system that supports both workloads, the papers in this thesis investigate how the identi_ed similarities can be used as a basis for improving the effciency of the different systems. To improve the efficiency of processing decision support workloads, the use of inverted indexes as an alternative to bitmap indexes is evaluated. We develop a query processing framework for compressed inverted indexes in decision support workloads and find that it outperforms state-of-the-art compressed bitmap indexes by being significantly more compact, and also improves the query processing e_ciency for most queries. Keyword search in social networks with access control is also addressed in this thesis, and a space of solutions is developed along two axes. One of the axes defines the set of inverted indexes that are used in the solution, and the other defines the meta-data used to filter out inaccessible results. With a exible and efficient search system based on a column-oriented storage system, we conduct a thorough set of experiments that illuminate the trade-offs between different extremes in the solution space. We also develop a hybrid scheme in between two of the best extremes. The hybrid approach uses cost models to find the most efficient solution for a particular workload. Together with an effcient query processing framework based on our novel HeapUnion operator, this results in a system that is e_cient for a wide range of workloads that consist of updates and searches with access control in a social network.nb_NO
dc.languageengnb_NO
dc.publisherNorges teknisk-naturvitenskapelige universitet, Fakultet for informasjonsteknologi, matematikk og elektroteknikk, Institutt for datateknikk og informasjonsvitenskapnb_NO
dc.relation.ispartofseriesDoktoravhandlinger ved NTNU, 1503-8181; 2011:148nb_NO
dc.relation.haspartBjørklund, Truls A.; Gehrke, Johannes; Torbjørnsen, Øystein. A Confluence of Column Stores and Search Engines: Opportunities and Challenges. Using Search Engine Technology for Information Management (USETIM) 2009, 2009.nb_NO
dc.relation.haspartBjørklund, Truls A.; Grimsmo, Nils; Gehrke, Johannes; Øystein, Torbjørnsen. Inverted Indexes vs. Bitmap Indexes in Decision Support Systems. Proceedings of 18th ACM Conference on Information and Knowledge Management (CIKM'09): 1509-1512, 2009. <a href='http://dx.doi.org/10.1145/1645953.1646158'>10.1145/1645953.1646158</a>.nb_NO
dc.relation.haspartBjørklund, Truls Amundsen; Götz, Michaela; Gehrke, Johannes. Search in social networks with access control. KEYS '10 Proceedings of the 2nd International Workshop on Keyword Search on Structured Data, 2010. <a href='http://dx.doi.org/10.1145/1868366.1868370'>10.1145/1868366.1868370</a>.nb_NO
dc.relation.haspartBjørklund, Truls A.; Götz,, Michaela; Gehrke, Johannes; Grimsmo, Nils. Workload-Aware Indexing for Keyword Search in Social Networks. , 2011.nb_NO
dc.relation.haspartGrimsmo, Nils; Bjørklund, Truls Amundsen; Hetland, Magnus Lie. Fast Optimal Twig Joins. Proceedings of the VLDB Endowment, 2010.nb_NO
dc.relation.haspartGrimsmo, Nils; Bjørklund, Truls Amundsen; Hetland, Magnus Lie. Linear Computation of the Maximum Simultaneous Forward and Backward Bisimulation for Node-Labeled Trees. proceedings of the 7th International XML Database Symposium, 2010.nb_NO
dc.titleColumn Stores versus Search Engines and Applications to Search in Social Networksnb_NO
dc.typeDoctoral thesisnb_NO
dc.contributor.departmentNorges teknisk-naturvitenskapelige universitet, Fakultet for informasjonsteknologi, matematikk og elektroteknikk, Institutt for datateknikk og informasjonsvitenskapnb_NO
dc.description.degreePhD i Informasjonsteknologinb_NO
dc.description.degreePhD in Information Technologyen_GB


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel