• A generic and flexible Framework for focusing Search at Yahoo! Shopping 

      Eriksen, Trond Øivind; Korsen, Anne Siri (Master thesis, 2006)
      Information retrieval is concerned with extraction of documents from a collection, according to the user's information need. The ranking returned by a search engine is determined by the relevance function in use. The amount ...
    • A Metadata Approach to Predicting Twitter User Geolocation 

      Sandve, Sigurd (Master thesis, 2016)
      As the volume of available data created from social media grows, creating value out of this information becomes an interesting challenge. This work presents an approach to identify a users country of origin from just looking ...
    • Advances in Databases and Information Systems (ADBIS) 2016 & 2017 

      Nørvåg, Kjetil; Ivanovic, Mirjana; Kirikova, Marite; Thalheim, Bernhard (Journal article; Peer reviewed, 2019)
    • Applying temporal dependence to detect changes in streaming data 

      Duong, Quang-Huy; Ramampiaro, Heri; Nørvåg, Kjetil (Journal article; Peer reviewed, 2018)
      Detection of changes in streaming data is an important mining task, with a wide range of real-life ap- plications. Numerous algorithms have been proposed to efficiently detect changes in streaming data. However, the ...
    • Authoritative K-Means for Clustering of Web Search Results 

      He, Gaojie (Master thesis, 2010)
      Clustering is currently more and more applied on hyperlinked documents, especially for web search results. Although most commercial web search engines will provide their ranking algorithms sorting the matched results to ...
    • Authorship Identification of Research Papers 

      Skoglund, Simen (Master thesis, 2015)
      Authorship identification is a technique used to identify anonymous documents by identifying and extracting an authors stylometric features. The focus of this thesis is to apply an authorship identification technique, ...
    • Automated tuning of MapReduce performance in Vespa Document Store 

      Grythe, Knut Auvor (Master thesis, 2007)
      MapReduce is a programming model for distributed processing, originally designed by Google Inc. It is designed to simplify the implementation and deployment of distributed programs. Vespa Document Store (VDS) is a distributed ...
    • Automatic Document Timestamping 

      Gumpen, Kristoffer Berg; Nygard, Øyvind (Master thesis, 2017)
      When searching for information, the temporal dimension of the results is an important fac-tor regarding the information quality. Using temporal intent as a condition when searchingfor information is a field that is gaining ...
    • Continuous Queries on Streaming Data 

      Norrhall, Sara Phrida K. (Master thesis, 2018)
      We are all living in a world that is becoming more digital for every day. While people are connecting to the Internet, there has been an explosion of applications that are being used on a daily basis. All of these applications ...
    • Continuously adapting continuous Queries for Data Streams in Raincoat 

      Stenersen, Steffen Rendahl; Grønnbeck, Ken Oscar (Master thesis, 2013)
      In the last decade, the world wide web has grown from being a platform where users passively viewed content, to an active platform where the users themselves contributed with new content. With this came an explosion of ...
    • Cumulative Citation Recommendation 

      Roligheten, Christian Barth (Master thesis, 2018)
      Keeping knowledge bases such as Wikipedia up-to-date with the latest information is a difficult task in the information age: Every day thousands of news articles, blog posts, opinions are published on the Internet and if ...
    • Database Operations on Multi-Core Processors 

      Liknes, Stian (Master thesis, 2013)
      The focus of this thesis is on investigating efficient database algorithmsand methods for modern multi-core processors in main memory environments.We describe central features of modern processors in a historic perspectivebefore ...
    • Density Guarantee on Finding Multiple Subgraphs and Subtensors 

      Duong, Quang-Huy; Ramampiaro, Heri; Nørvåg, Kjetil (Peer reviewed; Journal article, 2021)
      Dense subregion (subgraph & subtensor) detection is a well-studied area, with a wide range of applications, and numerous efficient approaches and algorithms have been proposed. Approximation approaches are commonly used ...
    • Density-Based Spatial Clustering with Application and Noise with Spark 

      Harper, Vegard (Master thesis, 2015)
      With increasing number of devices that being connected to the Internet every day, analyzing the increasing amount data is being generated will be more and more important to the process. The data that are being generated ...
    • Detecting Influential Events 

      Domben, Ingrid Seip (Master thesis, 2021)
      I en verden hvor sosiale medier har blitt brennhett og genererer enorme mengder data er det enormt potensiale i forsking og analysiering av denne dataen. Hva folk tenker og mener om ting, hvor de befinner seg og hva som ...
    • Distributed distance-preserving graph approximations 

      Sund, Arne Lyngstad (Master thesis, 2020)
      Det er en voksende interesse for forskning og analyse av store grafer. Disse grafene kan være grafer man blant annet finner i sosiale medier, hvordan nettsider linker til hverandre og hvordan smitte sprer seg i globale ...
    • Diversifying Top-k Point-of-Interest Queries via Collective Social Reach 

      Maropaki, Stella; Chester, Sean; Doulkeridis, Christos; Nørvåg, Kjetil (Chapter, 2020)
      By "checking into'' various points-of-interest (POIs), users create a rich source of location-based social network data that can be used in expressive spatio-social queries. This paper studies the use of popularity as a ...
    • Document Dating Using Temporal Information Extracted From Wikipedia 

      Aalen, Didrik Pemmer (Master thesis, 2018)
      An important dimension in several information retrieval systems is the temporal dimension. In information retrieval systems, one aspect of the temporal dimension is the time of creation for the documents in the system. ...
    • Dokument-klynging (document clustering) 

      Galåen, Magnus (Master thesis, 2008)
      As document searching becomes more and more important with the rapid growth of document bases today, document clustering also becomes more important. Some of the most commonly used document clustering algorithms today, are ...
    • Durability in a data-flow storage system 

      Ek, Lars Martin Bævre (Master thesis, 2018)
      Traditional database systems do not meet the throughput demands of today's web applications. Mitigation strategies on the form of intricate cache hierarchies and manual view materialization solve parts of the performance ...