
Søkeapplikasjon i skyene (Search application in the clouds)

Standahl, Stian
Master thesis
View/Open
656459_ATTACHMENT01.zip (190.7Mb)
656459_FULLTEXT01.pdf (735.0Kb)
656459_COVER01.pdf (131.3Kb)
URI
http://hdl.handle.net/11250/253444
Date
2013
Collections
  • Institutt for datateknologi og informatikk [7444]
Abstract
This thesis focuses on how to process and store big data in the cloud, with special attention to the challenges of building an information retrieval system and to how distributed information retrieval methods can be used in the cloud. After three cloud platforms were evaluated, Windows Azure was chosen because its free trial offered more hardware resources than the others, and because it had an emulator that could be used to set up the system locally before testing it in the cloud. A search engine also had to be chosen, but since Windows Azure was the preferred platform, the choice was limited to search engines written in the .NET languages. I ended up with Lucene.NET because it is a powerful search tool. In addition, Lucene.NET is open source.

The evaluation was done on a distributed information retrieval system with a server-client setup that used partial indexes distributed out to the clients. It used a small data set to find optimization problems that have to be addressed when creating a distributed system that handles large amounts of data. I carried out four evaluations on four different clients.

The results revealed optimization problems that are specific to the cloud and that have to be addressed when creating a distributed system that processes and stores big data in the cloud. Also, since scaling systems in the cloud is easier, the recommendation was that scaling of the clients should depend on how much Azure Cache is left on the clients, due to an optimization problem related to the search speed of the search engine.

With some more tweaking and with these optimization problems solved, the cloud should provide an advantageous place to process and store big data.
Publisher
Institutt for datateknikk og informasjonsvitenskap