Meeting with Andrii and Kyle

Version 3 - Updated on 21 Nov 2017 at 8:29PM by Joachim Hansen

Description

  1. hits and misses should be in the results 
  2. the methodology section should be rearenged and (maby present the reseach question differently?... maybe present 1 for theoretical and one for practical)
  3. \par could indent paragrahs and make them more readable.
  4. Should fix some sections
  5. some tables are missing content
  6. Should note that documented functionality might not work. Experiments should be done to verify... future work... could point to my tables of comparison as something to start from.
  7. I should create a flowchart (Setup environment -D setup search engines -D Aquire data set -> General prepoccing -> bulk files folder -> lines and size -- search engine spesific prepcocing -> solr prepcconses bulk file folder,elasticsearch processed  bulk file folder -> search engine spesfic indexing ->  searching.
  8. The flowchart can also be helpful in the presentation. Code may not be presentable in a presentation setting.
  9. The flowchart can be generated in paint or with drawio
  10. I should have some sections that describes the following sections
  11. Kyle reccomended that I keep the code description in the thesis and move the code to appendix. Both said that I need to find the balance between what should be in the thesis and what should be in the appendix.  
  12. I should have a similar appendix structure to the rest of the thesis so that it makes sense with respect to section names and referancing and that the reader knows that we are talking about Elasticsearch and prepcoccing or Solr and indexing.
  13. The andrii-malware dataset... should be renamed to something more academic like windows P32 malware.  
  14. Can have ID in brackets  e.g, [49] with respect to dataset names
  15. I should use the same units, so far I have table of both ms values and min:sec:ms... I should get all units in min:sec:ms
  16. I could argue that I pick the enron dataset due to financial fraud (email dataset).
  17. proof reading 4th Desember