magistraleinformatica:ir:ir13:ir_project_2013

Differences

These are the differences between the selected revision and the current version of the page.

magistraleinformatica:ir:ir13:ir_project_2013 [28/11/2013 at 17:29 (11 years ago)] – [Using ''IRProjectHelper''] Marco Cornolti
magistraleinformatica:ir:ir13:ir_project_2013 [20/01/2014 at 10:40 (11 years ago)] (current version) – [Submitting your project] Marco Cornolti
Line 78:
   - Before testing your relatedness function, let's have a look at the [[http://ferrax-2.itc.unipi.it|scoreboard page]]. This page shows the achievements of the other groups. It also shows the baseline given by TagMe.
   - We are ready to launch. Enter: <code>
-java -cp $IRLIB:bin Main
+java -cp $IRLIB:./bin Main
 </code>On the first launch, the program will have to query Wikipedia and retrieve some data. Don't worry: this data gets cached, and if you run the program again, the output will be way smaller. Running the program again will generate the following output:<code>
 Results for the Evaluation of TagMe:
Line 115:
 
 ==== Using ''IRProjectHelper'' ====
-You can use ''irproject.IRProjectHelper'' to access some pre-computed data that you may find useful to develop your function. Note that you are not limited to these methods: if you need a new one, implement it!
+You can use ''irproject.IRProjectHelper'' to access some pre-computed data that you may find useful to develop your function. Note that you are not limited to these methods: if you need more methods, ask Marco. You may need to implement them!
 
-These are the methods provided by ''IRProjectHelper''
+To use ''IRProjectHelper'', please refer to the [[http://ferrax-2.itc.unipi.it/static/javadoc/index.html|javadoc]]
-<code java>
-public static int[] getInlinks(int page_id);
-public static int[] getOutlinks(int page_id);
-public static int TitleToId(String title);
-public static String getCategoryTitle(int catId);
-public static IntSet getAllWids();
-public static boolean isDisambiguation(int pageId);
-public static boolean isNormalPage(int page_id);
-public static boolean isPerson(int pageId);
-public static int[] getCategories(int pageId);
-public static int dereference(int pageId);
-public static float linkProbability(String anchor);
-public static float commonness(String anchor, int pageId);
-</code>
-Most of them are self-explanatory.
-  * ''getInlinks'' and ''getOutlinks'' respectively give the pages pointing to and pointed to by ''page_id''.
-  * ''TitleToId'' turns a page title into its ID.
-  * ''getAllWids'' returns the set of all the Wikipedia IDs.
-  * ''isDisambiguation'' tells you if ''pageId'' is a disambiguation page.
-  * ''isNormalPage'' tells you if ''pageId'' is a page describing one single concept.
-  * ''isPerson'' tries to guess if the page is about a person.
-  * ''getCategories'' returns all the categories ''pageId'' is part of.
-  * ''getCategoryTitle'' turns a category ID into its title.
-  * Some Wikipedia pages are redirects (e.g. Obama -> Barack_Obama). ''dereference'' turns a redirect link into its actual page.
-  * ''linkProbability'', given an anchor text, returns how many times that text appears as a link in the whole Wikipedia.
-  * ''commonness'', given an anchor and a page ID, returns how many times that anchor points to that page ID.
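As an illustration of how inlink data of the kind returned by ''getInlinks'' could feed a relatedness function, here is a minimal sketch. It does not call ''IRProjectHelper'': the class ''InlinkJaccard'', the method ''jaccard'' and the hard-coded ID arrays are hypothetical, and Jaccard overlap is only one possible choice, not the method prescribed by the project.

```java
import java.util.HashSet;
import java.util.Set;

// Hypothetical helper for illustration; NOT part of irproject.IRProjectHelper.
class InlinkJaccard {

    // Jaccard overlap (size of intersection / size of union) of two arrays
    // of page IDs, e.g. arrays shaped like the int[] result of getInlinks.
    static double jaccard(int[] inlinksA, int[] inlinksB) {
        Set<Integer> a = new HashSet<>();
        for (int id : inlinksA) {
            a.add(id);
        }
        Set<Integer> b = new HashSet<>();
        for (int id : inlinksB) {
            b.add(id);
        }

        Set<Integer> union = new HashSet<>(a);
        union.addAll(b);
        if (union.isEmpty()) {
            return 0.0; // assumption: two pages without inlinks are unrelated
        }

        a.retainAll(b); // a now holds the intersection
        return (double) a.size() / union.size();
    }

    public static void main(String[] args) {
        // Made-up page IDs standing in for real inlink arrays.
        int[] inlinksA = {1, 2, 3};
        int[] inlinksB = {2, 3, 4};
        System.out.println(jaccard(inlinksA, inlinksB));
    }
}
```

In the project, the two arrays would presumably come from calls such as ''IRProjectHelper.getInlinks(pageId)'', possibly after resolving redirects with ''dereference''.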
  
 ===== Submitting your project =====
 The submission will happen on Feb 9, 12:00 am. You have to leave in your home directory a ''Main.java'' that runs the experiments with your relatedness function and prints the results. We need to understand from the code how your function works. Please remove all unnecessary data and code from your home directory. If needed, please leave a short ''README'' explaining how to produce your results.
 
-You will make a pitch (5min presentation) on Feb 10, 9:30 @ Aula Seminari Ovest, quickly explaining your idea and results.
+You will make a pitch (5min presentation) on Feb 11, 9:30 @ Aula Seminari Ovest, quickly explaining your idea and results.
  
  
Line 158 (previous revision), Line 132 (current version):
   * We encourage the development of good ideas rather than good results (but we like good results!)
   * Numbers are big: do not engineer, but be careful with complexity
-  * Tools like ''scp''[[http://linux.die.net/man/1/scp]] and ''sshfs''[[http://linux.die.net/man/1/sshfs]] may make your life easier.
+  * Tools like [[http://linux.die.net/man/1/scp|scp]] and [[http://linux.die.net/man/1/sshfs|sshfs]] may make your life easier.
   * There could be bugs: contact Marco in case something is not working as you expect.
   * You are responsible for what happens with your account: keep it secret, keep it safe, and don't misuse it.
magistraleinformatica/ir/ir13/ir_project_2013.1385659749.txt.gz · Last modified: 28/11/2013 at 17:29 (11 years ago) by Marco Cornolti
