Differences

These are the differences between the selected revision and the current version of the page.

mds:txa:start [23/11/2023 at 15:11 (20 months ago)] – [Lecture Notes] Laura Pollacci
mds:txa:start [09/12/2024 at 10:48 (7 months ago)] (current version) – [Previous editions] Laura Pollacci
Line 1: Line 1:
-<html> 
-<!-- Google Analytics --> 
-<script type="text/javascript" charset="utf-8"> 
-(function(i,s,o,g,r,a,m){i['GoogleAnalyticsObject']=r;i[r]=i[r]||function(){ 
-(i[r].q=i[r].q||[]).push(arguments)},i[r].l=1*new Date();a=s.createElement(o), 
-m=s.getElementsByTagName(o)[0];a.async=1;a.src=g;m.parentNode.insertBefore(a,m) 
-})(window,document,'script','//www.google-analytics.com/analytics.js','ga'); 
- 
-ga('create', 'UA-34685760-1', 'auto', 'personalTracker', {'allowLinker': true}); 
-ga('personalTracker.require', 'linker'); 
-ga('personalTracker.linker:autoLink', ['pages.di.unipi.it', 'enforce.di.unipi.it', 'didawiki.di.unipi.it', 'luciacpassaro.github.io'] );     
-ga('personalTracker.require', 'displayfeatures'); 
-ga('personalTracker.send', 'pageview', 'courses/txa/'); 
-setTimeout("ga('send','event','adjusted bounce rate','30 seconds')",30000);  
-</script> 
-<!-- End Google Analytics --> 
-<!-- Global site tag (gtag.js) - Google Analytics --> 
-<script async src="https://www.googletagmanager.com/gtag/js?id=G-LPWY0VLB5W"></script> 
-<script> 
-  window.dataLayer = window.dataLayer || []; 
-  function gtag(){dataLayer.push(arguments);} 
-  gtag('js', new Date()); 
- 
-  gtag('config', 'G-LPWY0VLB5W'); 
-</script> 
-<!-- Capture clicks --> 
-<script> 
-jQuery(document).ready(function(){ 
-  jQuery('a[href$=".pdf"]').click(function() { 
-    var fname = this.href.split('/').pop(); 
-    ga('personalTracker.send', 'event',  'TXA', 'PDFs', fname); 
-  }); 
-  jQuery('a[href$=".r"]').click(function() { 
-    var fname = this.href.split('/').pop(); 
-    ga('personalTracker.send', 'event',  'TXA', 'Rs', fname); 
-  }); 
-  jQuery('a[href$=".zip"]').click(function() { 
-    var fname = this.href.split('/').pop(); 
-    ga('personalTracker.send', 'event',  'TXA', 'ZIPs', fname); 
-  }); 
-}); 
-</script> 
-</html> 
 ====== Text Analytics (635AA) A.Y. 2023/24 ======
 
Line 49: Line 6:
 [[https://laurapollacci.github.io/txa.html|Laura Pollacci]] (laura.pollacci [at] di [dot] unipi [dot] it)
 
-Office hours:
+Office hours:
  
  
 ==== Schedule ====
 
-^ Day ^ Hour ^ Room ^
+^ Day ^ Hour ^ Room ^
 | Thursday | 16-18 | Fib C1 |
 | Friday | 11-13 | Fib M1 |
Line 62: Line 19:
  
 ==== Objectives ====
-The course targets text analytics systems and applications to respond to business problems by discovering and presenting knowledge that is otherwise locked in textual form.
+The course targets text analytics systems and applications to respond to business problems by discovering and presenting knowledge that is otherwise locked in textual form.
 The main objectives of the course are:
   - Learning essential techniques, algorithms, and models used in natural language processing.
-  - Understanding of the architectures of typical text analytics applications and of libraries for building them.
+  - Understanding of the architectures of typical text analytics applications and of libraries for building them.
   - Expertise in design, implementation, and evaluation of applications that exploit analysis, interpretation, and transformation of texts.
  
Line 92: Line 49:
 | 2023/09/22 | Refresher on probability. | [[https://drive.google.com/file/d/1fH8sjhnh9dlPcPMwpAYSbsP0tbCaamSV/view?usp=sharing| 2 - Reminds on probability]]|
 | 2023/09/28 | Introduction to Python. | [[https://drive.google.com/file/d/1fOn73KfDqlaU-0dgXs4-qkIbm8ZCg8Px/view?usp=sharing| 3 - Introduction to Python]]| [[https://drive.google.com/file/d/16BIcJuP4vB5b5oUmV03R7fX_-wRaFI8Y/view?usp=sharing | L3 - Introduction_to_Python.ipynb]] |
-| 2023/09/29 | Introduction to Python - part 2. Project and Dates | [[https://drive.google.com/file/d/11E-3DWARykKVZDuB1vuDoXySAPPWYFoq/view?usp=sharing| 4 - Project and Dates]]|
+| 2023/09/29 | Introduction to Python - part 2. Project and Dates | [[https://drive.google.com/file/d/11E-3DWARykKVZDuB1vuDoXySAPPWYFoq/view?usp=sharing| 4 - Project and Dates]]|
 | 2023/10/05 | Probabilistic language models| [[https://drive.google.com/file/d/1Nj6FgcBSK9otmJwjDj2bxWWulCzPlHZb/view?usp=drive_link|5 - Probabilistic language models]]| D. Jurafsky, J.H. Martin. [[ https://drive.google.com/file/d/1K3B0s0-T3NnpfgmR6NGsZdwWqGoa0S5Q/view?usp=drive_link|Ch3]] [[https://drive.google.com/file/d/13r6wn4jlrOncZ0zUc5efmu2RgqDGUz2g/view?usp=drive_link|L5 Probabilistic Language Model.ipynb]] |
 | 2023/10/06| Text Indexing: Strings, Regular Expressions and BS4. | [[https://drive.google.com/file/d/1Zp6vqh5Wj9YzwtpcgMSxm7NUZ_oN8SW7/view?usp=sharing| 6 - Text indexing 1]] | D. Jurafsky, J.H. Martin. [[https://drive.google.com/file/d/1SH4Em84AEHNzc6OzrhjvW_ggo_0nJiOx/view?usp=sharing|Ch2]]  [[https://drive.google.com/file/d/13miwALDtad7ERoObFnlPjeUYBaAfwZGF/view?usp=sharing|L6.1 - Strings Regular expressions and BS4.ipynb]]|
Line 107: Line 64:
 |2023/11/16| Topic Modeling | [[https://drive.google.com/file/d/1M7EMWkYfqDWZjf6W22yIVJLK0QbJTh_v/view?usp=sharing|12 - Topic Modeling]] | Zhai and Massung (2016) Text Data Management and Analysis. [[https://drive.google.com/file/d/1Cwzon44c0-7b_4bbHyUO6ArolacQFY_5/view?usp=sharing|Chp 17]]. [[https://drive.google.com/file/d/1-Iyz860uAII3pplAk_VMqi5gK5N_S4pD/view?usp=sharing |L.12 -Topic Modeling - Notebook.]]. [[https://drive.google.com/file/d/1H60PV4Wt5gRs_B6MB4J2YJ-gsiySf6lv/view?usp=sharing|L.12.1 - Topic Modeling pyLDAvis - Notebook]]|
 |2023/11/17| A primer on Neural Networks |[[https://drive.google.com/file/d/1MS7upbsydqkPMIRfYv9pKHXz2mfGb1ST/view?usp=sharing |13 - A primer on Neural Networks]] |
-|2023/11/23|Neural Networks | [[https://drive.google.com/file/d/13tQ1m-ogPR3R_PSAWLDomvPmsBal8E55/view?usp=sharing | 14 - Neural Networks]] |
+|2023/11/23|Neural Networks | [[https://drive.google.com/file/d/13tQ1m-ogPR3R_PSAWLDomvPmsBal8E55/view?usp=sharing | 14 - Neural Networks]] | [[https://drive.google.com/file/d/1ZP9WN4OTSw2VoO7jWIpJlWBh_oGFwxjN/view?usp=sharing| From SVM to NN, Classification with Keras - Notebooks.]] |
 +|2023/11/24| Neural Language Models | [[https://drive.google.com/file/d/1vezeT7l6Wd9D0otEYXSAjg0ih1XoggmW/view?usp=sharing| 15 - Neural Language Models]]| D. Jurafsky, J.H. Martin. Chps. [[https://drive.google.com/file/d/10SjSlr4bk6jBWTEkA4vsTUomB8y4iJ-C/view?usp=sharing|7]] [[https://drive.google.com/file/d/1MkfAsC-rY6HuWM6ZTS1TB8LoLxN-sPPq/view?usp=sharing|9]] [[https://drive.google.com/file/d/1P3j4qTH6IH_R42huYLL83cvPd1Ci2Ar1/view?usp=sharing|11]] | 
 +|2023/11/30| Student project presentations: ongoing experiments. Neural Language Models Practice | [[https://drive.google.com/file/d/1Dc0l2zQfX9poOymZKrhYiHMUiv9TT7m_/view?usp=sharing|16 - Neural Language Models Word2Vec]]| [[https://drive.google.com/file/d/14BIROGvYzNjbmmVzZqeiY-tLkhRAR8tW/view?usp=sharing |Word2vec - Notebook.]]| 
 +|2023/12/01| Student project presentations: ongoing experiments. Neural Language Models Practice | [[https://drive.google.com/file/d/1R4Yfr5v8ygsK61dV-h-mZhU_iY0OuZmK/view?usp=sharing|17 - Neural Language Models Doc2Vec]]|[[https://drive.google.com/file/d/1JaGXJE-rF3Yvmtd1Je8NCdDapLiL17Pg/view?usp=sharing|Doc2Vec - Notebook]]| 
 +|2023/12/07| Neural Language Models - part 2 |[[https://drive.google.com/file/d/1QxmavpSIjX1x46UkNR1RflY64Sbc3vLs/view?usp=sharing|Neural Language Models - part 2]]| 
 +|2023/12/11| BERT. Project Submission |[[https://drive.google.com/file/d/1JX6HCObZYtLUApYJDl1ftDTl5nKn-aHi/view?usp=sharing| 19 - Bert]]. [[https://drive.google.com/file/d/1GOwUTqWnkONM-SI8D0JANGKuqX0pBp35/view?usp=sharing|Project Submission]]| [[ https://drive.google.com/file/d/1JX6HCObZYtLUApYJDl1ftDTl5nKn-aHi/view?usp=sharing|Bert - Notebooks]] | 
 +|2023/12/14| Advanced Topics | [[ https://drive.google.com/file/d/14zg2w7-s_cpIJQBwGfXoj_yfjZNZLYQh/view?usp=sharing |20 - Advanced Topics]]| Recommended chapters: D. Jurafsky, J.H. Martin. [[https://drive.google.com/file/d/1ik_BGxKUNAi5GwQZQv4vI9Gqvv4wkWK9/view?usp=sharing|20]];[[https://drive.google.com/file/d/1VJbNelq63EagAxdgleJu2isJVBBb_vkl/view?usp=sharing|24]].| 
 ==== Exam ====
 
Line 116: Line 80:
  
  
-** Non-Attending students **
+** Non-Attending students **
 
 The exam for non-attending students will consist of a written exam with open questions and exercises, and an oral discussion on the topics of the course.
Line 132: Line 96:
  
 Further bibliography will be indicated in the materials for the individual lessons.
-==== Previous editions ====
 
-  * [[http://didawiki.di.unipi.it/doku.php/mds/txa/start?rev=1671529070|2022-2023]]
+
+==== Previous editions ====
+  * [[http://didawiki.di.unipi.it/doku.php/mds/txa/start?rev=1730200187|2022-2023]]
   * [[http://didawiki.cli.di.unipi.it/doku.php/mds/txa/start?rev=1649067582|2021-2022]]
   * [[http://didawiki.di.unipi.it/doku.php/mds/txa/start?rev=1612257498|2020-2021]]
mds/txa/start.1700752273.txt.gz · Last modified: 23/11/2023 at 15:11 (20 months ago) by Laura Pollacci
