mds:txa:start [23/11/2023 at 15:11 (20 months ago)] – [Lecture Notes] Laura Pollacci | mds:txa:start [09/12/2024 at 10:48 (7 months ago)] (current version) – [Previous editions] Laura Pollacci |
====== Text Analytics (635AA) A.Y. 2023/24 ======

[[https://laurapollacci.github.io/txa.html|Laura Pollacci]] (laura.pollacci [at] di [dot] unipi [dot] it)

Office hours:

==== Schedule ====

^ Day ^ Hour ^ Room ^
| Thursday | 16-18 | Fib C1 |
| Friday | 11-13 | Fib M1 |

==== Objectives ====
The course covers text analytics systems and applications that address business problems by discovering and presenting knowledge that is otherwise locked in textual form.
The main objectives of the course are:
- Learning the essential techniques, algorithms, and models used in natural language processing.
- Understanding the architectures of typical text analytics applications and the libraries for building them.
- Gaining expertise in the design, implementation, and evaluation of applications that exploit the analysis, interpretation, and transformation of texts.

| 2023/09/22 | Review of probability. | [[https://drive.google.com/file/d/1fH8sjhnh9dlPcPMwpAYSbsP0tbCaamSV/view?usp=sharing|2 - Reminds on probability]] | |
| 2023/09/28 | Introduction to Python. | [[https://drive.google.com/file/d/1fOn73KfDqlaU-0dgXs4-qkIbm8ZCg8Px/view?usp=sharing|3 - Introduction to Python]] | [[https://drive.google.com/file/d/16BIcJuP4vB5b5oUmV03R7fX_-wRaFI8Y/view?usp=sharing|L3 - Introduction_to_Python.ipynb]] |
| 2023/09/29 | Introduction to Python - part 2. Project and Dates | [[https://drive.google.com/file/d/11E-3DWARykKVZDuB1vuDoXySAPPWYFoq/view?usp=sharing|4 - Project and Dates]] | |
| 2023/10/05 | Probabilistic language models | [[https://drive.google.com/file/d/1Nj6FgcBSK9otmJwjDj2bxWWulCzPlHZb/view?usp=drive_link|5 - Probabilistic language models]] | D. Jurafsky, J.H. Martin. [[https://drive.google.com/file/d/1K3B0s0-T3NnpfgmR6NGsZdwWqGoa0S5Q/view?usp=drive_link|Ch3]] [[https://drive.google.com/file/d/13r6wn4jlrOncZ0zUc5efmu2RgqDGUz2g/view?usp=drive_link|L5 Probabilistic Language Model.ipynb]] |
| 2023/10/06 | Text Indexing: Strings, Regular Expressions and BS4. | [[https://drive.google.com/file/d/1Zp6vqh5Wj9YzwtpcgMSxm7NUZ_oN8SW7/view?usp=sharing|6 - Text indexing 1]] | D. Jurafsky, J.H. Martin. [[https://drive.google.com/file/d/1SH4Em84AEHNzc6OzrhjvW_ggo_0nJiOx/view?usp=sharing|Ch2]] [[https://drive.google.com/file/d/13miwALDtad7ERoObFnlPjeUYBaAfwZGF/view?usp=sharing|L6.1 - Strings Regular expressions and BS4.ipynb]] |
| 2023/11/16 | Topic Modeling | [[https://drive.google.com/file/d/1M7EMWkYfqDWZjf6W22yIVJLK0QbJTh_v/view?usp=sharing|12 - Topic Modeling]] | Zhai and Massung (2016) Text Data Management and Analysis. [[https://drive.google.com/file/d/1Cwzon44c0-7b_4bbHyUO6ArolacQFY_5/view?usp=sharing|Chp 17]]. [[https://drive.google.com/file/d/1-Iyz860uAII3pplAk_VMqi5gK5N_S4pD/view?usp=sharing|L.12 - Topic Modeling - Notebook]]. [[https://drive.google.com/file/d/1H60PV4Wt5gRs_B6MB4J2YJ-gsiySf6lv/view?usp=sharing|L.12.1 - Topic Modeling pyLDAvis - Notebook]] |
| 2023/11/17 | A primer on Neural Networks | [[https://drive.google.com/file/d/1MS7upbsydqkPMIRfYv9pKHXz2mfGb1ST/view?usp=sharing|13 - A primer on Neural Networks]] | |
| 2023/11/23 | Neural Networks | [[https://drive.google.com/file/d/13tQ1m-ogPR3R_PSAWLDomvPmsBal8E55/view?usp=sharing|14 - Neural Networks]] | [[https://drive.google.com/file/d/1ZP9WN4OTSw2VoO7jWIpJlWBh_oGFwxjN/view?usp=sharing|From SVM to NN, Classification with Keras - Notebooks]] |
| 2023/11/24 | Neural Language Models | [[https://drive.google.com/file/d/1vezeT7l6Wd9D0otEYXSAjg0ih1XoggmW/view?usp=sharing|15 - Neural Language Models]] | D. Jurafsky, J.H. Martin. Chps. [[https://drive.google.com/file/d/10SjSlr4bk6jBWTEkA4vsTUomB8y4iJ-C/view?usp=sharing|7]] [[https://drive.google.com/file/d/1MkfAsC-rY6HuWM6ZTS1TB8LoLxN-sPPq/view?usp=sharing|9]] [[https://drive.google.com/file/d/1P3j4qTH6IH_R42huYLL83cvPd1Ci2Ar1/view?usp=sharing|11]] |
| 2023/11/30 | Student project presentations: ongoing experiments. Neural Language Models Practice | [[https://drive.google.com/file/d/1Dc0l2zQfX9poOymZKrhYiHMUiv9TT7m_/view?usp=sharing|16 - Neural Language Models Word2Vec]] | [[https://drive.google.com/file/d/14BIROGvYzNjbmmVzZqeiY-tLkhRAR8tW/view?usp=sharing|Word2vec - Notebook]] |
| 2023/12/01 | Student project presentations: ongoing experiments. Neural Language Models Practice | [[https://drive.google.com/file/d/1R4Yfr5v8ygsK61dV-h-mZhU_iY0OuZmK/view?usp=sharing|17 - Neural Language Models Doc2Vec]] | [[https://drive.google.com/file/d/1JaGXJE-rF3Yvmtd1Je8NCdDapLiL17Pg/view?usp=sharing|Doc2Vec - Notebook]] |
| 2023/12/07 | Neural Language Models - part 2 | [[https://drive.google.com/file/d/1QxmavpSIjX1x46UkNR1RflY64Sbc3vLs/view?usp=sharing|Neural Language Models - part 2]] | |
| 2023/12/11 | BERT. Project Submission | [[https://drive.google.com/file/d/1JX6HCObZYtLUApYJDl1ftDTl5nKn-aHi/view?usp=sharing|19 - Bert]]. [[https://drive.google.com/file/d/1GOwUTqWnkONM-SI8D0JANGKuqX0pBp35/view?usp=sharing|Project Submission]] | [[https://drive.google.com/file/d/1JX6HCObZYtLUApYJDl1ftDTl5nKn-aHi/view?usp=sharing|Bert - Notebooks]] |
| 2023/12/14 | Advanced Topics | [[https://drive.google.com/file/d/14zg2w7-s_cpIJQBwGfXoj_yfjZNZLYQh/view?usp=sharing|20 - Advanced Topics]] | Recommended chapters: D. Jurafsky, J.H. Martin. [[https://drive.google.com/file/d/1ik_BGxKUNAi5GwQZQv4vI9Gqvv4wkWK9/view?usp=sharing|20]]; [[https://drive.google.com/file/d/1VJbNelq63EagAxdgleJu2isJVBBb_vkl/view?usp=sharing|24]]. |
==== Exam ====

**Non-attending students**

The exam for non-attending students consists of a written exam with open questions and exercises, followed by an oral discussion of the course topics.

Further reading will be listed with the material for each lesson.

==== Previous editions ====

* [[http://didawiki.di.unipi.it/doku.php/mds/txa/start?rev=1730200187|2022-2023]]
* [[http://didawiki.cli.di.unipi.it/doku.php/mds/txa/start?rev=1649067582|2021-2022]]
* [[http://didawiki.di.unipi.it/doku.php/mds/txa/start?rev=1612257498|2020-2021]]