Seminar: 10.33.348 Linguistic factors affecting speech realizations - Details

Seminar: 10.33.348 Linguistic factors affecting speech realizations - Details

You are not logged into Stud.IP.

General information

Course name Seminar: 10.33.348 Linguistic factors affecting speech realizations
Subtitle
Course number 10.33.348
Semester SoSe2025
Current number of participants 7
expected number of participants 15
Home institute Institute of Dutch Studies
Courses type Seminar in category Teaching
Next date Monday, 28.04.2025 10:00 - 12:00, Room: A09 0-018
Type/Form S
Pre-requisites In this practice (Übung) course, you will learn a series of linguistic factors that are known/said to affect speech realizations, and you will learn how to calculate them from a corpus (with python) and apply to linguistic data (with R). We learn a few of such factors every week. After each session, you need to submit your own script of the week and complete a web-based quiz. The final grade will be determined based on attendance (30%), the submitted script (40%), and the weekly-quiz (30%). You will get 3 CPs upon successful completion of the course. You can also join the course even when you do not pursue for a grade or a CP, though you will be encouraged to join the discussion and complete the assignments in the course. [IMPORTANT:] You need to have your own laptop, on which python and R can run.
Learning organisation In this practice (Übung) course, you will learn how to calculate a series of factors affecting speech realizations from a corpus and how to apply them to linguistic data, using python and R. For example, think about the following questions:

1. When do you think you may not pronounce a word clearly or even omit it? It is said to have something to do with probability of words. How do you calculate such probability from a corpus?
2. Some studies, however, suggest that sometimes you nevertheless clearly pronounce words in certain situations. When? It is said to have something to do with morphology (structures of words). How do you extract such morphological information of words from a corpus?
3. Sometimes you pronounce a certain word shorter or longer than usual. How do you calculate words' duration from a corpus in the first place?

By the end of the course, you will learn the basic techniques to extract necessary information from corpora, so that you will be able to calculate and obtain linguistic variables such as above, when needed, from corpora. In each session, participants will discuss a certain task/question/objective together. This course will be organized in such a way to focus on interaction among participants, and therefore, you will be highly encouraged to join an active discussion. Remember that there are usually (almost always) multiple different ways of solving the same problem. You will be encouraged to find your own solution to a certain problem. After each session, you need to submit your script file of the week, which you have worked on during the week. You will additionally need to finish a web-based quiz with a few simple questions related to the topics of the week.
Below are possible variables to learn in the course (subject to change):

- Frequency/probability measures
- Information-theoretic measures
- Morphological variables
- Neighborhood density
- Family size
- Prosodic boundaries
- Coarticulation
- Stress
- Speech rate
- Semantic similarity
- Semantic supprot for word forms

[IMPORTANT] You need to bring your own laptop with the internet connection, on which python and R must be able to run. Below are some useful websites for installing and setting up python and R for common operating systems. Please install python and R ready to use before the course begins.

R for Linux:
https://cran.r-project.org/bin/linux/ubuntu/fullREADME.html
https://dh-r.lincolnmullen.com/installing-r-and-packages.html

R for Mac OS:
https://dh-r.lincolnmullen.com/installing-r-and-packages.html
https://pages.cms.hu-berlin.de/EOL/gcg_quantitative-methods/HowTo_r-on-macos.html

R for Windows:
https://cran.r-project.org/bin/windows/base/rw-FAQ.html
https://www.dataquest.io/blog/installing-r-on-your-computer/
https://teacherscollege.screenstepslive.com/a/1108074-install-r-and-rstudio-for-windows
https://dh-r.lincolnmullen.com/installing-r-and-packages.html

Python for Linux:
https://docs.python-guide.org/starting/install3/linux/
https://www.geeksforgeeks.org/how-to-install-python-on-linux/

Python for Mac OS:
https://docs.python-guide.org/starting/install3/osx/
https://www.dataquest.io/blog/installing-python-on-mac/
https://www.geeksforgeeks.org/how-to-install-python-on-mac/

Python for Windows:
https://www.digitalocean.com/community/tutorials/install-python-windows-10
https://learn.microsoft.com/en-us/windows/python/beginners
https://www.geeksforgeeks.org/how-to-install-python-on-windows/

For python, I strongly recommend to use the conda environment (Miniconda/Anaconda) (mamba is also fine), because the system also uses python (especially on Mac OS and Linux where python is usually pre-installed for that reason) and arbitrary modifitions in python itself and its modules from the user-side may compromise certain functions of the whole system. With virtual environments, you can make an environment for yourself, and if something goes wrong, you can demolish the environment and create another new environment, while the system-wide python (and its modules) are untouched.

Conda for Linux:
https://docs.conda.io/projects/conda/en/stable/user-guide/install/linux.html
https://medium.com/@mustafa_kamal/a-step-by-step-guide-to-installing-conda-in-ubuntu-and-creating-an-environment-d4e49a73fc46

Conda for Mac OS:
https://docs.conda.io/projects/conda/en/stable/user-guide/install/macos.html
https://medium.com/@sophieowen_40339/how-to-install-conda-and-create-virtual-environments-on-mac-m1-a3a15093820b

Conda for Windows:
https://docs.conda.io/projects/conda/en/stable/user-guide/install/windows.html
https://www.geeksforgeeks.org/how-to-install-conda-in-windows/
Performance record The final grade will be determined based on attendance (30%), submission of the script after each session (40%), and performance on the weekly-quiz (30%). You will get 3 CPs upon successful completion of the course.
You will have an option of pursuing for 6 CPs, instead of 3 CPs. For 6 CPs, you will get an additional assignment of calculating linguistic variables from a corpus, using python and the Jupyter notebook. You will need to figure out how to calculate these variables based on what you will have learned in the course, calculate them, explain the processes and functions you employed with text in a jupyter notebook script. The deadline of submission of the assignment is now planned to be the end of August.

You can also join the course without pursuing for grades and/or CPs. Even if you will not need a grade or a CP, you will still be encouraged to join a dicussion, submit the script of the week, and complete the web-based quiz, so that you can learn/practice as much and as effectively as possible.
Lehrsprache englisch
ECTS points 3 / 6

Rooms and times

A09 0-018
Monday: 10:00 - 12:00, weekly (12x)

Module assignments

Comment/Description

In this practice (Übung) course, you will learn a series of linguistic factors that are known/said to affect speech realizations, and you will learn how to calculate them from a corpus (with python) and apply to linguistic data (with R). We learn a few of such factors every week. After each session, you need to submit your own script of the week and complete a web-based quiz. The final grade will be determined based on attendance (30%), the submitted script (40%), and the weekly-quiz (30%). You will get 3 CPs upon successful completion of the course. You can also join the course even when you do not pursue for a grade or a CP, though you will be encouraged to join the discussion and complete the assignments in the course. [IMPORTANT:] You need to have your own laptop, on which python and R can run.