Principles of the St. Petersburg Phonological School in Speech Corpora Design

Authors

Keywords:

Phonetics, Phonology, Speech corpus, Speech annotation, St. Petersburg phonological school

Abstract

The paper discusses the main principles in designing and annotating speech corpora within the framework of the Saint Petersburg phonological school, and provides examples of using corpus data in phonetic research. One of the major principles that we follow is to analyse the speech material at all levels: from segmental to intonational, including speech disfluencies. During segmental phonetic annotation, we suggest listening to each speech sound in isolation (without knowing its context) and relying on spectrographic data. At the syllabic tier, it is crucial to reflect resyllabification. During prosodic annotation, we suggest to rely on listener’s perception of the intonation pattern first, then analyse the actual melodic curves. A speech corpus with multi-level annotation that follows these principles is a valuable source of phonetic data — as segmental and prosodic factors are in constant interaction with each other, and one cannot analyse units of one annotation tier without reference to other tiers.

Downloads

Download data is not yet available.

References

Prilozhenie №3 k Byulletenyu Foneticheskogo fonda russkogo yazyka. Fond zvukovyh edinits russkoi rechi [Appendix # 3 to the Bulletin of the Russian Phonetic Fund]. Russian Phonetic Fund. St. Petersburg - Bochum, 1993.

BONDARKO, L. V.; SVETOZAROVA, N. D.; SKRELIN, P. A. Foneticheskii fond russkogo yazyka kak issledovatel’skaya programma kafedry fonetiki Leningradskogo universiteta [Russian Phonetic Fund as a Research Program of Department of Phonetics, Leningrad University]. Byulleten’ Foneticheskogo fonda russkogo yazyka, St. Petersburg - Bochum, n.4, 1992.

BONDARKO, L. V.; VERBITSKAYA, L. A. (ed.) Interferenciya zvukovyx sistem [Cross-Language Influence of Sound Systems], Leningrad: Izdatel’stvo LGU, 1987.

BONDARKO, L. V.; VERBITSKAYA, L. A.; GORDINA, M. V.; KASEVICH, V. B. Stili proiznosheniya i tipy proizneseniya [Styles and Types of Pronunciation]. Voprosy yazykoznaniya, Moscow, n. 2. pp.64-70, 1974.

CHISTOVICH, L. A.; BONDARKO, L. V. Ob upravlenii artikulyatsionnymi organami v processe rechi [About Controlling Articulatory Organs in Speech Production]. In: Issledovalia po strukturnoj tipologii [Research in Structural Typology]. Мoscow: Nauka, pp.169-182, 1963.

CHODROFF, E.; WILSON, C. Structure in Talker-Specific Phonetic Realization: Covariation of Stop Consonant VOT in American English. Journal of Phonetics, v. 61, pp.30-47, 2017.

EVDOKIMOVA, V.; EVGRAFOVA, K.; CHUKAEVA T. The Database of Normal and Pathological Singers’ Voices: An Approach to Collecting Data. In: The 10th International Workshop Models and Analysis of Vocal Emissions for Biomedical Applications (MAVEBA), 10., 2017, Florence. Proceedings [...]. Florence: Firenze University Press, 2017. pp.23-24.

EVDOKIMOVA, V.; KOCHAROV, D.; SKRELIN, P. Method for Constructing Formants for Studying Phonetic Characteristics of Vowels. SPIIRAS Proceedings, v. 19(2), pp.302-329, 2020.

EVDOKIMOVA, V.; SKRELIN, P.; CHUKAEVA, T. Automatic Phonetic Transcription for Russian: Speech Variability Modeling. In: International Conference on Speech and Computer (SPECOM), 19, 2017, Hatfield. Proceedings [...]. Springer International Publishing, 2017. pp.192-199.

EVDOKIMOVA, V.; ZAKHARCHENKO, E.; SKRELIN, P.; EVGRAFOVA, K.; CHUKAEVA, T.; SHVALEV, N. Akusticheskie xarakteristiki golosa v rechi i penii opernyx pevczov v norme i pri patologii [Acoustic Characteristics of Voice in Speech and Singing of Opera Singer’s for Normal and Pathological Voice]. In: Interdisciplinary Seminar on Conversational Russian Speech Analysis, 8., 2019, Saint Petersburg. Proceedings […], Saint Petersburg: Polytechnika-print, 2019. pp.21-30.

EVGRAFOVA, K.; EVDOKIMOVA, V.; CHUKAEVA, T.; SKRELIN, P. Vocal Fatigue in Voice Professionals: Collecting Data and Acoustic Analysis. In: Tutorial and Research Workshop on Experimental Linguistics (EXLING 2016), 7., 2016, Saint-Petersburg. Proceedings [...], Saint-Petersburg: Saint Petersburg State University, 2016. pp.59-62.

GAROFOLO, J.; LAMEL, L.; FISHER, W.; FISCUS, J.; PALLETT, D.; DAHLGREN, N.; ZUE, V. TIMIT Acoustic-Phonetic Continuous Speech Corpus, 1993.

KACHKOVSKAIA, T.; CHUKAEVA, T.; EVDOKIMOVA, V.; KHOLIAVIN, P.; KRIAKINA, N.; KOCHAROV, D.; MAMUSHINA, A.; MENSHIKOVA, A.; ZIMINA, S. SibLing Corpus of Russian Dialogue Speech Designed for Research on Speech Entrainment. In: Conference on International Language Resources and Evaluation (LREC 2020), 12., Marseille. Proceedings [...], Marseille: ELRA, 2020. pp.6556-6561.

KACHKOVSKAIA, T.; KOCHAROV, D.; SKRELIN, P.; VOLSKAYA, N. CoRuSS—A New Prosodically Annotated Corpus of Russian Spontaneous Speech. In: Conference on International Language Resources and Evaluation (LREC 2016), 10., 2016, Portorož. Proceedings [...], Portorož: ELRA, 2016. pp.1949-1954.

KACHKOVSKAIA, T.; MAMUSHINA, A.; PORTNOVA, A. Typical and Rare Post-Nuclear Melodic Movements in Russian. In: Speech Prosody, 10., 2020, Tokyo. Proceedings [...], Tokyo: ISCA, 2020. pp.464-468.

KACHKOVSKAIA, T., MENSHIKOVA, A.; KOCHAROV, D.; KHOLIAVIN, P.; MAMUSHINA, A. Social and Situational Factors of Speaker Variability in Collaborative Dialogues. In: Speech Prosody, 11., 2022, Lisbon. Proceedings [...], Lisbon: ISCA, 2022. pp.455-459

KACHKOVSKAIA, T.; SKRELIN, P. Prosodic Phrasing in Russian Spontaneous and Read Speech: Evidence from Large Speech Corpora. In: Speech Prosody, 10., 2020, Tokyo. Proceedings [...], Tokyo: ISCA, 2020. pp.166-170.

KACHKOVSKAIA, T.; VOLSKAYA, N.; SKRELIN, P. Final Lengthening in Russian: A Corpus-Based Study. In: Interspeech 2013, 14., Lyon. Proceedings [...], Lyon: ISCA, 2013. pp.1438-1442.

KOCHAROV, D.; KACHKOVSKAIA, T.; SKRELIN, P. Prosodic Boundary Detection Using Syntactic and Acoustic Information. Computer Speech and Language, v. 53, pp.231-241, 2019a.

KOCHAROV, D.; KACHKOVSKAIA, T.;SKRELIN, P. Prosodic Factors Influencing Vowel Reduction in Russian. In: Interspeech 2019, 20., Graz. Proceedings [...], Graz: ISCA, 2019b. pp.1956-1960.

KOCHAROV, D.; VOLSKAYA, N.; SKRELIN, P. F0 Declination in Russian Revisited. In: International Congress of Phonetic Sciences (ICPHS), 18., 2015, Glasgow. Proceedings [...], Glasgow: International Phonetic Association, 2015.

KOCHETKOVA, U.; SKRELIN, P.; EVDOKIMOVA, V.; NOVOSELOVA, D. Perception of Irony in Speech. In: International Conference on Neurobiology of Speech and Language, 4., 2020, Saint Petersburg. Proceedings [...], Saint Petersburg: Skifia-Print, 2020. pp.72-73.

KOCHETKOVA, U.; SKRELIN, P.; EVDOKIMOVA, V.; NOVOSELOVA, D. The Speech Corpus for Studying Phonetic Properties of Irony. In: Language, Music and Gesture: Informational Crossroads, 2021, Saint Petersburg. Proceedings [...], Springer International Publishing, 2021. pp.203-214.

LIBERMAN, M. Corpus Phonetics. Annual Review of Linguistics, 5, pp.91-107, 2019.

MAKAROVA, V. A.; USENKOVA, E. V.; EVDOKIMOVA, V. V.; EVGRAFOVA, K. V. Yazyk saskachevanskix duxoborov: vvedenie v analiz [The Language of the Saskatchevan Doukhobors: Introduction and Analysis]. Izvestiya vysshix uchebnyx zavedenij. Seriya «Gumanitarnye nauki». Razdel lingvistika [New of Higher School. Humanities. Linguistics], Ivanovo, v. 2, n. 2, pp.146-152, 2011.

MENSHIKOVA, A.; KOCHAROV, D.; KACHKOVSKAIA, T. Phonetic Entrainment in Cooperative Dialogues: A Case of Russian. In: Interspeech 2020, 21., Shanghai. Proceedings [...], Shanghai: ISCA, 2020. pp.4148-4152.

O’CONNOR, J. D.; ARNOLD, G. F., Intonation of Colloquial English. Bristol, U.K.: Longman Group Ltd., 1973.

OSTENDORF, M.; PRICE P. J.; SHATTUCK-HUFNAGEL, S., The Boston University Radio News Corpus, Boston University Technical Report No. ECS-95-001, 1995.

PANAYOTOV, V.; CHEN, G.; POVEY, D.; KHUDANPUR, S. Librispeech: an ASR Corpus Based on Public Domain Audio Books. In: 2015 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 40., 2015, Brisbane. Proceedings [...], Brisbane: IEEE, pp.5206-5210.

PAPERNO, S.; LEED, R.L. Vocabulary Words in Elementary Russian Textbooks. Slavic and Eats European languages journal, v. 32, n. 2, 1988. pp.305-312

SKRELIN, P. Concatenative Russian Speech Synthesis: Sound Database Formation Principles. In: International Conference on Speech and Computer (SPECOM), 2., 1997. Cluj-Napoca. Proceedings [...], Cluj-Napoca: Editura Promedia Plus, 1997a.

SKRELIN, P. A. (ed.) Skazki Russkogo Severa [Tales of the North of Russia]. In: Byulleten` foneticheskogo fonda russkogo yazyka. Prilozhenie 6 [The Bulletin of the Russian Phonetic Fund. Appendix 6]. Saint Petersburg - Bochum, 1997b.

SKRELIN, P. A. (ed.) Obryadovaya poeziya Russkogo Severa: plachi [Poetic Folklore of the North of Russia (Lamentations)]. In: Byulleten` foneticheskogo fonda russkogo yazyka. Prilozhenie 6 [The Bulletin of the Russian Phonetic Fund. Appendix 6], Saint Petersburg - Bochum, 1998.

SKRELIN, P. A. Segmentaciya i transkripciya [Segmentation and Transcription], Saint Petersburg: Saint Petersburg State University, 1999.

SKRELIN, P. Russian Material and Methods. In: DE SILVA, V.; ULLAKONOJA, R. (ed.) Phonetics of Russian and Finnish, Frankfurt am Main: Peter Lang, 2009.

SKRELIN, P. A.; KOCHETKOVA, U. E.; EVDOKIMOVA, V. V.; NOVOSELOVA, D. D.; GERMAN, R. D. Prosodicheskie xarakteristiki ironicheskix vyskazyvanij v russkom i franczuzskom yazykax [Prosodic Features of Ironic Utterances in Russian and French]. In: Interdisciplinary Seminar on Conversational Russian Speech Analysis, 9., 2021, Saint Petersburg. Proceedings […], Saint Petersburg: Skifia-Print, 2021. pp.81-86.

SKRELIN, P.; VOLSKAYA, N.; KOCHAROV, D.; EVGRAFOVA, K.; GLOTOVA, O.; EVDOKIMOVA, V. A Fully Annotated Corpus of Russian Speech. In: Conference on International Language Resources and Evaluation (LREC 2010), 7., 2010, Valletta. Proceedings [...], Valletta: ELRA, 2010. pp.109-112.

SVETOZAROVA, N. Zhirmunsky’s Collection of German Folk Songs in the Sound Archives of the Pushkinsky Dom. In: Archives of the Languages of Russia. Saint Petersburg - Groningen, 1996. pp.33-38.

TURK, A.; NAKAI, S.; SUGAHARA, M. Acoustic Segment Durations in Prosodic Research: A Practical Guide. Methods. In: Empirical Prosody Research, Berlin, Boston: De Gruyter, pp.1-28, 2012.

VOLSKAYA, N.; KACHKOVSKAIA, T. Prosodic Annotation in the New Corpus of Russian Spontaneous Speech CoRuSS. In: Speech Prosody, 8., 2016, Boston. Proceedings [...], Boston: ISCA, 2016. pp.917-921.

WANG, C.; RIVIERE, M.; LEE, A.; WU, A.; TALNIKAR, C.; HAZIZA, D.; WILLIAMSON, M.; PINO, J.; DUPOUX, E. VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation. In: ACL 2021 (Volume 1: Long Papers), Bangkok, Proceedings […], Bangkok: ACL, 2021. pp.993-1003.

Published

2023-06-20

How to Cite

Skrelin, P., Kachkovskaia, T., Kocharov, D., Evdokimova, V., & Kochetkova, U. (2023). Principles of the St. Petersburg Phonological School in Speech Corpora Design . Bakhtiniana. Revista De Estudos Do Discurso, 18(2), Port. 203–225 / Eng. 205. Retrieved from https://revistas.pucsp.br/index.php/bakhtiniana/article/view/55822

Issue

Section

Articles