
SpokesBiz - an Open Corpus of Conversational Polish.

Piotr Pęzik, Sylwia Karasińska, Anna Cichosz, Łukasz Jałowiecki, Konrad Kaczyński, Małgorzata Krawentek, Karolina Walkusz, Paweł Wilk, Mariusz Kleć, Krzysztof Szklanny, Szymon Marszałkowski

arXiv (Cornell University) (2023)

Abstract
This paper announces the early release of SpokesBiz, a freely available corpus of conversational Polish developed within the CLARIN-BIZ project and comprising over 650 hours of recordings. The transcribed recordings have been diarized and manually annotated for punctuation and casing. We outline the general structure and content of the corpus, showcasing selected applications in linguistic research and in the evaluation and improvement of automatic speech recognition (ASR) systems.
Key words
Multilingualism