diabiz
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
diabiz [2022/04/20 14:03] – [The domains covered:] madamczyk | diabiz [2023/09/27 09:49] (current) – pezik | ||
---|---|---|---|
Line 4: | Line 4: | ||
**DiaBiz corpus** is a dialog corpus comprising **recordings** and annotated **transcriptions** of **phone-based customer-agent interactions** in several key business domains. | **DiaBiz corpus** is a dialog corpus comprising **recordings** and annotated **transcriptions** of **phone-based customer-agent interactions** in several key business domains. | ||
+ | A general overview of the corpus can be found in this paper: | ||
+ | |||
+ | * Pęzik, Piotr, Gosia Krawentek, Sylwia Karasińska, | ||
+ | |||
+ | |||
+ | |||
+ | |||
+ | Also see the accompanying poster here: | ||
+ | * [[https:// | ||
=== The corpus comprises: === | === The corpus comprises: === | ||
- | * 4,036 conversations amounting to nearly 410 hours and over 3 million words | + | * 4,036 conversations amounting to nearly 410 hours and over 3.2 million words |
- | * dialogues between 5 professional | + | * dialogues between 5 call-center agents and 191 participants as customers |
* data from 9 business domains with high commercial demand for conversational analytics and automation solutions | * data from 9 business domains with high commercial demand for conversational analytics and automation solutions | ||
* dialogues based on 251 real-life interaction scenarios | * dialogues based on 251 real-life interaction scenarios | ||
Line 29: | Line 38: | ||
- | The data was **manually | + | The data was automatically automatically |
Line 53: | Line 62: | ||
=====Availability===== | =====Availability===== | ||
- | Click [[https:// | + | All the samples and supplementary materials available on this webpage are copyrighted. They are only included |
- | The current version of the recording catalog is available | + | Click [[https:// |
+ | |||
+ | The current version of the recording catalog is available [[https:// | ||
+ | |||
+ | For more information about the DiaBiz license for both commercial and scientific use, please contact piotr.pezik@uni.lodz.pl. | ||
- | For more information, | ||
=====Project Team==== | =====Project Team==== | ||
* Piotr Pęzik | * Piotr Pęzik | ||
Line 80: | Line 92: | ||
* Zuzanna Deckert | * Zuzanna Deckert | ||
* Piotr Górniak | * Piotr Górniak | ||
+ | * Konrad Kaczyński | ||
+ | * Łukasz Jałowiecki | ||
+ | |||
+ | |||
+ | =====DiaBiz EN===== | ||
+ | |||
+ | [[https:// | ||
+ | |||
=====Acknowledgments==== | =====Acknowledgments==== |
diabiz.1650456236.txt.gz · Last modified: 2022/04/20 14:03 by madamczyk