spokes_documentation

Differences

This shows you the differences between two versions of the page.

spokes_documentation [2017/02/01 16:38] pezik
spokes_documentation [2023/08/18 15:19] (current) – [REST API] pezik
Line 3: Line 3:
 ======Spokes PL======
  
-This page contains the documentation for the [[http://spokes.clarin-pl.eu/|Spokes PL conversational search engine]]. Spokes PL currently gives access to a corpus of 2 319 291 words (247 580 utterances) of conversational Polish.
+This page contains the documentation for the [[http://spokes.clarin-pl.eu/|Spokes PL conversational search engine]]. Spokes PL currently gives access to a corpus of 2 319 291 words (247 580 utterances) of conversational Polish, which makes it a unique resource for scholars, researchers and engineers interested in the spoken register of Polish.
  
 +
 +Please make sure you cite Spokes properly: 
 +
 +[[http://www.ep.liu.se/ecp/article.asp?issue=116&volume=&article=009|
 +Pęzik, Piotr. “Spokes – a Search and Exploration Service for Conversational Corpus Data.” In Selected Papers from the CLARIN 2014 Conference, October 24-25, 2014, Soesterberg, The Netherlands, 99–109. Linköping Electronic Conference Proceedings. Linköping University Electronic Press, Linköpings universitet, 2015]].
 +
 +Here is a BibTeX record:
 +
 +<code>
 +@inproceedings{pezik_spokes_2015,
 + series = {Linköping {Electronic} {Conference} {Proceedings}},
 + title = {Spokes – a search and exploration service for conversational corpus data},
 + copyright = {CC-BY-NC},
 + isbn = {978-91-7685-954-4},
 + url = {http://www.ep.liu.se/ecp_article/index.en.aspx?issue=116;article=009},
 + abstract = {Spokes is an online service for conversational corpus data search and exploration, currently developed as part of CLARIN-PL – the Polish CLARIN infrastructure. This paper describes the data sets currently available through Spokes, the architecture of the service and the data and metadata search functionality it provides to its users. We also introduce some of the more experimental features which have been developed to facilitate more advanced research on multimodal conversational corpora.},
 + booktitle = {Selected {Papers} from {CLARIN} 2014},
 + publisher = {Linköping University Electronic Press, Linköpings universitet},
 + author = {Pęzik, Piotr},
 + year = {2015},
 + pages = {99--109}
 +}
 +</code>
 =====SlopeQ syntax=====
  
Line 523: Line 546:
  
 ;#;
-''[[http://spokes.clarin-pl.eu/#search/pl/spokes/%3Cpos%3Dnoun%3Asubst%3A.%2B%3Ainst%3A.%2B%3E/-1/0/100/-1/1/1000/0/-1/1000/noun.*/-1,1/4/true/0/-1/-1/-1/-1/-1/-1|<pos=verb:fin:pl:.*>]]''
+''[[http://spokes.clarin-pl.eu/#search/pl/spokes/%3Cpos%3Dverb%3Afin%3Apl%3A.*%3E/-1/0/20/-1/1/1000/0/-1/1000/noun.*/-1,1/4/true/0/-1/-1/-1/-1/-1/-1|<pos=verb:fin:pl:.*>]]''
 ;#;
  
Line 585: Line 608:
  
 ;#;
-''[[http://spokes.clarin-pl.eu/#search/pl/spokes/<lemma=zdać pos=verb:fin:sg:.*>/-1/0/20/-1/-1/-1/-1/time_aligned desc/1000/noun.*/-1,1/4/-1/-1/-1/-1|<lemma=zdać pos=verb:fin:sg:.*>]]''
+''[[http://spokes.clarin-pl.eu/#search/pl/spokes/%3Clemma%3Dzdać%20pos%3Dverb%3Afin%3Asg%3A.*%3E/-1/0/20/-1/1/1000/0/-1/1000/noun.*/-1,1/4/true/0/-1/-1/-1/-1/-1/-1|<lemma=zdać pos=verb:fin:sg:.*>]]''
 ;#;
  
Line 619: Line 642:
  
 ;#;
-''[[http://clarin.pelcra.pl/Spokes/#search/pl/spokes/(<lemma=słuchać> <pos=.*:gen:.*>)=1/-1/0/20/-1/-1/-1/-1/-1/1000/noun.*/-1,1/4/-1/-1/-1/-1|(<lemma=słuchać> <pos=.*:gen:.*>)=1]]''
+''[[http://spokes.clarin-pl.eu/#search/pl/spokes/%3Clemma%3Dsłuchać%3E%20%3Cpos%3D.*%3Agen%3A.*%3E/-1/0/20/-1/1/1000/0/-1/1000/noun.*/-1,1/4/true/1/-1/-1/-1/-1/-1/-1|<lemma=słuchać> <pos=.*:gen:.*> (Slop=1)]]''
 ;#;
  
Line 630: Line 653:
 |6|  nie Zuźka jest odporna jak|  słuchaj od  |Marcela i Patrycji się nie zaraziła a one non stop chore są w domu osiemnaście stopni mam a ze na wierzchu śpi nogi jak lodek zimne |
 |7|  no i|  słuchaj poprosiłam studentów  |z pierwszego roku no i parę osób mi wysłało no i tak jak mi parę osób wysłało no to wiesz |
 +
 +
 +
 +=====REST API=====
 +
 +The REST API of Spokes PL makes it possible to search and extract the entire contents of the corpus. 
 +
 +  * To get the complete list of transcriptions, see [[http://spokes.clarin-pl.eu/#explore/br/0/50|this link]].
 +  * Here is how you can get the [[http://212.191.73.242:8098/api/fulltext?id=0154&text_id=0154&offset=0&limit=20&filter=%20&orderBy=seq&orderDir=asc&isFirstRun=1|list of all utterance turns]] in a single text (here, the text with id 0154).
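
The utterance-turn request linked above can also be issued from a script. The following is a minimal Python sketch, not an official client: the endpoint and parameter names are copied from that link, while the function names `build_fulltext_url` and `fetch_turns_raw` are hypothetical. It assumes that `id` and `text_id` always carry the same text identifier and that `offset` and `limit` page through the turns. The response format is not described on this page, so the body is returned as undecoded text.

```python
from urllib.parse import urlencode, quote
from urllib.request import urlopen

# Endpoint taken from the documentation link above.
FULLTEXT_ENDPOINT = "http://212.191.73.242:8098/api/fulltext"


def build_fulltext_url(text_id, offset=0, limit=20):
    """Build the request URL for the utterance turns of one text.

    Parameter names and default values mirror the example link.
    Assumption: `id` and `text_id` both hold the same identifier.
    """
    params = {
        "id": text_id,
        "text_id": text_id,
        "offset": offset,
        "limit": limit,
        "filter": " ",        # a single space, as in the example link
        "orderBy": "seq",
        "orderDir": "asc",
        "isFirstRun": 1,
    }
    # quote_via=quote encodes the space as %20 (as in the link), not '+'.
    return FULLTEXT_ENDPOINT + "?" + urlencode(params, quote_via=quote)


def fetch_turns_raw(text_id, offset=0, limit=20, timeout=10):
    """Fetch one page of turns and return the response body as text.

    Assumption: the body is UTF-8 encoded; its structure is not
    documented here, so no parsing is attempted.
    """
    url = build_fulltext_url(text_id, offset, limit)
    with urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8")
```

Calling `build_fulltext_url("0154")` reproduces the URL of the example link; `fetch_turns_raw("0154")` would then retrieve the first 20 turns, provided the service is reachable. Pass the identifier as a string so that leading zeros are preserved.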
spokes_documentation.1485963498.txt.gz · Last modified: 2017/02/01 16:38 by pezik