Frequently asked questions
Last updated
Last updated
Please cite one of the following when you publish work which utilizes TCSE.
Hasebe, Yoichiro. (2015) Design and Implementation of an Online Corpus of Presentation Transcripts of TED Talks. Procedia: Social and Behavioral Sciences 198(24), 174–182.
TCSE uses data provided by TED under the Creative Commons BY-NC-ND license.
TCSE is made available free for non-commercial educational and scientific use, but please use this system at your own risk. All materials and information are provided "as is," with no warranties or guarantees whatsoever.
TCSE is created by (yohasebe@gmail.com
) at Doshisha University, Kyoto, Japan.
TCSE is updated about once a month with newly added talks, transcriptions, and translations. Thus the statistical data of TCSE as a linguistic corpus continuously change through time.
Transcripts TED Talks are being translated in a number of different languages. The number of talks translated varies from language to language. TCSE offers data of languages in which more than 1,000 talks have been translated. Currently 29 languages are available (aside from English, the language of the original talks). Swedish was added most recently in late 2018 after it exceeded the 1,000 threshold.
See the main page of TCSE for numbers of talks translated in each of the languages.
List of translation languages available on TCSE
Arabic
Bulgarian
Chinese, Simplified
Chinese, Traditional
Croatian
Czech
Dutch
French
German
Greek
Hebrew
Hungarian
Indonesian
Italian
Japanese
Korean
Persian
Polish
Portuguese
Portuguese, Brazilian
Romanian
Russian
Serbian
Spanish
Swedish
Thai
Turkish
Ukrainian
Vietnamese