The dataset contains 2485 paragraphs, which comprise of 4002 sentences from audience-addressed speeches, compiled from Wikipedia 2012.
The paragraphs were given to a professionally native female US English speaker, that was instructed to read in a persuasive and lively manner. In turn, based on the recorded speech, each sentence was annotated by 4 professional labelers for emphasized words. The labelers got the original text and the recorded data with the following guidelines.
i) Emphasized words are words that clearly stand out in the speech relatively to most of the words in the sentence.
ii) Label only based on the speech, and not based on the importance of the word in the text.
|Format||License||Domain||Number of Records||Size||Originally Published|
||CC-BY-SA 3.0||Natural Language Processing||
2485 sentences with emphasized words
||3.3 MB||September 02, 2018|
Exposure to video games causes at least a temporary increase in and this exposure correlates with in the real world. A single child would be left with having to provide support for his or her parents and grandparents. The pursuit of athletes has turned into a modern day -.
- IBM Project Debater Project Debater is the first AI system that can debate humans on complex topics. The goal is to help people build persuasive arguments and make well-informed decisions. This dataset contributed to training the models in Project Debater.
- Data Asset eXchange (DAX) Explore useful and relevant data sets for enterprise data science.
- Model Asset eXchange (MAX) A place for developers to find and use free and open source deep learning models.
- Center for Open-Source Data & AI Technologies (CODAIT) Improving the Enterprise AI Lifecycle in Open Source.