- Watson Visual Recognition (VR)
- Watson Speech-to-Text
- Watson Natural Language Understanding
- Watson Tone Analyzer
- Cloudant NoSQL DB
- Watson Tone Analyzer is limited to 2500 API calls.
- Watson Visual Recognition is limited to 250 API calls in a 24-hour period.
- Use short videos. Note that the sample video “Grid Breakers” found in the journey’s media_files directory is around 8 minutes long.
- Only use Visual Recognition after you have confirmed that the video has no issues that would prevent it from being completely processed. To check this, omit the `-V` option from the process command. For example:
  ```shell
  # Speech-to-Text only
  bin/processMedia -S -f public/media_files/grid-breakers.mp4
  ```
- Reduce the number of screen captures by increasing the number of seconds between them. For example:
  ```shell
  # Speech-to-Text and VR, image every 20 seconds
  bin/processMedia -S -V -r 20000 -f public/media_files/grid-breakers.mp4
  ```
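To judge whether a run is likely to stay within the Visual Recognition limit, it can help to estimate the number of screen captures before processing. The sketch below is not part of the project: `estimate_vr_calls` is a hypothetical helper, and it assumes that each screen capture costs exactly one Visual Recognition API call and that the `-r` value is in milliseconds (inferred from the example above, where `-r 20000` yields an image every 20 seconds).

```shell
# Hypothetical helper, not part of bin/processMedia.
# Assumption: one Visual Recognition API call per screen capture.
# Arguments: $1 = video length in seconds, $2 = capture interval in milliseconds
estimate_vr_calls() {
  # Integer ceiling of (length in ms) / (interval in ms)
  echo $(( ($1 * 1000 + $2 - 1) / $2 ))
}

# An 8-minute (480-second) video with one image every 20 seconds
estimate_vr_calls 480 20000   # prints 24
```

Under the same assumptions, a one-second interval (`estimate_vr_calls 480 1000`) gives 480 captures for the same video, which would exceed the limit of 250 calls in a 24-hour period.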