We’re giving away 1,500 DJI Tello drones. Enter to win ›
Get the code
View the demo
By Scott Graham, Rich Hagarty | Published September 19, 2017 - Updated June 20, 2018
Artificial IntelligenceNode.jsPlatform as a ServiceRetail
Multimedia files are increasingly essential for any type of web communication, whether marketing, instructional, or entertainment. This pattern shows you how to use IBM Watson® Node.js SDK to create a web UI app that includes speech-to-text conversion, tone analysis, natural language understanding, and visual recognition processing to enrich multimedia files.
If you use the web – and who doesn’t? – you know that multimedia files are now essential for building an audience. Whether you’re retailing, marketing, instructing, or entertaining, a flat web page is no longer an option. You need audio and video.
Most developers know how to include multimedia content in their apps. But anyone with a lot of video content knows how difficult it is to quickly perform a granular search and pull data from those video files. What’s the essential information covered in the video? How do you find related videos? Can you quickly provide recommendations for other videos to a user? A developer who knows how to quickly search and derive information from video content will have an edge on the pack.
This pattern will help you do more with multimedia. It shows you how to use the IBM Watson Node.js SDK to create a web UI app that includes speech-to-text conversion, tone analysis, natural language understanding, and visual recognition processing to enrich multimedia files. By performing visual recognition every few seconds, you can find information in the video faster and make it readily available for any purpose.
You’ll create two Node.js apps. The first app processes multimedia files using IBM Watson Speech-to-Text, Tone Analyzer, Natural Language Understanding, and Visual Recognition services, plus a Cloudant NoSQL database. The multimedia processor will extract enriched data from the media files and store the result in the NoSQL DB. The second is a web UI app that displays the enriched data, enabling you to view the results in real time as the media file is played back in a series of time segments.
When you complete this pattern, you will understand how to:
If you’re looking to work on dynamic multimedia content and separate yourself from the development pack, this pattern is for you.
Ready to put this code pattern to use? Complete details on how to get started running and using this application are in the README.
April 24, 2018
December 4, 2018
November 17, 2018
Back to top