• United States
IBM?
  • Site map
IBM? developerWorks   Developer Centers
  • Marketplace

  • Close
    Search
  • Sign in
    • Sign in
    • Register
  • IBM Navigation
dW Answers
  • Spaces
    • Blockchain
    • IBM Cloud platform
    • Internet of Things
    • Predictive Analytics
    • Watson
    • See all spaces
  • Tags
  • Users
  • Badges
  • FAQ
  • Help
Close

Name

developerWorks

  • Learn
  • Develop
  • Connect

Discover IBM

  • ConnectMarketplace
  • Products
  • Services
  • Industries
  • Careers
  • Partners
  • Support
10.190.13.206

Watson×

Refine your search by using the following advanced search options.

Criteria Usage
Questions with keyword1 or keyword2 keyword1 keyword2
Questions with a mandatory word, e.g. keyword2 keyword1 +keyword2
Questions excluding a word, e.g. keyword2 keyword1 -keyword2
Questions with keyword(s) and a specific tag keyword1 [tag1]
Questions with keyword(s) and either of two or more specific tags keyword1 [tag1] [tag2]
To search for all posts by a user or all posts with a specific tag, start typing and choose from the suggestion list. Do not use a plus or minus sign with a tag, e.g., +[tag1].
  • Ask a question

Watson STT Speaker label and Smart formatting doesnt work together

270007CEJP gravatar image
Question by Srividhya_Narayanan  (1) | Apr 17, 2017 at 01:38 AM watsonspeech-to-text

Can we enable diarization and smart format in speech to text service at the same time? We actually need both for our requirement –identify between agent/customer, we also need customer addresses/phone numbers in smart format.

Looking at the example at https://speech-to-text-demo.mybluemix.net/ only one or the other works – primarily because smart formatted output is not available in the timestamped words and we need the timestamped words in order to diarize the text. In the below example the timestamped text holds “twenty thousand dollars” whereas the transcript reads as $20000. Is there anyway we can achieve both speaker diarization and smart formatting? Please suggest on the possible option. Thanks! [ "twenty", 27.19, 27.51 ], [ "thousand", 27.51, 27.83 ], [ "dollars", 27.83, 28.35 ] ], "confidence": 0.935, "transcript": "thank you for calling this is Dave speaking how can I help you hi Dave I filled out an application last night and the last page it says to call and give more information I'd be more than happy to assist you with that I'll have to ask you some additional questions okay okay vehicle that you're looking to purchase are you purchasing from an individual or from a dealer I'm an individual okay all right %HESITATION any special occasion for the car purchase no I just want a new car okay and looks like you applied for $20000 " } ], "final": true } ], "result_index": 0 }

People who like this

  0
Comment
10 |3000 characters needed characters left characters exceeded
  • Viewable by all users
  • Viewable by moderators
  • Viewable by moderators and the original poster

2 answers

  • Sort: 
110000PNBC gravatar image

Answer by @chughts (8573) | Apr 17, 2017 at 01:41 PM

What you are seeing is maybe just a feature of the sample application as the API documentation (https://www.ibm.com/watson/developercloud/speech-to-text/api/v1/#recognize_sessionless_nonmp12), doesn't mention any exclusivity between the speaker_labels and smart_formatting options. So I guess you can try specifying both.

Comment

People who like this

  0   Show 2   Share
10 |3000 characters needed characters left characters exceeded
  • Viewable by all users
  • Viewable by moderators
  • Viewable by moderators and the original poster
270007CEJP gravatar image Srividhya_Narayanan (1)   Apr 17, 2017 at 10:21 PM 0
Share

I have tried specifying both. But the timestamped words aren't smart formatted

110000PNBC gravatar image @chughts (8573) ♦ Srividhya_Narayanan (1)   Apr 17, 2017 at 11:43 PM 0
Share

Just to clarify what I think you are saying. Are you using the speaker labels option or the smart formatting option?

270007CEJP gravatar image

Answer by Srividhya_Narayanan (1) | Apr 19, 2017 at 05:56 AM

I am using both speaker_labels true and smart_formatting true. Any responses please? I badly need speaker labeling as we are transcribing call center text. Smart formatting is also needed as the user mentions phone number and we need it to be properly formatted..

Comment

People who like this

  0   Share
10 |3000 characters needed characters left characters exceeded
  • Viewable by all users
  • Viewable by moderators
  • Viewable by moderators and the original poster

Follow this question

114 people are following this question.

Show
Hide
270007CEJP gravatar image
270000GWTC gravatar image
270000FDR4 gravatar image
270000XYBA gravatar image
110000716N gravatar image
270006XPGB gravatar image
12000089VF gravatar image
270007S0YR gravatar image
0 gravatar image
270000HCSB gravatar image
270007J4V5 gravatar image
060000TPFC gravatar image
270007TC7S gravatar image
270007N89G gravatar image
270007E13S gravatar image
270003G2C2 gravatar image
060000U5YA gravatar image
1200006P5U gravatar image
0600020WKY gravatar image
2700077Y16 gravatar image
060000GS8A gravatar image
27000412BP gravatar image
270003TDDU gravatar image
110000A7Q4 gravatar image
0600026V6R gravatar image
270007KSPS gravatar image
120000GKJX gravatar image
270005CRQH gravatar image
270002DX0R gravatar image
1100006DS0 gravatar image
120000PKE0 gravatar image
2700071CQ3 gravatar image
060000JCQ6 gravatar image
120000KBJ0 gravatar image
2700048B8W gravatar image
2700077GBQ gravatar image
2700050NEH gravatar image
270001KHBU gravatar image
270003U2JX gravatar image
1200007P68 gravatar image
060000PBF9 gravatar image
310000C3WF gravatar image
120000FVD3 gravatar image
2700078CT8 gravatar image
120000DJQR gravatar image
110000CPVN gravatar image
100000PUHW gravatar image
060000UPGT gravatar image
3100012P1N gravatar image
270005EH6S gravatar image
2700064F5C gravatar image
270002YGE4 gravatar image
120000K2Y8 gravatar image
31000066QG gravatar image
310001XRBV gravatar image
310000AVXW gravatar image
100000AYJ5 gravatar image
0600006FD4 gravatar image
270007817Q gravatar image
3100022S3B gravatar image
270003TGA5 gravatar image
0600029FSS gravatar image
270000S0MP gravatar image
270000W33V gravatar image
110000AF44 gravatar image
270003TTGW gravatar image
50WJRKN03C gravatar image
270001QB6R gravatar image
270000FXVC gravatar image
27000648WT gravatar image
270000CTQS gravatar image
310002CN85 gravatar image
110000C59H gravatar image
50T5CPU10M gravatar image
50G77GYY6D gravatar image
0600014AB6 gravatar image
3100026SAA gravatar image
3100015MKH gravatar image
310002BBMH gravatar image
270004YK46 gravatar image
310001F4NR gravatar image
270003XXWM gravatar image
270003Y1M4 gravatar image
31000098RE gravatar image
270006TJHJ gravatar image
310002BHAD gravatar image
270007QY2W gravatar image
2700013TF4 gravatar image
270001Y6MF gravatar image
2700039TS4 gravatar image
270001YQ13 gravatar image
50J5B74J8S gravatar image
5070VN6V9H gravatar image
270004MKK4 gravatar image
310000A2A3 gravatar image
27000341TY gravatar image
3100009MTN gravatar image
270007DV4D gravatar image
1000007SAX gravatar image
310000PX76 gravatar image
0600024THK gravatar image
110000NN2R gravatar image
1100007VMM gravatar image
50C3FWBXXD gravatar image
50RSE1REXS gravatar image
270005MKK0 gravatar image
110000PNBC gravatar image
27000035QP gravatar image
270004QGTF gravatar image
2700051773 gravatar image
060001F069 gravatar image
270005NUPA gravatar image
1000000446 gravatar image
1200006ABB gravatar image

Answers

Answers & comments

Related questions

How do I send an audio file for transcription using IBM Watson Android sdk? 4 Answers

speech to text is not converting properly 0 Answers

Integration Speech to Text with PHP project 1 Answer

How to minimize the time of the final response from Watson STT? 2 Answers

Streaming Speech to Text websocket fails with 500 error (python3) why? 1 Answer

  • Contact
  • Privacy
  • Terms of use
  • Accessibility
  • Report Abuse
  • Cookie Preferences

Powered by AnswerHub

Authentication check. Please ignore.
  • Anonymous
  • Sign in
  • Create
  • Ask a question
  • Spaces
  • API Connect
  • Application Performance Management
  • Appsecdev
  • BPM
  • Blockchain
  • Business Transaction Intelligence
  • CAPI
  • CAPI SNAP
  • CICS
  • Cloud Analytics
  • Cloud Automation
  • Cloud Object Storage
  • Cloud marketplace
  • Collaboration
  • Content Services (ECM)
  • Continuous Testing
  • Courses
  • Customer Experience Analytics
  • DB2 LUW
  • DataPower
  • Decision Optimization
  • DevOps Services
  • Digital Commerce
  • Digital Experience
  • Finance
  • Global Entrepreneur Program
  • Hadoop
  • IBM Cloud platform
  • IBM Design
  • IBM Forms Experience Builder
  • IBM Maximo Developer
  • IBM StoredIQ
  • IBM StoredIQ-Cartridges
  • IIDR
  • ITOA
  • InformationServer
  • Integration Bus
  • Internet of Things
  • Kenexa
  • Linux on Power
  • LinuxONE
  • MDM
  • Mainframe
  • Messaging
  • Node.js
  • ODM
  • Open
  • PowerAI
  • PowerVC
  • Predictive Analytics
  • Product Insights
  • PureData for Analytics
  • Push
  • QRadar App Development
  • Run Book Automation
  • Search Insights
  • Storage
  • Streamsdev
  • Supply Chain Business Network
  • Supply Chain Insights
  • Swift
  • UBX Capture
  • Universal Behavior Exchange
  • UrbanCode
  • WASdev
  • WSRR
  • Watson
  • Watson Campaign Automation
  • Watson Content Hub
  • Watson Marketing Insights
  • dW Answers Help
  • dW Premium
  • developerWorks Sandbox
  • developerWorks Team
  • Watson Health
  • More
  • Tags
  • Questions
  • Users
  • Badges