Digital Developer Conference: a FREE half-day online conference focused on AI & Cloud – North America: Nov 2 – India: Nov 9 – Europe: Nov 14 – Asia Nov 23 Register now

Close outline
  • United States
IBM?
  • Site map
IBM?
  • Marketplace

  • Close
    Search
  • Sign in
    • Sign in
    • Register
  • IBM Navigation
IBM Developer Answers
  • Spaces
    • Blockchain
    • IBM Cloud platform
    • Internet of Things
    • Predictive Analytics
    • Watson
    • See all spaces
  • Tags
  • Users
  • Badges
  • FAQ
  • Help
Close

Name

Community

  • Learn
  • Develop
  • Connect

Discover IBM

  • ConnectMarketplace
  • Products
  • Services
  • Industries
  • Careers
  • Partners
  • Support
10.190.13.195

Refine your search by using the following advanced search options.

Criteria Usage
Questions with keyword1 or keyword2 keyword1 keyword2
Questions with a mandatory word, e.g. keyword2 keyword1 +keyword2
Questions excluding a word, e.g. keyword2 keyword1 -keyword2
Questions with keyword(s) and a specific tag keyword1 [tag1]
Questions with keyword(s) and either of two or more specific tags keyword1 [tag1] [tag2]
To search for all posts by a user or all posts with a specific tag, start typing and choose from the suggestion list. Do not use a plus or minus sign with a tag, e.g., +[tag1].
  • Ask a question

SPSS Web Feed Unicode Error

50MH2N1CJ2 gravatar image
Question by CarlG  (1) | Oct 31, 2016 at 09:10 AM spssunicodetextmining

I'm using text analytics in SPSS Modeler 17.1. With an HTML url input to the web feed node, I'm getting the following error:

[2016-10-31 14:03:02] Web Feed Error: Invalid character (Unicode: 0x3) - Memory : 98996kb - Memory peak : 140808kb - Memory : 93896kb - Memory peak : 140808kb

Is there a way to correct this?

People who like this

  0   Show 1
Comment
10 |3000 characters needed characters left characters exceeded
  • Viewable by all users
  • Viewable by moderators
  • Viewable by moderators and the original poster
270003YFX7 gravatar image TravisL (256) ♦   Oct 31, 2016 at 09:50 AM 0
Share

@CarlG Does this error occur for every web feed URL you enter in the web feed node? Or just one in particular? For example, what happens if you try using one or both of the following web feed URLs in a web feed node and preview the data:

http://www.feedforall.com/sample.xml http://www.feedforall.com/sample-feed.xml

Does that work in your client? If the above examples work for you, but the one URL you are trying does not, are you able to share what that URL is that you're using for others to try to test with?

3 answers

  • Sort: 
50MH2N1CJ2 gravatar image

Answer by CarlG (1) | Oct 31, 2016 at 11:03 AM

@TravisL For one in particular. Your two feeds work fine.

I'm using the following which works okay for pages 1 to 30, but gives me unicode errors when adding page 31 to the input. However, if I input ONLY page 31, it seems to work.

https://www.digitalmarketplace.service.gov.uk/g-cloud/search?q=&page=1

--- repeated ---

https://www.digitalmarketplace.service.gov.uk/g-cloud/search?q=&page=30 https://www.digitalmarketplace.service.gov.uk/g-cloud/search?q=&page=31

Comment

People who like this

  0   Show 1   Share
10 |3000 characters needed characters left characters exceeded
  • Viewable by all users
  • Viewable by moderators
  • Viewable by moderators and the original poster
270003YFX7 gravatar image TravisL (256) ♦   Nov 01, 2016 at 09:02 AM 0
Share

I tested putting those URLs in a web feed node, all the way from page 1 to page 31, and I did not see any errors at all. However, I also did not return any data, just empty columns. Attached is the basic test stream I created and ran. So if you attempt to run this same stream on your end, you get the same error?link text

web-feed-test.zip (3.2 kB)
50MH2N1CJ2 gravatar image

Answer by CarlG (1) | Nov 01, 2016 at 12:54 PM

@TravisL There needs to be a start tag in the web feed for each URL to return data.

I've attached the version I'm trying to run.

link text


dm.zip (932.5 kB)
Comment

People who like this

  0   Show 1   Share
10 |3000 characters needed characters left characters exceeded
  • Viewable by all users
  • Viewable by moderators
  • Viewable by moderators and the original poster
270003YFX7 gravatar image TravisL (256) ♦   Nov 01, 2016 at 04:52 PM 0
Share

@CarlG I was unable to extract and open the attached zip file. It seems to have become corrupted. Can you try attaching again?

50MH2N1CJ2 gravatar image

Answer by CarlG (1) | Nov 02, 2016 at 05:11 AM

Hi @TravisL ... apols, try this one ... thanks for persevering with this!

link text


dm-2.zip (932.7 kB)
Comment

People who like this

  0   Share
10 |3000 characters needed characters left characters exceeded
  • Viewable by all users
  • Viewable by moderators
  • Viewable by moderators and the original poster

Follow this question

92 people are following this question.

Answers

Answers & comments

Related questions

SPSS Modeler premium won't install 5 Answers

SPSS Web Feed Login 0 Answers

TEXT MINING FIELD WON'T ALLOW ME TO SELECT PATH 0 Answers

SPSS Modeler Text Mining - Paragraph Mode - Can I create a manual paragraphing in a word file? 1 Answer

How one can see the colored category matching onn the full document at the text mining node? 1 Answer

  • Contact
  • Privacy
  • IBM Developer Terms of use
  • Accessibility
  • Report Abuse
  • Cookie Preferences

Powered by AnswerHub

Authentication check. Please ignore.
  • Anonymous
  • Sign in
  • Create
  • Ask a question
  • Spaces
  • API Connect
  • Analytic Hybrid Cloud Core
  • Application Performance Management
  • Appsecdev
  • BPM
  • Blockchain
  • Business Transaction Intelligence
  • CAPI
  • CAPI SNAP
  • CICS
  • Cloud Analytics
  • Cloud Automation
  • Cloud Object Storage
  • Cloud marketplace
  • Collaboration
  • Content Services (ECM)
  • Continuous Testing
  • Courses
  • Customer Experience Analytics
  • DB2 LUW
  • Data and AI
  • DataPower
  • Decision Optimization
  • DevOps Build
  • DevOps Services
  • Developers IBM MX
  • Digital Commerce
  • Digital Experience
  • Finance
  • Global Entrepreneur Program
  • Hadoop
  • Hybrid Cloud Core
  • Hyper Protect
  • IBM Cloud platform
  • IBM Design
  • IBM Forms Experience Builder
  • IBM Maximo Developer
  • IBM StoredIQ
  • IBM StoredIQ-Cartridges
  • IIDR
  • ITOA
  • InformationServer
  • Integration Bus
  • Internet of Things
  • Kenexa
  • Linux on Power
  • LinuxONE
  • MDM
  • Mainframe
  • Messaging
  • Node.js
  • ODM
  • Open
  • PartnerWorld Developer Support
  • PowerAI
  • PowerVC
  • Predictive Analytics
  • Product Insights
  • PureData for Analytics
  • Push
  • QRadar App Development
  • Run Book Automation
  • Search Insights
  • Security Core
  • Storage
  • Storage Core
  • Streamsdev
  • Supply Chain Business Network
  • Supply Chain Insights
  • Swift
  • UBX Capture
  • Universal Behavior Exchange
  • UrbanCode
  • WASdev
  • WSRR
  • Watson
  • Watson Campaign Automation
  • Watson Content Hub
  • Watson Marketing Insights
  • dW Answers Help
  • dW Premium
  • developerWorks Sandbox
  • developerWorks Team
  • Watson Health
  • More
  • Tags
  • Questions
  • Users
  • Badges