It’s that time of year for the annual conversation between The New Builders and ODPi, the Linux Foundation project standardizing the big data ecosystem around Apache™ Hadoop®.
We met up with ODPi VP of Technology Roman Shoposhnik at ApacheCon 2017 in Miami, building on the interview we conducted last year with ODPi Director John Mertic in Vancouver.
Roman shares how he sets a fair technical agenda for ODPi as VP of Technology (5:30), ODPi’s moves toward tighter integration with ASF (7:55), why he thinks of ODPi as the “commercial yin” to the “open source yang” of the Apache™ Bigtop® project that he founded (11:25), and why IoT’s prominence today reminds him of Big Data a decade ago (13:10).
Register for the IBM Data Science Bootcamp at Spark Summit to learn the ins and outs of the open source PixieDust library and how it simplifies working in a Jupyter Notebook.
You can find new episodes of The New Builders on developerWorks TV and SoundCloud. Find out more about IBM Watson Data Platform at ibm.co/watsondataplatform. Contact host Jim Young on Twitter (@JW_Young) or email (firstname.lastname@example.org). The show’s music is provided by School for Robots. Check them out at schoolforrobots.bandcamp.com!
[Register]: IBM Data Science Bootcamp at Spark Summit
Register today to join IBM Distinguished Engineer David Taieb for a data science workshop during Spark Summit, June 7, 3-5pm at Galvanize in San Francisco. Using open traffic data from San Francisco, you’ll learn how to use the open source PixieDust library to work more efficiently with Apache™ Spark® and Jupyter Notebooks. With PixieDust, you can:
- Speed data manipulation and display with features like auto-visualization of Spark DataFrames
- Enable real-time Spark Job progress monitoring directly from the Notebook
- Build a custom dashboard combining multiple data sources to extract insights