Subscribe to our Newsletter

Parallelize R Code Using Apache® Spark™

Event Details

Parallelize R Code Using Apache® Spark™

Time: August 15, 2017 from 9am to 10am
Location: Online
Website or Map: http://bit.ly/2udd64g
Event Type: dsc, webinar
Organized By: Bill Vorhies, Editorial Director -- Data Science Central
Latest Activity: Jul 13

Export to Outlook or iCal (.ics)

Event Description

Space is limited.

Reserve your Webinar seat now

R is the latest language added to Apache Spark, and the SparkR API is slightly different from PySpark. SparkR’s evolving interface to Apache Spark offers a wide range of APIs and capabilities to Data Scientists and Statisticians. With the release of Spark 2.0, and subsequent releases, the R API officially supports executing user code on distributed data. This is done primarily through a family of apply() functions.

In this Data Science Central webinar, we will explore the following:

  • Provide an overview of this new functionality in SparkR.
  • Show how to use this API with some changes to regular code with dapply().
  • Focus on how to correctly use this API to parallelize existing R packages.
  • Consider performance and examine correctness when using the apply family of functions in SparkR.

SpeakerHossein Falaki, Software Engineer -- Databricks Inc.

Hosted by: 
Bill VorhiesEditorial Director -- Data Science Central

Again, Space is limited so please register early:

Reserve your Webinar seat now

 

After registering you will receive a confirmation email containing information about joining the Webinar.

Comment Wall

Comment

RSVP for Parallelize R Code Using Apache® Spark™ to add comments!

Join Big Data News

Attending (1)

© 2017   BigDataNews.com is a subsidiary of DataScienceCentral LLC and not affiliated with Systap   Powered by

Badges  |  Report an Issue  |  Terms of Service