Friday, February 25, 2022

3.3. Spark (as a processing engine) - 2> Notebook (With Dataframe) in DataBricks

 3.3. Spark (as a processing engine) - 2> Notebook (With Dataframe) in DataBricks

 

With Databricks

  1. Create the Cluster.

    Select the runtime as 6.6 and give a cluster name. Finally , click on Create Cluster.

    confirm by clicking "clusters" page

  1. upload the file. upload file "people_with_header.csv" , it will be uploaded to the path "/FileStore/tables/people_with_header.csv" ( please make note of the path).Click on "Data" , and then "Add Data" . Drop the file "people_with_header.csv" in "Drop files to upload"

  1. Import sparkSQL Demo.html . Click on "workspace" and import from there.

 

 


No comments:

Post a Comment

6.1. Kafka : Run

  6.1. Kafka : Run