Azure Databricks | Cookbook
Reading Data
Create Table from CSV file with SQL
DROP TABLE IF EXISTS quickstart;
CREATE TABLE quickstart
USING csv
OPTIONS (path "/databricks-datasets/data.csv", header "true");
Create Table from CSV file with PySpark
quickstart = spark.read.csv("/databricks-datasets/data.csv", header=True, inferSchema=True)
Analyse Data
Group and Display
from pyspark.sql.functions import avg

# Average price per color, sorted by color; display() is the Databricks notebook renderer
display(quickstart.select("color", "price").groupBy("color").agg(avg("price")).sort("color"))
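To see what the grouping above computes without a cluster at hand, the same logic can be sketched in plain Python. This is only an illustration: the sample rows and the helper name average_price_by_color are made up here, and only the column names color and price come from the snippet above.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical sample rows standing in for the "color" and "price" columns.
rows = [
    {"color": "red", "price": 10.0},
    {"color": "blue", "price": 4.0},
    {"color": "red", "price": 20.0},
    {"color": "blue", "price": 6.0},
]


def average_price_by_color(rows):
    """Return (color, average price) pairs sorted by color."""
    prices = defaultdict(list)
    for row in rows:
        # Collect all prices under their color, like groupBy("color").
        prices[row["color"]].append(row["price"])
    # Average each group, like agg(avg("price")), then order like sort("color").
    return sorted((color, mean(values)) for color, values in prices.items())


result = average_price_by_color(rows)
print(result)
```

Each color appears once in the result, paired with the mean of its prices, which is the shape of the table that the Spark version displays.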