Category: Spark Sql

Add constant column in spark 0

Add constant column in spark

If we want to add a column with default value then we can do in spark. In spark 2.2 there are two ways to add constant value in...

JDBC in Spark SQL 0

JDBC in Spark SQL

Apache Spark has very powerful built-in API for gathering data from a relational database. Effectiveness and efficiency, following the usual Spark approach, is managed in a transparent way....

Redshift Database connection in spark 0

Redshift Database connection in spark

This blog primarily focus on how to connect to redshift from Spark. Redshift: Amazon Redshift is a fully managed petabyte-scale data warehouse service. Redshift is designed for analytic...

Spark SQL Using Parquet 0

Spark SQL Using Parquet

Today, I’m focusing on how to use parquet format in spark.  Please get the more insight about parquet format If you are new to this format. Parquet: Apache Parquet is a...

Spark SQL Using Hive 0

Spark SQL Using Hive

In this blog I’m going to describe how to integrate hive with spark. You may find this code on spark’s official github page. My effort is to describe...