05 Transform Stage and Store - Spark/057 Introduction.mp49.16MB
05 Transform Stage and Store - Spark/058 Introduction to Spark.mp43.98MB
05 Transform Stage and Store - Spark/059 Setup Spark on Windows.mp489.08MB
05 Transform Stage and Store - Spark/060 Quick overview about Spark documentation.mp419.69MB
05 Transform Stage and Store - Spark/061 Initializing Spark job using spark-shell.mp467.65MB
05 Transform Stage and Store - Spark/062 Create Resilient Distributed Data Sets (RDD).mp436.08MB
05 Transform Stage and Store - Spark/063 Previewing data from RDD.mp464.45MB
05 Transform Stage and Store - Spark/064 Reading different file formats - Brief overview using JSON.mp426.23MB
05 Transform Stage and Store - Spark/065 Transformations Overview.mp410.38MB
05 Transform Stage and Store - Spark/066 Manipulating Strings as part of transformations using Scala.mp433.46MB
05 Transform Stage and Store - Spark/067 Row level transformations using map.mp447.75MB
05 Transform Stage and Store - Spark/068 Row level transformations using flatMap.mp426.5MB
05 Transform Stage and Store - Spark/069 Filtering the data.mp458.38MB
05 Transform Stage and Store - Spark/070 Joining data sets - inner join.mp429.77MB
05 Transform Stage and Store - Spark/071 Joining data sets - outer join.mp451.11MB
05 Transform Stage and Store - Spark/072 Aggregations - Getting Started.mp415.19MB
05 Transform Stage and Store - Spark/073 Aggregations - using actions (reduce and countByKey).mp443.84MB
05 Transform Stage and Store - Spark/074 Aggregations - understanding combiner.mp414.36MB
05 Transform Stage and Store - Spark/075 Aggregations using groupByKey - least preferred API for aggregations.mp456.34MB
05 Transform Stage and Store - Spark/076 Aggregations using reduceByKey.mp425.74MB
05 Transform Stage and Store - Spark/077 Aggregations using aggregateByKey.mp460.81MB
05 Transform Stage and Store - Spark/078 Sorting data using sortByKey.mp459.24MB
05 Transform Stage and Store - Spark/079 Global Ranking - using sortByKey with take and takeOrdered.mp434.09MB
05 Transform Stage and Store - Spark/080 By Key Ranking - Converting (K V) pairs into (K IterableV) using groupByKey.mp419.54MB
05 Transform Stage and Store - Spark/081 Get topNPrices using Scala Collections API.mp431.87MB
05 Transform Stage and Store - Spark/082 Get topNPricedProducts using Scala Collections API.mp437.04MB
05 Transform Stage and Store - Spark/083 Get top n products by category using groupByKey flatMap and Scala function.mp419.58MB
05 Transform Stage and Store - Spark/084 Set Operations - union intersect distinct as well as minus.mp459.35MB
05 Transform Stage and Store - Spark/085 Save data in Text Input Format.mp443.22MB
05 Transform Stage and Store - Spark/086 Save data in Text Input Format using Compression.mp428.82MB
05 Transform Stage and Store - Spark/087 Saving data in standard file formats - Overview.mp429.41MB
05 Transform Stage and Store - Spark/088 Revision of Problem Statement and Design the solution.mp48.75MB
05 Transform Stage and Store - Spark/089 Solution - Get Daily Revenue per Product - Launching Spark Shell.mp425.22MB
05 Transform Stage and Store - Spark/090 Solution - Get Daily Revenue per Product - Read and join orders and order_items.mp448.44MB
05 Transform Stage and Store - Spark/091 Solution - Get Daily Revenue per Product - Compute daily revenue per product id.mp439.35MB
05 Transform Stage and Store - Spark/092 Solution - Get Daily Revenue per Product - Read products data and create RDD.mp444.03MB
05 Transform Stage and Store - Spark/093 Solution - Get Daily Revenue per Product - Sort and save to HDFS.mp488.61MB
05 Transform Stage and Store - Spark/094 Solution - Add spark dependencies to sbt.mp419.98MB
05 Transform Stage and Store - Spark/095 Solution - Develop as Scala based application.mp490.52MB
05 Transform Stage and Store - Spark/096 Solution - Run locally using spark-submit.mp435.93MB
06 Data Analysis - Spark SQL or HiveQL/098 Different interfaces to run Hive queries.mp422.83MB
06 Data Analysis - Spark SQL or HiveQL/099 Create Hive tables and load data in text file format.mp467.97MB
06 Data Analysis - Spark SQL or HiveQL/100 Create Hive tables and load data in ORC file format.mp430.55MB
06 Data Analysis - Spark SQL or HiveQL/101 Using spark-shell to run Hive queries or commands.mp413.37MB
06 Data Analysis - Spark SQL or HiveQL/102 Functions - Getting Started.mp410.85MB
06 Data Analysis - Spark SQL or HiveQL/103 Functions - Manipulating Strings.mp454.33MB
06 Data Analysis - Spark SQL or HiveQL/104 Functions - Manipulating Dates.mp433MB
06 Data Analysis - Spark SQL or HiveQL/105 Functions - Aggregations.mp414.9MB
06 Data Analysis - Spark SQL or HiveQL/106 Functions - CASE.mp432.08MB
06 Data Analysis - Spark SQL or HiveQL/107 Row level transformations.mp417.67MB
06 Data Analysis - Spark SQL or HiveQL/108 Joins.mp451.25MB
06 Data Analysis - Spark SQL or HiveQL/109 Aggregations.mp438.65MB
06 Data Analysis - Spark SQL or HiveQL/110 Sorting.mp421.37MB
06 Data Analysis - Spark SQL or HiveQL/111 Set Operations.mp413.2MB
06 Data Analysis - Spark SQL or HiveQL/112 Analytics Functions - Aggregations.mp445.26MB
06 Data Analysis - Spark SQL or HiveQL/113 Analytics Functions - Ranking.mp424.06MB
06 Data Analysis - Spark SQL or HiveQL/114 Windowing Functions.mp425.55MB
06 Data Analysis - Spark SQL or HiveQL/115 Create Data Frame and Register as Temp table.mp444.57MB
06 Data Analysis - Spark SQL or HiveQL/116 Writing Spark SQL Applications - process data.mp429.69MB
06 Data Analysis - Spark SQL or HiveQL/117 Writing Spark SQL Applications - Save data into Hive tables.mp424.37MB
06 Data Analysis - Spark SQL or HiveQL/118 Data Frame Operations.mp445.35MB
07 Data Ingest - real time near real time and streaming analytics/119 Introduction.mp49.46MB
07 Data Ingest - real time near real time and streaming analytics/120 Flume - Getting Started.mp453.97MB
07 Data Ingest - real time near real time and streaming analytics/121 Flume - Web Server Logs to HDFS - Introduction.mp460.36MB
07 Data Ingest - real time near real time and streaming analytics/122 Flume - Web Server Logs to HDFS - Setup Data.mp420.28MB
07 Data Ingest - real time near real time and streaming analytics/123 Flume - Web Server Logs to HDFS - Source exec.mp419.17MB
07 Data Ingest - real time near real time and streaming analytics/124 Flume - Web Server Logs to HDFS - Sink HDFS - Getting Started.mp444.77MB
07 Data Ingest - real time near real time and streaming analytics/125 Flume - Web Server Logs to HDFS - Sink HDFS - Customize properties.mp440.35MB
07 Data Ingest - real time near real time and streaming analytics/126 Flume - Web Server Logs to HDFS - Deep dive to memory channel.mp411.77MB
07 Data Ingest - real time near real time and streaming analytics/127 Kafka - Getting Started - High Level Architecture.mp417.97MB
07 Data Ingest - real time near real time and streaming analytics/128 Kafka - Getting Started - Produce and consume messages using commands.mp461.15MB
07 Data Ingest - real time near real time and streaming analytics/129 Kafka - Anatomy of a topic.mp433.46MB
07 Data Ingest - real time near real time and streaming analytics/130 Flume and Kafka in Streaming analytics.mp419.6MB
07 Data Ingest - real time near real time and streaming analytics/131 Spark Streaming - Getting Started.mp436.44MB
07 Data Ingest - real time near real time and streaming analytics/132 Spark Streaming - Setting up netcat.mp419.57MB
07 Data Ingest - real time near real time and streaming analytics/133 Spark Streaming - Develop Word Count program.mp430.77MB
07 Data Ingest - real time near real time and streaming analytics/134 Spark Streaming - Ship and run word count program on the cluster.mp431.9MB
07 Data Ingest - real time near real time and streaming analytics/135 Spark Streaming - Data Structure (DStream) and APIs overview.mp427.4MB
07 Data Ingest - real time near real time and streaming analytics/136 Spark Streaming - Get department wise traffic - Problem Statement.mp423.35MB
07 Data Ingest - real time near real time and streaming analytics/137 Spark Streaming - Get department wise traffic - Development.mp432.02MB
07 Data Ingest - real time near real time and streaming analytics/138 Spark Streaming - Get department wise traffic - Run on the cluster.mp437.83MB
07 Data Ingest - real time near real time and streaming analytics/139 Flume and Spark Streaming - Department Wise Traffic - Setup Flume.mp461.65MB
07 Data Ingest - real time near real time and streaming analytics/140 Flume and Spark Streaming - Department Wise Traffic - Add sbt dependencies.mp421.15MB
07 Data Ingest - real time near real time and streaming analytics/141 Flume and Spark Streaming - Department Wise Traffic - Develop and build.mp416.2MB
07 Data Ingest - real time near real time and streaming analytics/142 Flume and Spark Streaming - Department Wise Traffic - Run and Validate.mp448.74MB
07 Data Ingest - real time near real time and streaming analytics/143 Flume and Kafka integration - Develop configuration file.mp442.48MB
07 Data Ingest - real time near real time and streaming analytics/144 Flume and Kafka integration - Run and validate.mp450.27MB
07 Data Ingest - real time near real time and streaming analytics/145 Kafka and Spark Streaming - Add dependencies.mp423.09MB
07 Data Ingest - real time near real time and streaming analytics/146 Kafka and Spark Streaming - Develop and build application.mp424.65MB
07 Data Ingest - real time near real time and streaming analytics/147 Kafka and Spark Streaming - Run and Validate.mp442.95MB
08 Sample scenarios with solutions/148 Introduction to Sample Scenarios and Solutions.mp48.85MB
08 Sample scenarios with solutions/149 Problem Statements - General Guidelines.mp412.63MB
08 Sample scenarios with solutions/150 Initializing the job - General Guidelines.mp434.19MB
08 Sample scenarios with solutions/151 Getting crime count per type per month - Understanding Data.mp443.29MB
08 Sample scenarios with solutions/152 Getting crime count per type per month - Implementing the logic - Core API.mp468.07MB
08 Sample scenarios with solutions/153 Getting crime count per type per month - Implementing the logic - Data Frames.mp481.55MB
08 Sample scenarios with solutions/154 Getting crime count per type per month - Validating Output.mp417.64MB
08 Sample scenarios with solutions/155 Get inactive customers - using Core Spark API (leftOuterJoin).mp464.28MB
08 Sample scenarios with solutions/156 Get inactive customers - using Data Frames and SQL.mp475.04MB
08 Sample scenarios with solutions/157 Get top 3 crimes in RESIDENCE - using Core Spark API.mp464.53MB
08 Sample scenarios with solutions/158 Get top 3 crimes in RESIDENCE - using Data Frame and SQL.mp462.3MB
08 Sample scenarios with solutions/159 Convert NYSE data from text file format to parquet file format.mp456.19MB
08 Sample scenarios with solutions/160 Get word count - with custom control arguments num keys and file format.mp470.63MB