04. Data Ingestion - Apache Sqoop/39. Sqoop Import - Using split by.mp462.56MB
08. Sample scenarios with solutions/158. Get top 3 crimes in RESIDENCE - using Data Frame and SQL.mp462.3MB
07. Data Ingest - real time, near real time and streaming analytics/139. Flume and Spark Streaming - Department Wise Traffic - Setup Flume.mp461.65MB
07. Data Ingest - real time, near real time and streaming analytics/128. Kafka - Getting Started - Produce and consume messages using commands.mp461.15MB
05. Transform, Stage and Store - Spark/77. Aggregations using aggregateByKey.mp460.81MB
07. Data Ingest - real time, near real time and streaming analytics/121. Flume - Web Server Logs to HDFS - Introduction.mp460.36MB
05. Transform, Stage and Store - Spark/84. Set Operations - union, intersect, distinct as well as minus.mp459.35MB
05. Transform, Stage and Store - Spark/78. Sorting data using sortByKey.mp459.24MB
05. Transform, Stage and Store - Spark/69. Filtering the data.mp458.38MB
05. Transform, Stage and Store - Spark/75. Aggregations using groupByKey - least preferred API for aggregations.mp456.34MB
08. Sample scenarios with solutions/159. Convert NYSE data from text file format to parquet file format.mp456.19MB
06. Data Analysis - Spark SQL or HiveQL/103. Functions - Manipulating Strings.mp454.33MB
07. Data Ingest - real time, near real time and streaming analytics/120. Flume - Getting Started.mp453.97MB
04. Data Ingestion - Apache Sqoop/44. Sqoop Import - columns and query.mp453.76MB
04. Data Ingestion - Apache Sqoop/37. Sqoop Import - Execution Life Cycle.mp453.34MB
06. Data Analysis - Spark SQL or HiveQL/108. Joins.mp451.25MB
05. Transform, Stage and Store - Spark/71. Joining data sets - outer join.mp451.11MB
07. Data Ingest - real time, near real time and streaming analytics/144. Flume and Kafka integration - Run and validate.mp450.27MB
04. Data Ingestion - Apache Sqoop/55. Sqoop Export - Update and Upsert.mp449.74MB
07. Data Ingest - real time, near real time and streaming analytics/142. Flume and Spark Streaming - Department Wise Traffic - Run and Validate.mp448.74MB
05. Transform, Stage and Store - Spark/90. Solution - Get Daily Revenue per Product - Read and join orders and order_items.mp448.44MB
05. Transform, Stage and Store - Spark/67. Row level transformations using map.mp447.75MB
05. Transform, Stage and Store - Spark/97. Solution - Ship and run it on big data cluster.mp447.18MB
06. Data Analysis - Spark SQL or HiveQL/118. Data Frame Operations.mp445.35MB
06. Data Analysis - Spark SQL or HiveQL/112. Analytics Functions - Aggregations.mp445.26MB
07. Data Ingest - real time, near real time and streaming analytics/124. Flume - Web Server Logs to HDFS - Sink HDFS - Getting Started.mp444.77MB
06. Data Analysis - Spark SQL or HiveQL/115. Create Data Frame and Register as Temp table.mp444.57MB
04. Data Ingestion - Apache Sqoop/50. Sqoop Import - Import all tables.mp444.07MB
05. Transform, Stage and Store - Spark/92. Solution - Get Daily Revenue per Product - Read products data and create RDD.mp444.03MB
04. Data Ingestion - Apache Sqoop/34. Sqoop connect string and validating using list commands.mp443.87MB
05. Transform, Stage and Store - Spark/73. Aggregations - using actions (reduce and countByKey).mp443.84MB
08. Sample scenarios with solutions/151. Getting crime count per type per month - Understanding Data.mp443.29MB
05. Transform, Stage and Store - Spark/85. Save data in Text Input Format.mp443.22MB
05. Transform, Stage and Store - Spark/76. Aggregations using reduceByKey.mp425.74MB
06. Data Analysis - Spark SQL or HiveQL/114. Windowing Functions.mp425.55MB
05. Transform, Stage and Store - Spark/89. Solution - Get Daily Revenue per Product - Launching Spark Shell.mp425.22MB
07. Data Ingest - real time, near real time and streaming analytics/146. Kafka and Spark Streaming - Develop and build application.mp424.65MB
06. Data Analysis - Spark SQL or HiveQL/117. Writing Spark SQL Applications - Save data into Hive tables.mp424.37MB
06. Data Analysis - Spark SQL or HiveQL/113. Analytics Functions - Ranking.mp424.06MB
07. Data Ingest - real time, near real time and streaming analytics/136. Spark Streaming - Get department wise traffic - Problem Statement.mp423.35MB
07. Data Ingest - real time, near real time and streaming analytics/145. Kafka and Spark Streaming - Add dependencies.mp423.09MB
03. Getting Started/24. Setup Environment - using Cloudera Quickstart VM.mp423.07MB
06. Data Analysis - Spark SQL or HiveQL/98. Different interfaces to run Hive queries.mp422.83MB
04. Data Ingestion - Apache Sqoop/33. Preview of MySQL on labs.mp422.25MB
02. Scala Fundamentals/04. Setup Scala on Windows.mp421.86MB
06. Data Analysis - Spark SQL or HiveQL/110. Sorting.mp421.37MB
02. Scala Fundamentals/16. Development Cycle - Compile source code to jar using SBT.mp421.33MB
07. Data Ingest - real time, near real time and streaming analytics/140. Flume and Spark Streaming - Department Wise Traffic - Add sbt dependencies.mp421.15MB
01. Introduction/02. Using labs for preparation.mp420.44MB
07. Data Ingest - real time, near real time and streaming analytics/122. Flume - Web Server Logs to HDFS - Setup Data.mp420.28MB
05. Transform, Stage and Store - Spark/94. Solution - Add spark dependencies to sbt.mp419.98MB
05. Transform, Stage and Store - Spark/60. Quick overview about Spark documentation.mp419.69MB
04. Data Ingestion - Apache Sqoop/32. Accessing Sqoop Documentation.mp419.63MB
07. Data Ingest - real time, near real time and streaming analytics/130. Flume and Kafka in Streaming analytics.mp419.6MB
05. Transform, Stage and Store - Spark/83. Get top n products by category using groupByKey, flatMap and Scala function.mp419.58MB
07. Data Ingest - real time, near real time and streaming analytics/132. Spark Streaming - Setting up netcat.mp419.57MB
05. Transform, Stage and Store - Spark/80. By Key Ranking - Converting (K, V) pairs into (K, Iterable[V]) using groupByKey.mp419.54MB
01. Introduction/01. CCA 175 Spark and Hadoop Developer - Curriculum.mp419.32MB
07. Data Ingest - real time, near real time and streaming analytics/123. Flume - Web Server Logs to HDFS - Source exec.mp419.17MB
02. Scala Fundamentals/10. Collections - Seq, Set and Map.mp418.7MB
07. Data Ingest - real time, near real time and streaming analytics/127. Kafka - Getting Started - High Level Architecture.mp417.97MB
03. Getting Started/30. Setup Data Sets.mp417.82MB
06. Data Analysis - Spark SQL or HiveQL/107. Row level transformations.mp417.67MB
08. Sample scenarios with solutions/154. Getting crime count per type per month - Validating Output.mp417.64MB
02. Scala Fundamentals/15. Development Cycle - Developing Source code.mp416.39MB
07. Data Ingest - real time, near real time and streaming analytics/141. Flume and Spark Streaming - Department Wise Traffic - Develop and build.mp416.2MB
05. Transform, Stage and Store - Spark/72. Aggregations - Getting Started.mp415.19MB
06. Data Analysis - Spark SQL or HiveQL/105. Functions - Aggregations.mp414.9MB
05. Transform, Stage and Store - Spark/74. Aggregations - understanding combiner.mp414.36MB
04. Data Ingestion - Apache Sqoop/40. Sqoop Import - auto reset to one mapper.mp414.21MB
03. Getting Started/21. Introduction and Curriculum.mp414.03MB
06. Data Analysis - Spark SQL or HiveQL/101. Using spark-shell to run Hive queries or commands.mp413.37MB
06. Data Analysis - Spark SQL or HiveQL/111. Set Operations.mp413.2MB
08. Sample scenarios with solutions/149. Problem Statements - General Guidelines.mp412.63MB
07. Data Ingest - real time, near real time and streaming analytics/126. Flume - Web Server Logs to HDFS - Deep dive to memory channel.mp411.77MB
02. Scala Fundamentals/17. Development Cycle - Setup SBT on Windows.mp411.08MB
02. Scala Fundamentals/18. Development Cycle - Compile changes and run jar with arguments.mp410.95MB
02. Scala Fundamentals/12. Setting up Data Sets for Basic I_O Operations.mp410.94MB
06. Data Analysis - Spark SQL or HiveQL/102. Functions - Getting Started.mp410.85MB
05. Transform, Stage and Store - Spark/65. Transformations Overview.mp410.38MB
07. Data Ingest - real time, near real time and streaming analytics/119. Introduction.mp49.46MB
05. Transform, Stage and Store - Spark/57. Introduction.mp49.16MB
02. Scala Fundamentals/14. Tuples.mp49.05MB
08. Sample scenarios with solutions/148. Introduction to Sample Scenarios and Solutions.mp48.85MB
05. Transform, Stage and Store - Spark/88. Revision of Problem Statement and Design the solution.mp48.75MB