HDP DEVELOPER: APACHE STORM AND TRIDENT FUNDAMENTALS – GTHDP07

Course Description

This course provides a technical introduction to the fundamentals of Apache Storm and Trident that includes the concepts, terminology, architecture, installation, operation, and management of Storm and Trident. Simple Storm and Trident code excerpts are provided throughout the course. The course also includes an introduction to, and code samples for, Apache Kafka. Apache Kafka is a messaging system that is commonly used in concert with Storm and Trident.

 

Format

Self-paced, online exploration or
Instructor led exploration and discussion

 

Certification

Hortonworks offers a comprehensive certification program that identifies you as an expert in Apache Hadoop. Visit hortonworks.com/training/certification for more information.

HortonWorks University

Hortonworks University is your expert source for Apache Hadoop training and certification. Public and private on-site courses are available for developers, administrators, data analysts and other IT professionals involved in implementing big data solutions. Classes combine presentation material with industry-leading hands-on labs that fully prepare students for real-world Hadoop scenarios.

^^

Duration

2 days

^^

Target Audience

Hadoop developers who need to be able to design and build Storm and Kafka applications using Java and the Trident API.

^^

Course Prerequisites

Students must have experience developing Java applications and using a Java IDE. Labs are completed using the Eclipse IDE and Gradle. Students should have a basic understanding of Hadoop.

^^

Suggested Follow on Courses

There are various courses you could take depending on your business needs. Get in touch with us – we would be more than happy to discuss your training objectives with you.

^^

Course Content

Course Objectives

  • Recognize differences between batch and real-time data   processing
  • Define Storm elements including tuples, streams, spouts, topologies, worker processes, executors, and stream groupings
  • Explain and install Storm architectural components, including Nimbus, Supervisors, and ZooKeeper cluster
  • Recognize/interpret Java code for a spout, bolt, or topology.
  • Identify how to develop and submit a topology to a local or remote distributed cluster
  • Recognize and explain the differences between reliable and unreliable Storm operation
  • Manage and monitor Storm using the command-line client or browser-based Storm User Interface (UI)
  • Define Kafka topics, producers, consumers, and brokers
  • Publish Kafka messages to Storm or Trident topologies
  • Define Trident elements including tuples, streams, batches, partitions, topologies, Trident spouts, and operations
  • Recognize and interpret the code for Trident operations, including filters, functions, aggregations, merges, and joins
  • Recognize the differences between the different types of Trident state
  • Identify how Trident state supports exactly-once processing semantics and idempotent operation
  • Recognize the differences in fault tolerance between different types of Trident spouts
  • Recognize and interpret the code for Trident state-based operations

^^

See more Hadoop courses