IBM InfoSphere BigMatch v11.4 for Apache Hadoop

IBM InfoSphere BigMatch v11.4 for Apache Hadoop Course Description

Duration: 2.00 days (16 hours)

The IBM InfoSphere Big Match on Hadoop course will introduce students to the Probabilistic Matching Engine (PME) and how it can be used to resolve and discover entities across multiple data sets in Hadoop.

Students will learn the basics of a PME algorithm including data model configuration, standardization, comparison and bucketing functions, weight generation, and threshold.

Next Class Dates

Contact us to customize this class with your own dates, times and location. You can also call 1-888-563-8266 or chat live with a Learning Consultant.

Back to Top

Intended Audience for this IBM InfoSphere BigMatch v11.4 for Apache Hadoop Course

  • » The course is designed for a technical audience that will be setting up a custom algorithm for the Probabilistic Matching Engine to use Big Match on Apache Hadoop to compare, match and/or search member records across multiple data sets

Back to Top

IBM InfoSphere BigMatch v11.4 for Apache Hadoop Course Objectives

  • » Understand the capabilities of the Probabilistic Matching Engine
  • » Understand how the Probabilistic Matching engine is used with Big Insights to solve certain use cases.
  • » Understand the technical framework of the Big Match solution and how member data is derived, bucketed and compared to produce a complete entity from multiple data sets.
  • » Create a project and data model using the Big Match Console
  • » Configure the HBase tables that will be used in a Big Match solution
  • » Configure an algorithm using he Big Match console that includes Standardization, Comparison and Bucketing functions.
  • » Set up Strings for Anonymous value, Equivalency values, Frequency values, and character maps using the Big Match console
  • » Set up and run the Weight Generation process
  • » Evaluate and set thresholds for the algorithm
  • » Deploy a new algorithm to Big Match
  • » Evaluate Entity results and reconfigure algorithm based on evaluation. E.g. Large Buckets, Large Entities, Member not belonging to any buckets, etc

Back to Top

IBM InfoSphere BigMatch v11.4 for Apache Hadoop Course Outline

      1. Introduction to Big Match for Apache Hadoop
        1. What is Big Match
        2. How Big Match Works
        3. Big Match Components
        4. Big Match Architecture
      2. Big Match Data Model Definition
        1. Members
        2. Attribute Types
        3. Member Attributes
        4. Sources
        5. Information Sources
      3. PME Algorithm
        1. Standardization
        2. Bucketing
        3. Comparison Functions
      4. Bucket Analysis
        1. Bucket Optimization
        2. Bucket Concerns
      5. Weights
        1. String Weights
        2. Numeric Weights
        3. Multi-dimensional Weights
        4. Troubleshooting Weights
      6. HBase Tables
        1. HBase concepts
        2. Big Match commands
        3. Big Match Tables (.pmebktidx, .pmemdmidx, .pmeentidx)
        4. Best Practices
      7. BigMatch Applications
        1. PME Derive
        2. PME Compare
        3. PME Link
        4. PME Analysis

Back to Top

Do you have the right background for IBM InfoSphere BigMatch v11.4 for Apache Hadoop?

Skills Assessment

We ensure your success by asking all students to take a FREE Skill Assessment test. These short, instructor-written tests are an objective measure of your current skills that help us determine whether or not you will be able to meet your goals by attending this course at your current skill level. If we determine that you need additional preparation or training in order to gain the most value from this course, we will recommend cost-effective solutions that you can use to get ready for the course.

Our required skill-assessments ensure that:

  1. All students in the class are at a comparable skill level, so the class can run smoothly without beginners slowing down the class for everyone else.
  2. NetCom students enjoy one of the industry's highest success rates, and pass rates when a certification exam is involved.
  3. We stay committed to providing you real value. Again, your success is paramount; we will register you only if you have the skills to succeed.
This assessment is for your benefit and best taken without any preparation or reference materials, so your skills can be objectively measured.

Take your FREE Skill Assessment test »

Back to Top

Award winning, world-class Instructors

Our instructors are passionate at teaching and are experts in their respective fields. Our average NetCom instructor has many, many years of real-world experience and impart their priceless, valuable knowledge to our students every single day. See our world-class instructors.   See more instructors...

Back to Top

Client Testimonials & Reviews about their Learning Experience

We are passionate in delivering the best learning experience for our students and they are happy to share their learning experience with us.
Read what students had to say about their experience at NetCom.   Read student testimonials...

Back to Top