Enterprise Data Catalog: Configuration and Maintenance (EDC-CM)

Objectives

After successfully completing this course, students should be able to:

  • Install and configure EDC according to the sizing requirements
  • Use the Catalog Administrator interface
  • Manage Resources, Schedules, Attributes, Synonyms and Connections
  • Configure reusable settings
  • Monitor and troubleshoot EDC
  • Manage Data Domains and Composite Data Domains
  • Extract metadata from data sources using the Universal Connectivity Framework
  • Create custom models and custom resource types

Target Audience

  • Administrator
  • Architect
  • Developer

Prerequisites

  • None

Agenda

Module 1: Overview of Enterprise Data Catalog

  • Key capabilities of EDC
  • EDC architecture
  • EDC features
  • Catalog administration tasks
  • EDC concepts
  • Catalog Administrator workspaces

Module 2: EDC Sizing and Pre-installation Phase

  • Differentiate the Low, Medium, and High sizing categories based on the size of data
  • Perform pre-installation steps

Module 3: Deployment

  • External Hadoop Cluster Deployment
  • Internal Hadoop Cluster Deployment

Module 4: Installation

  • EDC installation steps in different modes
  • Post-installation steps
  • Uninstall EDC

Module 5: Resource Management

  • System resources
  • Resource security
  • Connections
  • Connection types
  • Reusable Data Integration Service (DIS) configuration
  • Schedules

Module 6: Attribute Management and Synonyms

  • System and Custom attributes
  • Attribute properties
  • Edit system attributes
  • Create and use custom attributes
  • Synonym definition files
  • Import the synonym definition file in Catalog Administrator

Module 7: Data Domains

  • Resource characteristics
  • Resource types
  • Creating a resource
  • Editing a resource
  • Scanning a resource
  • System resources
  • Resource security

Module 8: Create a New Resource

  • Create a new resource
  • Synchronize Business Glossary with EDC

Module 9: Universal Connectivity Framework

  • Discuss the Universal Connectivity Framework
  • List the supported metadata sources
  • Create a resource type that represents the metadata source type
  • Create a resource for the defined resource type

Module 10: Custom Models and Resources

  • Custom metadata integration
  • Create and manage a custom model
  • Create the custom resource type
  • Create the custom resource
  • Custom scanners

Module 11: Performance Tuning

  • Performance tuning stages and parameters
  • EDC sizing recommendations
  • Tuning performance based on the size of the data
  • Tuning performance for similarity
  • Tuning the profile warehouse
  • Data Integration Service system requirements for profiling
  • Tuning for profiling performance
  • Data Integration Service parameters
  • Profile configuration in the Data Integration Service
  • Data Integration Service profiling properties

Module 12: Monitoring and Troubleshooting Enterprise Data Catalog

  • Monitor resources and tasks
  • Manage tasks
  • Apply filters to monitor tasks
  • Troubleshoot errors in EDC