1 / 8

Oracle Enterprise Data Quality

Oracle Enterprise Data Quality. The Phrase Profiler. The Phrase Profiler. Used alongside the Parse processor to profile: Names, addresses, product descriptions etc. Provides a quick way to build classification lists. E.g : Titles: Mr, Mrs, Ms, Miss, Dr.

onslow
Télécharger la présentation

Oracle Enterprise Data Quality

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Oracle Enterprise Data Quality The Phrase Profiler

  2. The Phrase Profiler • Used alongside the Parse processor to profile: • Names, addresses, product descriptions etc. • Provides a quick way to build classification lists. E.g: • Titles: Mr, Mrs, Ms, Miss, Dr. • Countries: UK, USA, France, Germany. • Product Categories: Dairy, Frozen, Bread, Meat, Fruit & Veg. • Assesses data to understand which parsing rules to apply.

  3. Common Words and Phrases • Example: names and addresses: Identified words and phrases Number of occurrences Locations of words and phrases

  4. Identify Misplaced Data • ‘Mr’ is stored in wrong attribute: • On investigating...

  5. Identify and Manage Ambiguities • ‘Victoria’ might be classified as a given name. • ‘Victoria Centre’ might be classified as a valid building.

  6. What is Reference Data? (Recap) • Tables of data stored within Enterprise Data Quality. • Can be used to store lists of any data used in project. • E.g. patterns, valid data, invalid data, characters. • Often used to check and improve working data. • Optional lookup column.

  7. Capture Reference Data • Create or add to lists of terms in reference data. • You can add to your lists iteratively. • Then use within the Parse processor.

  8. Lab Overview • Lab 1: Profiling Textual Data: • Create a Project. • Create a Data Store. • Create a Snapshot. • Create a Process. • Use the Phrase Profiler. • Adjust Phrase Profiler Options.

More Related