DATAENG 08 (Builder): Ontology Data Pipelines in Pipeline Builder

A refresher end-to-end tour of the Ontology, including how to optimize your pipeline for creating an Ontology in Pipeline Builder.

About this course

Data pipelines are not isolated artifacts; they’re always built for some purpose. In this training, the purpose of our pipeline is to prepare datasets to back a set of linked Ontology object types. You’ll have hands-on practice configuring the Ontology and updating a pipeline to optimize the backing datasets and make them resilient to upstream data changes that would alter the shape of your Ontology. Finally, a collection of exercises will walk you through common troubleshooting steps.

⚠️ Course prerequisites

  • DATAENG 07: Configuring Data Expectations: If you have not completed this course, please do so now.
  • Necessary permissions to edit your organization’s Ontology. Please reach out to your program administrator or Palantir point of contact if you need authorization.

📖 Learning Objectives

  1. Understand core Ontology concepts, basic use cases, and design decisions.
  2. Search and filter object types in the Object Explorer.
  3. Know how to create new object and link types in the Ontology Manager.
  4. Optimize backing datasets for use in the Ontology in accordance with best practices.
  5. Apply checks to your pipeline to ensure object types are resilient to upstream data changes.
  6. Understand the Ontology storage and retrieval architecture.
  7. Develop an experience-based understanding of "writeback" and its place in a data pipeline.
  8. Practice troubleshooting common Ontology synchronization issues.
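
To make objectives 4 and 5 concrete, here is a minimal sketch of the kind of checks a backing dataset typically needs before it backs an object type: every expected column is present, and the primary key is non-null and unique. This is plain Python over a list of dictionaries, not Pipeline Builder or any Foundry API; the function name `check_backing_rows` and the column names `alert_id`, `status`, and `priority` are hypothetical and chosen only for illustration.

```python
from collections import Counter


def check_backing_rows(rows, primary_key, expected_columns):
    """Return a list of problems found in rows (hypothetical helper, not a Foundry API)."""
    problems = []

    # A column that disappears upstream is a destructive schema change.
    seen_columns = set()
    for row in rows:
        seen_columns.update(row.keys())
    missing = sorted(set(expected_columns) - seen_columns)
    if missing:
        problems.append(f"missing columns: {missing}")

    # Each object needs a primary key value, so nulls are flagged.
    keys = [row.get(primary_key) for row in rows]
    null_count = sum(1 for key in keys if key is None)
    if null_count:
        problems.append(f"{null_count} row(s) with null {primary_key}")

    # Primary keys must also be unique across the dataset.
    counts = Counter(key for key in keys if key is not None)
    duplicates = sorted(key for key, n in counts.items() if n > 1)
    if duplicates:
        problems.append(f"duplicate {primary_key} values: {duplicates}")

    return problems


# Illustrative rows only; the values are made up for this sketch.
sample_rows = [
    {"alert_id": "A1", "status": "open"},
    {"alert_id": "A1", "status": "closed"},
    {"alert_id": None, "status": "open"},
]
for problem in check_backing_rows(sample_rows, "alert_id", ["alert_id", "status", "priority"]):
    print(problem)
```

In the course itself, checks like these are configured in the pipeline rather than hand-written; the sketch only illustrates what they guard against.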

💪 Foundry Skills

  • Use the Object Explorer to search, filter, and generally understand the Ontology.
  • Use the Ontology Manager to create and edit Ontology object types and link types.
  • Update pipeline transforms to optimize a backing dataset for synchronization with the Ontology backend.
  • Create "writeback" datasets and position them in your data pipeline.
  • Accommodate additive and destructive schema changes via the object storage interface in the Ontology Management Application.

Curriculum

  • Introduction
      • About this Course
  • The Foundry Ontology
      • What is the Foundry Ontology?
      • Viewing your Ontology
      • The Backing Dataset
      • The Ontology in Data Lineage
      • Ontology Design Considerations
      • Exercise Summary
  • Building Ontology Data Pipelines
      • Ontology Datasource Preparation Guidance
      • Checking your Backing Datasets
      • Building an Ontology Project
      • Ontology Data Transforms
      • Documenting and Updating Your Pipeline
      • Exercise Summary
  • Configuring the Ontology
      • Configuring your Flight Alert Object Type
      • Configuring Object Type Properties
      • Introduction to Link Types
      • Saving your Ontology Changes
      • What Happens when you Save?
      • Configuring your Passenger Object Type
      • Configuring your Passenger Object Type Properties
      • Configuring your Many-to-Many Link Type
      • Reviewing Your Ontology
      • Exercise Summary
  • Ontology Edits through Writeback
      • Writeback Overview
      • Creating Writeback Datasets
      • Enabling Writeback on Links
      • Scheduling Writeback Dataset Builds
      • Exercise Summary
  • Ontology Changes and Troubleshooting
      • Additive Backing Dataset Changes: Part 1
      • Additive Backing Dataset Changes: Part 2
      • Destructive Backing Dataset Changes: Part 1
      • Destructive Backing Dataset Changes: Part 2
      • Exercise Summary
  • Update your Ontology Status
      • Update your Ontology Status
  • Conclusion
      • Key Takeaways
      • Next Steps
