Amazon SageMaker Catalog now helps customized metadata kinds and wealthy textual content descriptions on the column stage, extending present curation capabilities for enterprise names, descriptions, and glossary time period classifications.
With these new options, information stewards can outline and seize business-specific metadata immediately in particular person columns, and authors can use markdown-enabled wealthy textual content to supply detailed documentation and enterprise context. Each type fields and formatted descriptions are listed in actual time, making them instantly discoverable by way of catalog search.
Column-level context is crucial for understanding and trusting information. This launch helps organizations enhance information discoverability, collaboration, and governance by letting metadata stewards doc columns utilizing structured and formatted data that aligns with inner requirements.
On this put up, we present methods to improve information discovery in SageMaker Catalog with customized metadata kinds and wealthy textual content documentation on the schema stage.
Key capabilities
SageMaker Catalog now provides the next key capabilities:
- Customized metadata kinds – Information stewards can now use customized metadata kinds to seize organization-specific metadata fields for columns comparable to
Enterprise Proprietor,Regulatory Classification,Models of Measure, orAuthorised Use Case. Every discipline is saved as a key-value pair and listed for search, enabling business-level queries like “discover columns the place sensitivity = confidential.” - Wealthy textual content (markdown) descriptions – Every column helps a markdown-enabled description discipline. Authors can format textual content with headings, bullet lists, and hyperlinks so as to add deeper enterprise or operational context—for instance, logic definitions, pattern values, or information lineage references.
- Actual-time indexing for search – Customized type values and wealthy textual content content material are listed as quickly as they’re saved. Customers can search utilizing a metadata worth, key phrase, or glossary time period throughout columns.
Resolution overview
For this put up, we discover a monetary providers use case. Our instance monetary providers group defines a column metadata type that features a number of fields, as illustrated within the following desk.
| Area | Instance Worth |
| Authorised Use Case | Monetary income modeling |
| Enterprise Proprietor | Finance Workplace |
| Area | RF |
For a dataset column named income, the creator provides the next markdown description:
When analysts seek for Area = RF, this column seems in outcomes with full enterprise context.
Within the following sections, we display methods to use to make use of metadata kinds for columns and add wealthy textual content descriptions that’s searchable.
Conditions
To check this answer, you must have an Amazon SageMaker Unified Studio area arrange with a site proprietor or area unit proprietor privileges. You must also have an present undertaking to publish property and catalog property. For directions to create these property, see the Getting began information.
On this instance, we created a undertaking named financial_analysis and a check desk. To create an identical desk, see Get began with Amazon S3 Tables in Amazon SageMaker Unified Studio. To ingest the pattern information to SageMaker Catalog and generate enterprise metadata, see Create an Amazon SageMaker Unified Studio information supply for Amazon Redshift within the undertaking catalog.
Create new metadata type
Full the next steps to create a brand new metadata type:
- In SageMaker Unified Studio, go to your undertaking.
- Beneath Venture catalog within the navigation pane, select Metadata entities.
- Select Create metadata type.
- Present an non-obligatory show identify, a technical identify, and an non-obligatory description, then select Create metadata type.

- Outline the shape fields. On this instance, we add the fields
Area,Enterprise Proprietor, andAuthorised Use Case. - For Requirement Choices, choose the configuration for every discipline. For our use case, we choose All the time required.
- Select Create discipline.

- Activate Enabled so the shape is seen and can be utilized for property.

Connect metadata type to column
Full the next steps to connect the metadata type to a column:
- Beneath Venture catalog within the navigation pane, select Property.
- Seek for and choose your asset (for this instance, we use the asset
business_finance).
- On the Schema tab, select View/Edit subsequent to the
incomediscipline.
- Select Add metadata type.

- Select the shape you created and select Add.

- Add particulars for the metadata type fields

Add further context as formatted textual content
Subsequent, we enter a wealthy textual content description for every column utilizing the markdown editor, together with headings, bullet lists, hyperlinks, and pattern values. Full the next steps:
- Select Edit subsequent to README for the
incomediscipline the place you added the metadata type.
- Enter particulars and select Save.

- Select Preview to view the formatted README on the column stage.

Publish and confirm search
Now you’re able to publish the asset. The metadata type values and markdown descriptions turn into a part of the catalog file and are listed for search. You may as well see the historical past of revisions on the Historical past tab. Different undertaking customers can see the metadata type and wealthy textual content description for the printed property and subscribe to the information asset. You’ll be able to create extra information merchandise with these property, and they’ll even have the column metadata type and README.

Within the catalog search UI, information customers can now filter on customized type fields (for instance, “Area = RF”) or search in pure language for textual content that matches the column description.

Finest practices
Think about the next greatest practices when utilizing this characteristic:
- Outline metadata kinds aligned with your online business vocabulary (domains, homeowners, sensitivity ranges) proactively earlier than publishing property at scale.
- Make column descriptions actionable—embrace enterprise definitions, worth ranges, logic, replace cadence, and dependencies.
- Confirm the catalog indexing is well timed; publish adjustments proactively so search outcomes replicate new metadata.
- Use governance controls. You’ll be able to mix column-level metadata with present asset-level templates and approval workflows to implement publishing requirements.
- Monitor search utilization and metadata completeness; goal high-value datasets for full column-level documentation first.
- Don’t retailer confidential or delicate data in your metadata kinds.
Conclusion
With column-level metadata kinds and wealthy textual content descriptions, SageMaker Catalog helps organizations ship higher-quality metadata, stronger governance, and higher information discovery. These options make it simple for groups to seize full enterprise context and for analysts to rapidly find and perceive the information they want.
Customized metadata kinds and wealthy textual content descriptions on the column stage are actually accessible in AWS Areas the place SageMaker is supported.
To study extra about SageMaker, see the Amazon SageMaker Person Information. Get began with this functionality, check with the consumer information.
Concerning the Authors
