Blog

Data Dictionary vs. Business Glossary (and How They Can Get Your Business and IT Teams on the Same Page) By Jason Lim

  |   RSS Feeds

As an organization evolves, it’s natural for the language to evolve, too. This is where a data dictionary and business glossary become useful. Discover what those are and how the benefits users.
The post Data Dictionary vs. Business Glossary (and How They Can Get Your Business and IT Teams on the Same Page) appeared first on Alation.

In your organization, are you ever confused by different definitions of business terms? Do you ever wonder why the number of customers differs between two reports?

I’m going to assume the answer is “yes.” And you are not alone.

As an organization evolves, it’s natural for the language to evolve, too. That’s why it’s critical that important terms be defined, documented, and made visible to everyone. This aligns understanding, so that, for example, a term like “customer” means the same thing across all departments.

This is where a data dictionary and business glossary become useful for getting both your business and IT teams on the same page.

What is a data dictionary?

As the name suggests, a data dictionary defines and describes technical data terms. Data terms could be database schemas, tables, or columns. It may include information about the data type, size, default values, constraints, relationships to other data, and the meaning or purpose of a given asset.

Who benefits from a data dictionary?

Typically, data dictionaries are designed for more technical audiences, like IT or data scientists.

These folks will reference the data dictionary to understand data elements, which allows them to manage, move, merge, and analyze data with clarity.

For complex projects, like data wrangling, modeling, or database design, a data dictionary is a helpful resource, especially to new hires. For these tasks, they may look to the data dictionary to ensure use of the right assets.

What is a business glossary?

A business glossary, on the other hand, defines and describes business terms and organizational nomenclature.

If you’re thinking “business term definitions” are straightforward, think again. For enterprises operating across many departments, regions, and contexts, a single term can have multiple meanings. This can become confusing and result in costly mistakes.

For example, the term “revenue” to the sales department may include gift cards, but to the finance department it excludes gift cards. “Customers” and “users” might sound the same, but be accounted for differently, depending on who you ask. The term “beta feature” may be clear internally, but not to the Customer Success team, who communicate with customers.

Who benefits from a business glossary?

A business glossary is useful for the business audience, or people working in functional departments, such as finance, marketing, or sales. When new people join an organization, they can look to the business glossary to learn the business language.

A business glossary helps an organization agree and align on internal definitions. How often have you faced a problem where you think a business term means one thing, but another team believes it means something else? If an executive is presented with two different reports describing the same term, without context on why they differ, how will they react? In all likelihood, they will distrust both reports.

Another benefit of the business glossary is self-service. Users can help themselves without asking around to find an answer. This in itself promotes efficiency and productivity for everyone.

Data dictionary vs. business glossary: differences

 
Data Dictionary
Business Glossary
Focus On
Physical data assets
Business concepts
Key Artifact
List of datasets/tables and fields/columns
List of business terms and their definitions
Goal
Understand data assets and databases
Define common vocabulary and understanding of basic concepts
Owned By
IT
Business
Scope
One per data source
One per organization
Application
Data modeling, database design, documentation of data sources
Data governance, requirements analysis
Example
USER_ID – Unique 7 digit identifying number of a user
“Active Days” – The number of days for which a visit to the app is recorded

Challenges of developing a data dictionary and business glossary

We’ve established the value of both a data dictionary and business glossary. Now the question becomes, how to create them?

On the surface, both sound easy and straightforward to develop. But there are indeed inherent challenges.

For a data dictionary, the volume of new data is constantly increasing. This makes it difficult to keep up with all the new data elements that need to be defined. To keep pace, IT or database administrators may automate data dictionary maintenance. Automation helps IT build and preserve the integrity of the data dictionary.

For a business glossary, it’s important to assign ownership. Definitions may differ depending on context or team. You must ask, who is responsible for defining the terms? Are definitions set by one person or by group consensus? Is there an approval process? These challenges require both a robust process and tool to manage a business glossary.

How to use a data dictionary and business glossary with Alation

Alation Data Dictionary

In Alation, this is what a data dictionary looks like:

Alation is a machine learning data catalog. Once connected to data sources in the environment, Alation automatically indexes data and populates catalog pages by source. In the example above, a data dictionary of columns in the “metrics” table has been generated.

The table shows the technical column name, a business title name, the data type, and popularity. Users can click on the blue column links to get more information and context about the columns.

Machine Learning and the Behavioral Analysis Engine (BAE)

Note that the business “Title” names are auto-titled; machine learning translates the technical column names into natural human language. For example, “ts_created” has been identified as “Timestamp Created” by the machine and confirmed by a person, as indicated by the green robot head AKA Allie.

“Popularity” is an Alation-specific measure of how much the column has been searched and queried by the users. This is calculated automatically through Alation’s Behavioral Analysis Engine (BAE), by accounting for how frequently and recently the columns have been used.

Having insight into the popularity of data is useful, because it lets people know what data is most worthy of their attention. It also offers insight into the most data that is most trusted, a good indicator of quality.

Alation Business Glossary

The business glossary in Alation is not only a place to look up terms and definitions, or assign owners. There are some unique features that make Alation’s business glossary easier to manage and scale.

Each glossary is made up of a collection of terms. For example, the “Financial KPI Metrics” glossary may include terms for “EBIT”, “CACC”, and ‘“Debt-to-equity” ratio. The Glossary columns can be customized for different fields, such as”‘Author”, “Description”, or “Status.”

To keep the glossary layout consistent, templates can be applied so that all terms contain the properties of the same useful information. By leveraging Alation’s wiki-like articles, users can @mention and link another data asset, person, or group in Alation. This makes it easy to connect related and relevant information for users to follow.

Agile Approval Helps the Business Glossary Grow

Typically the process of adding a new business glossary term can seem overly complicated. A group of people may need to come together to debate, agree, and assign an owner to a term.

By contrast, in Alation, a lightweight workflow mechanism called “Agile Approval” assigns reviewers to a term. In other words, anyone can add a term for speed and transparency. However, for that term to be validated, the designated approver must approve the new term, at which point a green “Approved” banner will indicate it is certified.

When creating a business glossary, the sheer volume of terms can present a challenge. Many terms risk being forgotten or missed.That’s why a mechanism to automatically identify new terms that are commonly used within your business is essential.

Alation’s Auto-Suggested Terms does just that. It finds popular terms and their associated technical abbreviations to show the related data objects where they appear, along with popularity. The new suggested terms can then either be added as a new glossary term or to an existing article.

Conclusion

Now you understand what a data dictionary and business glossary is and how they differ. In essence a data dictionary is for data terms and a business glossary is for business terms. They both have value in aligning technical teams and business teams around a shared understanding. Such alignment translates into setting the right goals, calculating metrics in the right way, and basing it on the right data.

To see Alation’s business glossary and data dictionary in action, join a weekly live demo.

Subscribe to Alation’s Blog

Get the latest data cataloging news and trends in your inbox.

The post Data Dictionary vs. Business Glossary (and How They Can Get Your Business and IT Teams on the Same Page) appeared first on Alation.