OTIOSE/ADULTHOOD/JUNIOR DATA CATALOG & DISCOVERY LEAD
A D U L T H O O D
The Corporate Bestiary
FILE RECORD: JUNIOR-DATA-CATALOG-DISCOVERY-LEAD
WHAT DOES A JUNIOR DATA CATALOG & DISCOVERY LEAD ACTUALLY DO?

Junior Data Catalog & Discovery Lead

[01] THE ORG-CHART ARCHITECTURE

* The organizational hierarchy defining the pressure flow and extraction cycle for this role.
KNOWN ALIASES / DISGUISES:
Metadata Steward (Junior)Data Taxonomy CoordinatorData Governance Analyst (Catalog Focus)Enterprise Data Documentarian

[02] THE HABITAT (NATURAL RANGE)

  • Large legacy enterprises undergoing 'digital transformation' initiatives.
  • Bureaucratic tech companies with rapidly expanding, undocumented data lakes.
  • Organizations implementing expensive data governance solutions without prior data hygiene.

[03] SALARY DELUSION

MARKET AVERAGE
$98,000
* Salary figures for 'junior data' roles vary wildly, with 'lead' in the title often inflating expectations beyond actual technical contribution, typically landing below a true Data Scientist.
"A wage paid for attempting to organize chaos, primarily through documentation that will be obsolete before it's thoroughly reviewed, let alone published."

[04] THE FLIGHT RISK

FLIGHT RISK:85%HIGH RISK
[DIAGNOSIS]The 'junior' aspect implies entry-level administrative work, making them an easy target for cost-cutting, especially when their 'leadership' is purely in documentation and meeting facilitation.

[05] THE BULLSHIT METRICS

Data Assets Cataloged YTD
A simple count of documented (not necessarily verified or accurate) data sources added to the catalog tool, prioritizing quantity over quality or actual usability.
Data Ownership Framework Adherence Score
An internally generated, subjective rating of how well teams *claim* to follow data ownership guidelines, primarily based on meeting attendance and survey responses.
Metadata Completeness Percentage
A calculated metric based on the number of filled-in fields within the catalog tool for each asset, irrespective of the actual accuracy, relevance, or usefulness of the metadata itself.

[06] SIGNATURE WEAPONRY

Data Governance Framework v2.1 (PowerPoint Edition)
An aspirational deck outlining data principles and responsibilities, frequently updated but rarely implemented, used primarily to justify the role's existence and future meetings.
Enterprise Data Catalog Tool (e.g., Collibra, Alation)
An expensive SaaS platform purchased to automate cataloging, which still requires significant manual input and is seldom fully adopted or trusted by actual data producers.
Data Ownership Matrix v3.0
A complex spreadsheet mapping data assets to 'owners' (usually managers who don't understand the data), creating a blame hierarchy for data quality issues rather than solving them.

[07] SURVIVAL / ENCOUNTER GUIDE

[IF ENGAGED:]Smile, nod, and quickly redirect them to the relevant engineering team; their 'discovery' rarely requires actual technical input, only data points to add to a slide deck.

[08] THE JD AUTOPSY: WHAT DO THEY ACTUALLY DO?

LINKEDIN ILLUSION
[SOURCE REDACTED]
"enhancing data discoverability, establishing data ownership frameworks and ensuring compliance to governance standards across the enterprise."
OTIOSE TRANSLATION
Aggregating existing, often contradictory, documentation into a single spreadsheet labeled 'Enterprise Data Assets v4.7', then scheduling 'alignment meetings' to ensure nobody uses the old spreadsheet.
LINKEDIN ILLUSION
[SOURCE REDACTED]
"Lead the identification and development of new data sources."
OTIOSE TRANSLATION
Adding new database names to a Jira ticket whenever an actual engineer mentions a new table, then conducting 'discovery sessions' to ask them what it does and if it's 'governed'.
LINKEDIN ILLUSION
[SOURCE REDACTED]
"Experience working with data cataloging, data quality, master data management, or metadata management tools."
OTIOSE TRANSLATION
Clicking through an expensive SaaS platform (e.g., Collibra, Alation) to demonstrate 'tool utilization' by manually entering metadata that the system was supposed to automate.

[09] DAY-IN-THE-LIFE LOG

[10:00 - 11:00]
Catalog Audit & Re-documentation
Scouring outdated wikis, Slack channels, and shared drives for data sources that may or may not still exist, then re-entering their details into the new 'official' catalog tool for the third time this year.
[13:00 - 14:00]
Discovery Sync & Follow-up
Attending a meeting with a senior engineer who explains, for the fourth time, why their data can't be easily cataloged due to 'legacy systems' or 'technical debt', then sending a follow-up email summarizing the non-progress.
[15:00 - 16:00]
Governance Framework Review & Slideshare
Reading slides on 'data stewardship best practices' for the quarterly all-hands meeting, wondering if anyone in the company actually understands or cares about the distinction between data domains.

[10] THE BURN WARD (UNFILTERED COMPLAINTS)

* The stark reality of the role, scraped from Reddit, Blind, and anonymous career boards.
"My title says 'Lead', but I spend 80% of my time trying to figure out who owns which Excel sheet. It's like being a librarian for a dumpster fire, but the books keep moving."
r/cscareerquestions
"They hired me to 'catalogue the enterprise data estate,' which means I'm basically a professional Googler, but for internal SharePoint links that are all broken and 'secure data' that nobody can access."
teamblind.com
"The 'discovery' part of my job is just endless meetings where we 'identify' data sources that have been known for years, then document them again in a new tool, because the old one wasn't 'enterprise-grade'."
r/datascience

[11] RELATED SPECIMENS

[VIEW FULL TAXONOMY] ↗
SYSTEM MATCH: 98%
Lead Backend Data Procurement Analyst
Spend weeks documenting trivial manual data entry, then propose a custom Python script that breaks every month, requiring constant maintenance from actual developers.
SYSTEM MATCH: 91%
Enterprise Architect
Preside over an endless cycle of abstract discussions, ensuring no single technical decision is made without involving a committee, thus guaranteeing maximum inefficiency.
SYSTEM MATCH: 84%
SDET
To craft intricate Rube Goldberg machines of automated 'checks' that prove the obvious, then spend cycles 'monitoring' their inevitable flakiness, ensuring a constant stream of 'maintenance' tasks to justify continued existence.
PRODUCED BYOTIOSEOTIOSE icon