Pipeline run
fc829bda-6a1e-4086-befc-133024825a6d
Client output enrichment
v2 Skill cluster · Nature of work · AI index · Tech stack maturity · Evidence · KRA descriptionvocab breakdown (legacy)
Signals
Post-classification
Captured for admin review
1 POST /skills/extract-from-jd
2 POST /skills/extract-details
3 POST /skills/final-role-output
Data Engineer
CASE Aslug: data-engineer · id: 2 · source: db
Exact alias hit on data-engineer (1.0) — no other alias at this confidence; skill_top absent does not contradict
Resolution:
in_db
— role exists in library; skill↔dim and role↔dim links saved when applicable.
Job description
To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts. Job Category Products and Technology Job Details Senior Data Engineer (Sales & CS) As a Data Engineer, working in our Business Technology - Data and Analytics team, you will be working cross-functionally with business domain experts, analytics, and engineering teams to design and build data pipelines for our Data Warehouse. The data pipelines you build and maintain will enable our business partners and decision makers to get actionable insights from our Product and Corporate Systems. You would also be responsible for enhancements and maintaining our data warehouse. This is a great role for people passionate about working with data and data systems, and who love solving problems. It is for people who love technical challenges and are always looking for ways to improve existing software, processes and infrastructure. They are self-starters, detail and quality oriented. In short, we are looking for someone who takes pride in the craft and wants to be part of a team of talented data engineers and architects working to further Slack’s growth. If this role has your name written all over it, please contact us with a resume so that we can explore further. Responsibilities Design and build data pipelines from various cloud data sources for the enterprise data warehouse Partner with Data Engineers, Data architects, domain experts, data analysts and other teams to build foundational data sets that are trusted, well understood, aligned with business strategy and enable self-service Own and document data pipelines and data lineage Support and Maintain analytics tech ecosystem (data warehouse, ETL and BI tools) Continuously identify areas of improvement and ensure application of standards and best practices Requirements BS degree in Computer Science or other engineering discipline. 5+ years of experience working as a Data Engineer or a similar role. 2+ years of experience working with Sales data analytics or sourcing data from sales systems Very strong experience in writing complex SQLs and dimensional modeling Hands-on experience working with data warehouse technologies (Snowflake, Redshift) and Big Data technologies (e.g Hadoop, Hive, Spark) Hands on experience building data pipelines using ETL tools (e.g. Informatica, Matillion, Snaplogic) sourcing data from SaaS applications Proficiency with Python Ability to work on multiple areas like Data pipeline ETL, Data modeling & design, writing complex SQL queries etc. Ability to build the automation processes for the data quality and data reconciliation Understanding of CRM systems such as Salesforce is a big plus. Understanding of sales metrics for SaaS companies is a big plus. Proficiency in Airflow is a big plus. Passionate about various technologies including but not limited to SQL/No SQL/MPP databases etc. Excellent written and verbal communication and interpersonal skills, able to effectively collaborate with technical and business partners Slack is the collaboration hub of choice for companies of all sizes, all across the world. By using Slack, they ensure that the right people are always in the loop, that key information is always at their fingertips, and new team members can get up to speed easily. With Slack, teams are better connected. Launched in February 2014, Slack is the fastest growing business application ever and is used by thousands of teams and millions of users every day. We currently have many offices worldwide, including in San Francisco, Vancouver, Dublin, Melbourne, New York, London, Tokyo, Toronto, Denver and Pune. Ensuring a diverse and inclusive workplace where we learn from each other is core to Slack's values. We welcome people of different backgrounds, experiences, abilities and perspectives. We are an equal opportunity employer and a pleasant and supportive place to work. Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records. Come do the best work of your life here at Slack. Accommodations If you require assistance due to a disability applying for open positions please submit a request via this Accommodations Request Form . Posting Statement At Salesforce we believe that the business of business is to improve the state of our world. Each of us has a responsibility to drive Equality in our communities and workplaces. We are committed to creating a workforce that reflects society through inclusive programs and initiatives such as equal pay, employee resource groups, inclusive benefits, and more. Learn more about Equality at Salesforce and explore our benefits. Salesforce.com and Salesforce.org are Equal Employment Opportunity and Affirmative Action Employers. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender perception or identity, national origin, age, marital status, protected veteran status, or disability status. Salesforce.com and Salesforce.org do not accept unsolicited headhunter and agency resumes. Salesforce.com and Salesforce.org will not pay any third-party agency or company that does not have a signed agreement with Salesforce.com or Salesforce.org . Salesforce welcomes all.
Skills from this JD
Each row merges API 1 extraction, API 2 library match / v3 orchestration (dimensions + locked dims), and API 3 persistence tags.
Skill enrichment (orchestrator / LLM)
No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).
- Category
- Data Engineering Tools
- Sub-category
- general
- Skill nature
- CONCEPT
- Volatility
- MEDIUM
- Typical lifespan
- MULTI_YEAR
- Version strategy
- UNVERSIONED
Skill enrichment (orchestrator / LLM)
No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).
- Category
- Cloud Platforms
- Sub-category
- general
- Skill nature
- PLATFORM
- Volatility
- MEDIUM
- Typical lifespan
- MULTI_YEAR
- Version strategy
- UNVERSIONED
Skill enrichment (orchestrator / LLM)
No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).
- Category
- Databases
- Sub-category
- general
- Skill nature
- CONCEPT
- Volatility
- MEDIUM
- Typical lifespan
- MULTI_YEAR
- Version strategy
- UNVERSIONED
Skill enrichment (orchestrator / LLM)
No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).
- Category
- Data Engineering Tools
- Sub-category
- general
- Skill nature
- PRACTICE
- Volatility
- MEDIUM
- Typical lifespan
- MULTI_YEAR
- Version strategy
- UNVERSIONED
Skill enrichment (orchestrator / LLM)
No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).
- Category
- Data Engineering Tools
- Sub-category
- general
- Skill nature
- TOOL
- Volatility
- MEDIUM
- Typical lifespan
- MULTI_YEAR
- Version strategy
- UNVERSIONED
Skill enrichment (orchestrator / LLM)
No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).
- Category
- Data Engineering Tools
- Sub-category
- general
- Skill nature
- CONCEPT
- Volatility
- MEDIUM
- Typical lifespan
- MULTI_YEAR
- Version strategy
- UNVERSIONED
Skill enrichment (orchestrator / LLM)
No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).
- Category
- Best Practices
- Sub-category
- general
- Skill nature
- CONCEPT
- Volatility
- STABLE
- Typical lifespan
- EVERGREEN
- Version strategy
- UNVERSIONED
Skill enrichment (orchestrator / LLM)
No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).
- Category
- Best Practices
- Sub-category
- general
- Skill nature
- PRACTICE
- Volatility
- STABLE
- Typical lifespan
- EVERGREEN
- Version strategy
- UNVERSIONED
Library artifacts (this run)
| Kind | Detail | DB id |
|---|---|---|
| canonical_skill_proposed | Data Pipelines | type=Data Engineering Tools subtype=general nature=CONCEPT lifespan=MULTI_YEAR | |
| canonical_skill_proposed | Cloud Data Sources | type=Cloud Platforms subtype=general nature=PLATFORM lifespan=MULTI_YEAR | |
| canonical_skill_proposed | Data Warehouse | type=Databases subtype=general nature=CONCEPT lifespan=MULTI_YEAR | |
| canonical_skill_proposed | ETL | type=Data Engineering Tools subtype=general nature=PRACTICE lifespan=MULTI_YEAR | |
| canonical_skill_proposed | BI Tools | type=Data Engineering Tools subtype=general nature=TOOL lifespan=MULTI_YEAR | |
| canonical_skill_proposed | Data Lineage | type=Data Engineering Tools subtype=general nature=CONCEPT lifespan=MULTI_YEAR | |
| canonical_skill_proposed | Standards | type=Best Practices subtype=general nature=CONCEPT lifespan=EVERGREEN | |
| canonical_skill_proposed | Best Practices | type=Best Practices subtype=general nature=PRACTICE lifespan=EVERGREEN |
nano JD Parser — gpt-4.1-nano click to toggle
Show raw JSON
{
"JD_type": "pass",
"about_company": {
"source_marker": {
"first_5_words": "Slack is the collaboration hub",
"last_5_words": "teams are better connected."
},
"text": "Slack is the collaboration hub of choice for companies of all sizes, all across the world. By using Slack, they ensure that the right people are always in the loop, that key information is always at their fingertips, and new team members can get up to speed easily. With Slack, teams are better connected.",
"word_count": 64
},
"certifications": [],
"company_name": "Salesforce",
"ctc": null,
"domain": {
"primary": {
"aliases": [
"SaaS",
"Cloud Computing"
],
"domain": "Software \u0026 SaaS Products"
},
"secondary": null
},
"education": [
{
"level": "Bachelor\u0027s",
"qualification": "BTECH/BE/BSC - Computer Science (or other engineering discipline)",
"raw": "BS degree in Computer Science or other engineering discipline.",
"requirement": "required"
}
],
"experience": {
"max": null,
"min": 5,
"raw": "5+ years of experience working as a Data Engineer or a similar role."
},
"job_locations": [
{
"aliases": [
"SF"
],
"city": "San Francisco",
"country": "United States",
"state": "California",
"work_mode": "null"
},
{
"aliases": [],
"city": "Vancouver",
"country": "Canada",
"state": "British Columbia",
"work_mode": "null"
},
{
"aliases": [],
"city": "Dublin",
"country": "Ireland",
"state": "Dublin",
"work_mode": "null"
},
{
"aliases": [],
"city": "Melbourne",
"country": "Australia",
"state": "Victoria",
"work_mode": "null"
},
{
"aliases": [],
"city": "New York",
"country": "United States",
"state": "New York",
"work_mode": "null"
},
{
"aliases": [],
"city": "London",
"country": "United Kingdom",
"state": "England",
"work_mode": "null"
},
{
"aliases": [],
"city": "Tokyo",
"country": "Japan",
"state": "Tokyo",
"work_mode": "null"
},
{
"aliases": [],
"city": "Toronto",
"country": "Canada",
"state": "Ontario",
"work_mode": "null"
},
{
"aliases": [],
"city": "Denver",
"country": "United States",
"state": "Colorado",
"work_mode": "null"
},
{
"aliases": [],
"city": "Pune",
"country": "India",
"state": "Maharashtra",
"work_mode": "null"
}
],
"role": "Senior Data Engineer (Sales \u0026 CS)",
"role_aliases": [
"Data Engineer",
"Senior Data Engineer",
"Data Pipeline Engineer"
],
"role_archetype": "Data",
"roles_and_responsibilities": [
{
"bullet_count": 5,
"heading": "Responsibilities",
"heading_was_present": true,
"source_marker": {
"first_5_words": "Design and build data pipelines",
"last_5_words": "standards and best practices."
},
"text": "Design and build data pipelines from various cloud data sources for the enterprise data warehouse.\nPartner with Data Engineers, Data architects, domain experts, data analysts and other teams to build foundational data sets that are trusted, well understood, aligned with business strategy and enable self-service.\nOwn and document data pipelines and data lineage.\nSupport and Maintain analytics tech ecosystem (data warehouse, ETL and BI tools).\nContinuously identify areas of improvement and ensure application of standards and best practices.",
"word_count": 64
}
],
"urls": []
}
API 1 — extract-from-jd click to toggle
{
"final_skills": [
{
"is_primary": true,
"skill_name": "Data Pipelines"
},
{
"is_primary": true,
"skill_name": "Cloud Data Sources"
},
{
"is_primary": true,
"skill_name": "Data Warehouse"
},
{
"is_primary": true,
"skill_name": "ETL"
},
{
"is_primary": true,
"skill_name": "BI Tools"
},
{
"is_primary": false,
"skill_name": "Data Lineage"
},
{
"is_primary": false,
"skill_name": "Standards"
},
{
"is_primary": false,
"skill_name": "Best Practices"
}
],
"jd_role": {
"display_name": "Senior Data Engineer (Sales \u0026 CS)",
"rationale": null,
"role_aliases": [
"Data Engineer",
"Senior Data Engineer",
"Data Pipeline Engineer"
],
"role_archetype": "Data",
"slug": ""
},
"nano_parsed": {
"JD_type": "pass",
"about_company": {
"source_marker": {
"first_5_words": "Slack is the collaboration hub",
"last_5_words": "teams are better connected."
},
"text": "Slack is the collaboration hub of choice for companies of all sizes, all across the world. By using Slack, they ensure that the right people are always in the loop, that key information is always at their fingertips, and new team members can get up to speed easily. With Slack, teams are better connected.",
"word_count": 64
},
"certifications": [],
"company_name": "Salesforce",
"ctc": null,
"domain": {
"primary": {
"aliases": [
"SaaS",
"Cloud Computing"
],
"domain": "Software \u0026 SaaS Products"
},
"secondary": null
},
"education": [
{
"level": "Bachelor\u0027s",
"qualification": "BTECH/BE/BSC - Computer Science (or other engineering discipline)",
"raw": "BS degree in Computer Science or other engineering discipline.",
"requirement": "required"
}
],
"experience": {
"max": null,
"min": 5,
"raw": "5+ years of experience working as a Data Engineer or a similar role."
},
"job_locations": [
{
"aliases": [
"SF"
],
"city": "San Francisco",
"country": "United States",
"state": "California",
"work_mode": "null"
},
{
"aliases": [],
"city": "Vancouver",
"country": "Canada",
"state": "British Columbia",
"work_mode": "null"
},
{
"aliases": [],
"city": "Dublin",
"country": "Ireland",
"state": "Dublin",
"work_mode": "null"
},
{
"aliases": [],
"city": "Melbourne",
"country": "Australia",
"state": "Victoria",
"work_mode": "null"
},
{
"aliases": [],
"city": "New York",
"country": "United States",
"state": "New York",
"work_mode": "null"
},
{
"aliases": [],
"city": "London",
"country": "United Kingdom",
"state": "England",
"work_mode": "null"
},
{
"aliases": [],
"city": "Tokyo",
"country": "Japan",
"state": "Tokyo",
"work_mode": "null"
},
{
"aliases": [],
"city": "Toronto",
"country": "Canada",
"state": "Ontario",
"work_mode": "null"
},
{
"aliases": [],
"city": "Denver",
"country": "United States",
"state": "Colorado",
"work_mode": "null"
},
{
"aliases": [],
"city": "Pune",
"country": "India",
"state": "Maharashtra",
"work_mode": "null"
}
],
"role": "Senior Data Engineer (Sales \u0026 CS)",
"role_aliases": [
"Data Engineer",
"Senior Data Engineer",
"Data Pipeline Engineer"
],
"role_archetype": "Data",
"roles_and_responsibilities": [
{
"bullet_count": 5,
"heading": "Responsibilities",
"heading_was_present": true,
"source_marker": {
"first_5_words": "Design and build data pipelines",
"last_5_words": "standards and best practices."
},
"text": "Design and build data pipelines from various cloud data sources for the enterprise data warehouse.\nPartner with Data Engineers, Data architects, domain experts, data analysts and other teams to build foundational data sets that are trusted, well understood, aligned with business strategy and enable self-service.\nOwn and document data pipelines and data lineage.\nSupport and Maintain analytics tech ecosystem (data warehouse, ETL and BI tools).\nContinuously identify areas of improvement and ensure application of standards and best practices.",
"word_count": 64
}
],
"urls": []
},
"rejected": false,
"rejection_reason": null,
"run_id": "fc829bda-6a1e-4086-befc-133024825a6d",
"stage3_signals": {
"alias_found": true,
"alias_match_roles": [
{
"display_name": "Data Engineer",
"kra_matches": null,
"matched_count": null,
"matched_skills": null,
"role_id": 2,
"score": 1.0,
"slug": "data-engineer",
"total_count": null
}
],
"kra_match_roles": [
{
"display_name": "Data Engineer",
"kra_matches": [
{
"kra_text": "Works with data analysts, data scientists, and business stakeholders to define data models, ingestion schedules, and data delivery requirements.",
"sentence": "Partner with Data Engineers, Data architects, domain experts, data analysts and other teams to build foundational data sets that are trusted, well understood, aligned with business strategy and enable self-service.",
"similarity": 0.6885
},
{
"kra_text": "Builds data ingestion pipelines to collect data from transactional databases, third-party APIs, event streams, and file sources into centralized data platforms.",
"sentence": "Design and build data pipelines from various cloud data sources for the enterprise data warehouse.",
"similarity": 0.6694
},
{
"kra_text": "Maintains data catalog entries, column-level data lineage, and technical documentation to support data discoverability and governance across the organization.",
"sentence": "Own and document data pipelines and data lineage.",
"similarity": 0.6361
}
],
"matched_count": null,
"matched_skills": null,
"role_id": 2,
"score": 0.6646,
"slug": "data-engineer",
"total_count": null
},
{
"display_name": "Svelte Frontend Developer",
"kra_matches": [
{
"kra_text": "backend data integration",
"sentence": "Design and build data pipelines from various cloud data sources for the enterprise data warehouse.",
"similarity": 0.4725
},
{
"kra_text": "backend data integration",
"sentence": "Support and Maintain analytics tech ecosystem (data warehouse, ETL and BI tools).",
"similarity": 0.4687
},
{
"kra_text": "backend data integration",
"sentence": "Partner with Data Engineers, Data architects, domain experts, data analysts and other teams to build foundational data sets that are trusted, well understood, aligned with business strategy and enable self-service.",
"similarity": 0.4667
}
],
"matched_count": null,
"matched_skills": null,
"role_id": 92,
"score": 0.4693,
"slug": "svelte-frontend-developer",
"total_count": null
},
{
"display_name": "ML Engineer",
"kra_matches": [
{
"kra_text": "Prepares, cleans, and transforms training datasets, manages feature stores, and builds feature engineering pipelines for model training.",
"sentence": "Partner with Data Engineers, Data architects, domain experts, data analysts and other teams to build foundational data sets that are trusted, well understood, aligned with business strategy and enable self-service.",
"similarity": 0.4905
},
{
"kra_text": "Designs end-to-end ML training pipelines and model inference workflows using TensorFlow, PyTorch, or scikit-learn on cloud ML platforms.",
"sentence": "Design and build data pipelines from various cloud data sources for the enterprise data warehouse.",
"similarity": 0.4672
},
{
"kra_text": "Prepares, cleans, and transforms training datasets, manages feature stores, and builds feature engineering pipelines for model training.",
"sentence": "Support and Maintain analytics tech ecosystem (data warehouse, ETL and BI tools).",
"similarity": 0.4435
}
],
"matched_count": null,
"matched_skills": null,
"role_id": 3,
"score": 0.4671,
"slug": "ml-engineer",
"total_count": null
},
{
"display_name": "MLOps Engineer",
"kra_matches": [
{
"kra_text": "Maintains model versioning, experiment lineage, and artifact tracking using MLflow, DVC, or Weights \u0026 Biases for reproducibility and auditability.",
"sentence": "Own and document data pipelines and data lineage.",
"similarity": 0.5024
},
{
"kra_text": "Supports ML platform incidents by diagnosing model serving failures, feature store pipeline breaks, and training environment configuration issues.",
"sentence": "Support and Maintain analytics tech ecosystem (data warehouse, ETL and BI tools).",
"similarity": 0.4656
},
{
"kra_text": "Validates model performance benchmarks, data schema contracts, and system integration health before signing off on production release readiness.",
"sentence": "Partner with Data Engineers, Data architects, domain experts, data analysts and other teams to build foundational data sets that are trusted, well understood, aligned with business strategy and enable self-service.",
"similarity": 0.4216
}
],
"matched_count": null,
"matched_skills": null,
"role_id": 16,
"score": 0.4632,
"slug": "ml-ops-engineer",
"total_count": null
},
{
"display_name": "DevOps Engineer",
"kra_matches": [
{
"kra_text": "Collaborates with development teams to improve build processes, reduce deployment friction, containerize applications, and adopt DevOps best practices.",
"sentence": "Continuously identify areas of improvement and ensure application of standards and best practices.",
"similarity": 0.5096
},
{
"kra_text": "Collaborates with development teams to improve build processes, reduce deployment friction, containerize applications, and adopt DevOps best practices.",
"sentence": "Partner with Data Engineers, Data architects, domain experts, data analysts and other teams to build foundational data sets that are trusted, well understood, aligned with business strategy and enable self-service.",
"similarity": 0.4324
},
{
"kra_text": "Builds and maintains CI/CD pipelines using Jenkins, GitHub Actions, GitLab CI, or CircleCI to automate build, test, security scanning, and deployment workflows.",
"sentence": "Own and document data pipelines and data lineage.",
"similarity": 0.423
}
],
"matched_count": null,
"matched_skills": null,
"role_id": 10,
"score": 0.455,
"slug": "devops-engineer",
"total_count": null
}
],
"skill_match_roles": []
},
"stage4_decision": {
"alias_collision_detected": false,
"case": "A",
"chosen_role": {
"display_name": "Data Engineer",
"kra_matches": null,
"matched_count": null,
"matched_skills": null,
"role_id": 2,
"score": 1.0,
"slug": "data-engineer",
"total_count": null
},
"confidence": 1.0,
"is_new_role": false,
"llm2_fired": false,
"llm2_reasoning": null,
"matched_dimensions": [],
"matched_kras": [],
"matched_skills": [],
"new_role_display_name": null,
"new_role_slug": null,
"queued": false,
"reasoning": "Exact alias hit on data-engineer (1.0) \u2014 no other alias at this confidence; skill_top absent does not contradict",
"sub_role": null
},
"stage5_updates": {
"centroid_n_after": 361,
"centroid_updated": true,
"collision_log_id": null,
"new_kra_attached": null,
"new_skills_attached": [
{
"is_primary": true,
"queue_id": 17103,
"role_display_name": "Data Engineer",
"role_slug": "data-engineer",
"skill_name": "Data Pipelines",
"status": "pending"
},
{
"is_primary": true,
"queue_id": 17104,
"role_display_name": "Data Engineer",
"role_slug": "data-engineer",
"skill_name": "Cloud Data Sources",
"status": "pending"
},
{
"is_primary": true,
"queue_id": 17105,
"role_display_name": "Data Engineer",
"role_slug": "data-engineer",
"skill_name": "Data Warehouse",
"status": "pending"
},
{
"is_primary": true,
"queue_id": 17106,
"role_display_name": "Data Engineer",
"role_slug": "data-engineer",
"skill_name": "ETL",
"status": "pending"
},
{
"is_primary": true,
"queue_id": 17107,
"role_display_name": "Data Engineer",
"role_slug": "data-engineer",
"skill_name": "BI Tools",
"status": "pending"
},
{
"is_primary": false,
"queue_id": 17108,
"role_display_name": "Data Engineer",
"role_slug": "data-engineer",
"skill_name": "Data Lineage",
"status": "pending"
},
{
"is_primary": false,
"queue_id": 17109,
"role_display_name": "Data Engineer",
"role_slug": "data-engineer",
"skill_name": "Standards",
"status": "pending"
},
{
"is_primary": false,
"queue_id": 17110,
"role_display_name": "Data Engineer",
"role_slug": "data-engineer",
"skill_name": "Best Practices",
"status": "pending"
}
],
"queue_entry_id": null,
"v3_pipeline_triggered": false,
"v3_role_slug": null,
"v3_run_id": null
}
}
API 2 — extract-details
{
"alias_matches": [],
"candidate_roles": [],
"chosen_role": {
"display_name": "Data Engineer",
"id": 2,
"rationale": "Exact alias hit on data-engineer (1.0) \u2014 no other alias at this confidence; skill_top absent does not contradict",
"role_archetype": null,
"slug": "data-engineer",
"source": "db"
},
"dimensions": [],
"input_final_skills": [
"Data Pipelines",
"Cloud Data Sources",
"Data Warehouse",
"ETL",
"BI Tools",
"Data Lineage",
"Standards",
"Best Practices"
],
"input_llm_skills": [
"Data Pipelines",
"Cloud Data Sources",
"Data Warehouse",
"ETL",
"BI Tools",
"Data Lineage",
"Standards",
"Best Practices"
],
"new_aliases_persisted": 0,
"run_id": "fc829bda-6a1e-4086-befc-133024825a6d",
"skills_detail": [
{
"aliases_in_db": [],
"canonical": null,
"dimensions": [],
"input_skill": "Data Pipelines",
"matched_via": null,
"new_alias_persisted": false,
"new_alias_text": null,
"new_skill_meta": {
"derived": {
"category": "Data Engineering Tools",
"skill_nature": "CONCEPT",
"sub_category": "general",
"typical_lifespan": "MULTI_YEAR",
"version_strategy": "UNVERSIONED",
"volatility": "MEDIUM"
},
"enrichment": null,
"keep_log": [],
"locked_dimensions": [],
"merge_log": [],
"placed": null,
"relationships": null,
"skill_id": "data-pipelines",
"split_log": [],
"typed": null,
"warnings": []
},
"source_tag": "llm",
"was_in_llm_skills": true
},
{
"aliases_in_db": [],
"canonical": null,
"dimensions": [],
"input_skill": "Cloud Data Sources",
"matched_via": null,
"new_alias_persisted": false,
"new_alias_text": null,
"new_skill_meta": {
"derived": {
"category": "Cloud Platforms",
"skill_nature": "PLATFORM",
"sub_category": "general",
"typical_lifespan": "MULTI_YEAR",
"version_strategy": "UNVERSIONED",
"volatility": "MEDIUM"
},
"enrichment": null,
"keep_log": [],
"locked_dimensions": [],
"merge_log": [],
"placed": null,
"relationships": null,
"skill_id": "cloud-data-sources",
"split_log": [],
"typed": null,
"warnings": []
},
"source_tag": "llm",
"was_in_llm_skills": true
},
{
"aliases_in_db": [],
"canonical": null,
"dimensions": [],
"input_skill": "Data Warehouse",
"matched_via": null,
"new_alias_persisted": false,
"new_alias_text": null,
"new_skill_meta": {
"derived": {
"category": "Databases",
"skill_nature": "CONCEPT",
"sub_category": "general",
"typical_lifespan": "MULTI_YEAR",
"version_strategy": "UNVERSIONED",
"volatility": "MEDIUM"
},
"enrichment": null,
"keep_log": [],
"locked_dimensions": [],
"merge_log": [],
"placed": null,
"relationships": null,
"skill_id": "data-warehouse",
"split_log": [],
"typed": null,
"warnings": []
},
"source_tag": "llm",
"was_in_llm_skills": true
},
{
"aliases_in_db": [],
"canonical": null,
"dimensions": [],
"input_skill": "ETL",
"matched_via": null,
"new_alias_persisted": false,
"new_alias_text": null,
"new_skill_meta": {
"derived": {
"category": "Data Engineering Tools",
"skill_nature": "PRACTICE",
"sub_category": "general",
"typical_lifespan": "MULTI_YEAR",
"version_strategy": "UNVERSIONED",
"volatility": "MEDIUM"
},
"enrichment": null,
"keep_log": [],
"locked_dimensions": [],
"merge_log": [],
"placed": null,
"relationships": null,
"skill_id": "etl",
"split_log": [],
"typed": null,
"warnings": []
},
"source_tag": "llm",
"was_in_llm_skills": true
},
{
"aliases_in_db": [],
"canonical": null,
"dimensions": [],
"input_skill": "BI Tools",
"matched_via": null,
"new_alias_persisted": false,
"new_alias_text": null,
"new_skill_meta": {
"derived": {
"category": "Data Engineering Tools",
"skill_nature": "TOOL",
"sub_category": "general",
"typical_lifespan": "MULTI_YEAR",
"version_strategy": "UNVERSIONED",
"volatility": "MEDIUM"
},
"enrichment": null,
"keep_log": [],
"locked_dimensions": [],
"merge_log": [],
"placed": null,
"relationships": null,
"skill_id": "bi-tools",
"split_log": [],
"typed": null,
"warnings": []
},
"source_tag": "llm",
"was_in_llm_skills": true
},
{
"aliases_in_db": [],
"canonical": null,
"dimensions": [],
"input_skill": "Data Lineage",
"matched_via": null,
"new_alias_persisted": false,
"new_alias_text": null,
"new_skill_meta": {
"derived": {
"category": "Data Engineering Tools",
"skill_nature": "CONCEPT",
"sub_category": "general",
"typical_lifespan": "MULTI_YEAR",
"version_strategy": "UNVERSIONED",
"volatility": "MEDIUM"
},
"enrichment": null,
"keep_log": [],
"locked_dimensions": [],
"merge_log": [],
"placed": null,
"relationships": null,
"skill_id": "data-lineage",
"split_log": [],
"typed": null,
"warnings": []
},
"source_tag": "llm",
"was_in_llm_skills": true
},
{
"aliases_in_db": [],
"canonical": null,
"dimensions": [],
"input_skill": "Standards",
"matched_via": null,
"new_alias_persisted": false,
"new_alias_text": null,
"new_skill_meta": {
"derived": {
"category": "Best Practices",
"skill_nature": "CONCEPT",
"sub_category": "general",
"typical_lifespan": "EVERGREEN",
"version_strategy": "UNVERSIONED",
"volatility": "STABLE"
},
"enrichment": null,
"keep_log": [],
"locked_dimensions": [],
"merge_log": [],
"placed": null,
"relationships": null,
"skill_id": "standards",
"split_log": [],
"typed": null,
"warnings": []
},
"source_tag": "llm",
"was_in_llm_skills": true
},
{
"aliases_in_db": [],
"canonical": null,
"dimensions": [],
"input_skill": "Best Practices",
"matched_via": null,
"new_alias_persisted": false,
"new_alias_text": null,
"new_skill_meta": {
"derived": {
"category": "Best Practices",
"skill_nature": "PRACTICE",
"sub_category": "general",
"typical_lifespan": "EVERGREEN",
"version_strategy": "UNVERSIONED",
"volatility": "STABLE"
},
"enrichment": null,
"keep_log": [],
"locked_dimensions": [],
"merge_log": [],
"placed": null,
"relationships": null,
"skill_id": "best-practices",
"split_log": [],
"typed": null,
"warnings": []
},
"source_tag": "llm",
"was_in_llm_skills": true
}
],
"unmatched_skills": [
"Data Pipelines",
"Cloud Data Sources",
"Data Warehouse",
"ETL",
"BI Tools",
"Data Lineage",
"Standards",
"Best Practices"
]
}
API 3 — final-role-output
{
"chosen_role": {
"display_name": "Data Engineer",
"id": 2,
"rationale": "Exact alias hit on data-engineer (1.0) \u2014 no other alias at this confidence; skill_top absent does not contradict",
"role_archetype": null,
"slug": "data-engineer",
"source": "db"
},
"chosen_role_resolution": "in_db",
"final_input_skills": [
{
"skill": "Data Pipelines",
"tag": "new"
},
{
"skill": "Cloud Data Sources",
"tag": "new"
},
{
"skill": "Data Warehouse",
"tag": "new"
},
{
"skill": "ETL",
"tag": "new"
},
{
"skill": "BI Tools",
"tag": "new"
},
{
"skill": "Data Lineage",
"tag": "new"
},
{
"skill": "Standards",
"tag": "new"
},
{
"skill": "Best Practices",
"tag": "new"
}
],
"llm_cost_api1_usd": null,
"llm_cost_api2_usd": null,
"llm_cost_api3_usd": null,
"llm_cost_total_usd": null,
"persistence": {
"items": [],
"new_skills_created": 0,
"role_dimension_saved": 0,
"skill_dimension_saved": 0,
"skipped": 0
},
"planner_output": null,
"run_id": "fc829bda-6a1e-4086-befc-133024825a6d"
}
LLM Calls
Every model call made for this run, in pipeline order. Click a card to see the model's response.