Pipeline run
27ea4ff7-d84b-4cc3-bcf1-ed1a2c19dd1c
Client output enrichment
v2 Skill cluster · Nature of work · AI index · Tech stack maturity · Evidence · KRA description · vocab breakdown (legacy)
Signals
Post-classification
Captured for admin review
1 POST /skills/extract-from-jd
2 POST /skills/extract-details
3 POST /skills/final-role-output
Data Engineer
CASE A · slug: data-engineer · id: 2 · source: db
Exact alias hit on data-engineer (1.0) — no other alias at this confidence; skill_top absent does not contradict
Resolution: in_db (role exists in library; skill↔dim and role↔dim links saved when applicable).
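The Case A rationale above (one exact alias hit at 1.0, no other alias at that confidence) can be illustrated with a minimal sketch. The function name `pick_exact_alias` and the `threshold` parameter are hypothetical; the pipeline's actual decision code is not shown in this run, so this only mirrors the stated reasoning.

```python
from typing import Optional


def pick_exact_alias(alias_match_roles: list, threshold: float = 1.0) -> Optional[dict]:
    """Return the role only when exactly one alias match reaches the threshold.

    Hypothetical helper: it mirrors the Case A rationale text, not the
    pipeline's real implementation.
    """
    top = [
        role for role in alias_match_roles
        if role.get("score") is not None and role["score"] >= threshold
    ]
    # Zero hits or several hits at the same confidence are both ambiguous here.
    return top[0] if len(top) == 1 else None


# Trimmed copy of stage3_signals.alias_match_roles from this run.
alias_match_roles = [
    {"display_name": "Data Engineer", "role_id": 2, "score": 1.0, "slug": "data-engineer"},
]

chosen = pick_exact_alias(alias_match_roles)
print(chosen["slug"] if chosen else "no unambiguous alias hit")
```

With two roles tied at 1.0 the helper returns `None`, which is where a collision or fallback path would presumably take over.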
Job description
Project Role: Data Platform Engineer
Project Role Description: Assists with the data platform blueprint and design, encompassing the relevant data platform components. Collaborates with the Integration Architects and Data Architects to ensure cohesive integration between systems and data models.
Must have skills: Informatica Intelligent Cloud Services
Good to have skills: Google BigQuery
Minimum 3 Year(s) Of Experience Is Required
Educational Qualification: 15 years full time education

Summary: As a Data Platform Engineer, you will assist with the data platform blueprint and design, collaborating with Integration Architects and Data Architects to ensure cohesive integration between systems and data models. You will play a crucial role in the development and maintenance of the data platform components. Join our team in Mumbai and contribute to the success of our data engineering projects.

Roles & Responsibilities:
- Expected to perform independently and become an SME.
- Required active participation/contribution in team discussions.
- Contribute in providing solutions to work related problems.
- Assist with the data platform blueprint and design.
- Collaborate with Integration Architects and Data Architects to ensure cohesive integration between systems and data models.
- Develop and maintain data platform components.
- Implement data integration processes using Informatica Intelligent Cloud Services.
- Optimize data workflows and pipelines for efficient data processing.
- Troubleshoot and resolve data integration issues.
- Ensure data quality and integrity throughout the data platform.
- Stay updated with the latest trends and advancements in data engineering.
- Provide technical guidance and support to junior team members.

Professional & Technical Skills:
- Must To Have Skills: Proficiency in Informatica Intelligent Cloud Services.
- Good To Have Skills: Experience with Google BigQuery.
- Strong understanding of data integration concepts and techniques.
- Experience in designing and implementing data workflows and pipelines.
- Familiarity with cloud-based data platforms and services.
- Knowledge of SQL and database management systems.
- Experience with data quality and data governance practices.
- Ability to troubleshoot and resolve data integration issues.

Additional Information:
- The candidate should have a minimum of 3 years of experience in Informatica Intelligent Cloud Services.
- This position is based at our Mumbai office.
- A 15 years full time education is required.

15 years full time education
Skills from this JD
Each row merges API 1 extraction, API 2 library match / v3 orchestration (dimensions + locked dims), and API 3 persistence tags.
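A rough sketch of how such a merged row could be assembled from the three payloads shown later in this run. The field names (`final_skills`, `skills_detail`, `new_skill_meta`, `final_input_skills`) come from those payloads; the function `merge_skill_rows` and the shape of the merged row are assumptions made for illustration, not the page's actual rendering code.

```python
def merge_skill_rows(api1: dict, api2: dict, api3: dict) -> list:
    """Join per-skill data from the three API responses, keyed by skill name.

    Hypothetical helper: the merged row layout is an assumption.
    """
    rows = {}

    # API 1: extracted skills and their primary flag.
    for s in api1.get("final_skills", []):
        rows[s["skill_name"]] = {"skill": s["skill_name"], "is_primary": s.get("is_primary")}

    # API 2: library match result, derived metadata, and locked dimensions.
    for d in api2.get("skills_detail", []):
        row = rows.setdefault(d["input_skill"], {"skill": d["input_skill"]})
        meta = d.get("new_skill_meta") or {}
        row["canonical"] = d.get("canonical")
        row["derived"] = meta.get("derived")
        row["locked_dimensions"] = meta.get("locked_dimensions", [])

    # API 3: persistence tag per skill.
    for f in api3.get("final_input_skills", []):
        row = rows.setdefault(f["skill"], {"skill": f["skill"]})
        row["tag"] = f.get("tag")

    return list(rows.values())


# Trimmed copies of the payloads from this run.
SKILL = "Informatica Intelligent Cloud Services"
api1 = {"final_skills": [{"is_primary": True, "skill_name": SKILL}]}
api2 = {"skills_detail": [{
    "input_skill": SKILL,
    "canonical": None,
    "new_skill_meta": {
        "derived": {"category": "Cloud Platforms", "skill_nature": "PLATFORM"},
        "locked_dimensions": [],
    },
}]}
api3 = {"final_input_skills": [{"skill": SKILL, "tag": "new"}]}

rows = merge_skill_rows(api1, api2, api3)
print(rows[0])
```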
Skill enrichment (orchestrator / LLM)
No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).
- Category: Cloud Platforms
- Sub-category: general
- Skill nature: PLATFORM
- Volatility: MEDIUM
- Typical lifespan: MULTI_YEAR
- Version strategy: UNVERSIONED
Library artifacts (this run)
| Kind | Detail | DB id |
|---|---|---|
| canonical_skill_proposed | Informatica Intelligent Cloud Services (type=Cloud Platforms subtype=general nature=PLATFORM lifespan=MULTI_YEAR) | |
nano JD Parser — gpt-4.1-nano
{
"JD_type": "pass",
"about_company": null,
"certifications": [],
"company_name": null,
"ctc": null,
"domain": {
"primary": {
"aliases": [],
"domain": "Other"
},
"secondary": null
},
"education": [
{
"level": "null",
"qualification": "null - null",
"raw": "15 years full time education",
"requirement": "required"
}
],
"experience": {
"max": null,
"min": 3,
"raw": "Minimum 3 Year(s) Of Experience Is Required"
},
"job_locations": [
{
"aliases": [
"Bombay"
],
"city": "Mumbai",
"country": "India",
"state": null,
"work_mode": "onsite"
}
],
"role": "Data Platform Engineer",
"role_aliases": [
"Data Engineer",
"Data Integration Engineer"
],
"role_archetype": "Data",
"roles_and_responsibilities": [
{
"bullet_count": 12,
"heading": "Roles \u0026 Responsibilities",
"heading_was_present": true,
"source_marker": {
"first_5_words": "- Expected to perform independently",
"last_5_words": "to junior team members."
},
"text": "- Expected to perform independently and become an SME.\n- Required active participation/contribution in team discussions.\n- Contribute in providing solutions to work related problems.\n- Assist with the data platform blueprint and design.\n- Collaborate with Integration Architects and Data Architects to ensure cohesive integration between systems and data models.\n- Develop and maintain data platform components.\n- Implement data integration processes using Informatica Intelligent Cloud Services.\n- Optimize data workflows and pipelines for efficient data processing.\n- Troubleshoot and resolve data integration issues.\n- Ensure data quality and integrity throughout the data platform.\n- Stay updated with the latest trends and advancements in data engineering.\n- Provide technical guidance and support to junior team members.",
"word_count": 114
}
],
"urls": []
}
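Note that the `education` entry in the parser output above carries the strings "null" and "null - null" rather than JSON `null`. A consumer that tests for missing values may want to normalize these first. The sketch below is one possible approach; the helper `normalize_nulls` and its set of placeholder strings are assumptions, not part of the pipeline.

```python
# Placeholder strings seen in this run's parser output (assumed list, extend as needed).
NULL_PLACEHOLDERS = {"null", "null - null"}


def normalize_nulls(value):
    """Recursively replace placeholder strings such as "null" with None."""
    if isinstance(value, dict):
        return {key: normalize_nulls(item) for key, item in value.items()}
    if isinstance(value, list):
        return [normalize_nulls(item) for item in value]
    if isinstance(value, str) and value.strip().lower() in NULL_PLACEHOLDERS:
        return None
    return value


# The education entry exactly as emitted by the parser in this run.
education = [
    {
        "level": "null",
        "qualification": "null - null",
        "raw": "15 years full time education",
        "requirement": "required",
    }
]

cleaned = normalize_nulls(education)
print(cleaned)
```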
API 1 — extract-from-jd
{
"final_skills": [
{
"is_primary": true,
"skill_name": "Informatica Intelligent Cloud Services"
}
],
"jd_role": {
"display_name": "Data Platform Engineer",
"rationale": null,
"role_aliases": [
"Data Engineer",
"Data Integration Engineer"
],
"role_archetype": "Data",
"slug": ""
},
"nano_parsed": {
"JD_type": "pass",
"about_company": null,
"certifications": [],
"company_name": null,
"ctc": null,
"domain": {
"primary": {
"aliases": [],
"domain": "Other"
},
"secondary": null
},
"education": [
{
"level": "null",
"qualification": "null - null",
"raw": "15 years full time education",
"requirement": "required"
}
],
"experience": {
"max": null,
"min": 3,
"raw": "Minimum 3 Year(s) Of Experience Is Required"
},
"job_locations": [
{
"aliases": [
"Bombay"
],
"city": "Mumbai",
"country": "India",
"state": null,
"work_mode": "onsite"
}
],
"role": "Data Platform Engineer",
"role_aliases": [
"Data Engineer",
"Data Integration Engineer"
],
"role_archetype": "Data",
"roles_and_responsibilities": [
{
"bullet_count": 12,
"heading": "Roles \u0026 Responsibilities",
"heading_was_present": true,
"source_marker": {
"first_5_words": "- Expected to perform independently",
"last_5_words": "to junior team members."
},
"text": "- Expected to perform independently and become an SME.\n- Required active participation/contribution in team discussions.\n- Contribute in providing solutions to work related problems.\n- Assist with the data platform blueprint and design.\n- Collaborate with Integration Architects and Data Architects to ensure cohesive integration between systems and data models.\n- Develop and maintain data platform components.\n- Implement data integration processes using Informatica Intelligent Cloud Services.\n- Optimize data workflows and pipelines for efficient data processing.\n- Troubleshoot and resolve data integration issues.\n- Ensure data quality and integrity throughout the data platform.\n- Stay updated with the latest trends and advancements in data engineering.\n- Provide technical guidance and support to junior team members.",
"word_count": 114
}
],
"urls": []
},
"rejected": false,
"rejection_reason": null,
"run_id": "27ea4ff7-d84b-4cc3-bcf1-ed1a2c19dd1c",
"stage3_signals": {
"alias_found": true,
"alias_match_roles": [
{
"display_name": "Data Engineer",
"kra_matches": null,
"matched_count": null,
"matched_skills": null,
"role_id": 2,
"score": 1.0,
"slug": "data-engineer",
"total_count": null
}
],
"kra_match_roles": [
{
"display_name": "Data Engineer",
"kra_matches": [
{
"kra_text": "Implements data quality validation rules, reconciliation checks, and anomaly detection to ensure data completeness, accuracy, and consistency.",
"sentence": "Ensure data quality and integrity throughout the data platform.",
"similarity": 0.6593
},
{
"kra_text": "Optimizes pipeline throughput, partitioning strategies, and query performance across cloud data warehouses like Snowflake, BigQuery, or Redshift.",
"sentence": "Optimize data workflows and pipelines for efficient data processing.",
"similarity": 0.6523
},
{
"kra_text": "Designs dimensional models, star schemas, data vault structures, and curated data mart tables to support BI tools and self-service analytics consumption.",
"sentence": "Assist with the data platform blueprint and design.",
"similarity": 0.6138
}
],
"matched_count": null,
"matched_skills": null,
"role_id": 2,
"score": 0.6418,
"slug": "data-engineer",
"total_count": null
},
{
"display_name": "Svelte Frontend Developer",
"kra_matches": [
{
"kra_text": "backend data integration",
"sentence": "Troubleshoot and resolve data integration issues.",
"similarity": 0.5869
},
{
"kra_text": "backend data integration",
"sentence": "Develop and maintain data platform components.",
"similarity": 0.5181
},
{
"kra_text": "backend data integration",
"sentence": "Collaborate with Integration Architects and Data Architects to ensure cohesive integration between systems and data models.",
"similarity": 0.5145
}
],
"matched_count": null,
"matched_skills": null,
"role_id": 92,
"score": 0.5399,
"slug": "svelte-frontend-developer",
"total_count": null
},
{
"display_name": "AI Engineer",
"kra_matches": [
{
"kra_text": "Designs and implements prompt engineering workflows, few-shot examples, chain-of-thought patterns, and structured output parsing for AI feature pipelines.",
"sentence": "Optimize data workflows and pipelines for efficient data processing.",
"similarity": 0.5375
},
{
"kra_text": "Integrates AI model API responses with application business logic, database writes, event publishing, and downstream service orchestration.",
"sentence": "Collaborate with Integration Architects and Data Architects to ensure cohesive integration between systems and data models.",
"similarity": 0.4868
},
{
"kra_text": "Integrates AI model API responses with application business logic, database writes, event publishing, and downstream service orchestration.",
"sentence": "Implement data integration processes using Informatica Intelligent Cloud Services.",
"similarity": 0.473
}
],
"matched_count": null,
"matched_skills": null,
"role_id": 13,
"score": 0.4991,
"slug": "ai-engineer",
"total_count": null
},
{
"display_name": "Python Backend Developer",
"kra_matches": [
{
"kra_text": "Maintain data access and persistence",
"sentence": "Develop and maintain data platform components.",
"similarity": 0.5191
},
{
"kra_text": "Maintain data access and persistence",
"sentence": "Ensure data quality and integrity throughout the data platform.",
"similarity": 0.4966
},
{
"kra_text": "Troubleshoot server-side defects",
"sentence": "Troubleshoot and resolve data integration issues.",
"similarity": 0.4757
}
],
"matched_count": null,
"matched_skills": null,
"role_id": 80,
"score": 0.4971,
"slug": "python-backend-developer",
"total_count": null
},
{
"display_name": "MLOps Engineer",
"kra_matches": [
{
"kra_text": "Validates model performance benchmarks, data schema contracts, and system integration health before signing off on production release readiness.",
"sentence": "Ensure data quality and integrity throughout the data platform.",
"similarity": 0.5263
},
{
"kra_text": "Coordinates model promotion workflows across development, staging, and production environments including integration testing and data contract validation.",
"sentence": "Collaborate with Integration Architects and Data Architects to ensure cohesive integration between systems and data models.",
"similarity": 0.4824
},
{
"kra_text": "Automates ML platform operations including scheduled retraining triggers, pipeline orchestration, evaluation workflows, and alerting configuration.",
"sentence": "Optimize data workflows and pipelines for efficient data processing.",
"similarity": 0.482
}
],
"matched_count": null,
"matched_skills": null,
"role_id": 16,
"score": 0.4969,
"slug": "ml-ops-engineer",
"total_count": null
}
],
"skill_match_roles": []
},
"stage4_decision": {
"alias_collision_detected": false,
"case": "A",
"chosen_role": {
"display_name": "Data Engineer",
"kra_matches": null,
"matched_count": null,
"matched_skills": null,
"role_id": 2,
"score": 1.0,
"slug": "data-engineer",
"total_count": null
},
"confidence": 1.0,
"is_new_role": false,
"llm2_fired": false,
"llm2_reasoning": null,
"matched_dimensions": [],
"matched_kras": [],
"matched_skills": [],
"new_role_display_name": null,
"new_role_slug": null,
"queued": false,
"reasoning": "Exact alias hit on data-engineer (1.0) \u2014 no other alias at this confidence; skill_top absent does not contradict",
"sub_role": null
},
"stage5_updates": {
"centroid_n_after": 251,
"centroid_updated": true,
"collision_log_id": null,
"new_kra_attached": null,
"new_skills_attached": [
{
"is_primary": true,
"queue_id": 12410,
"role_display_name": "Data Engineer",
"role_slug": "data-engineer",
"skill_name": "Informatica Intelligent Cloud Services",
"status": "pending"
}
],
"queue_entry_id": null,
"v3_pipeline_triggered": false,
"v3_role_slug": null,
"v3_run_id": null
}
}
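In the `kra_match_roles` entries above, each role-level `score` is numerically consistent with the arithmetic mean of its three listed `similarity` values, within rounding of the displayed figures. This is an inference from the numbers in this run, not a documented formula. The sketch below recomputes the means for comparison; the function name `mean_kra_similarity` and the tolerance are assumptions.

```python
from statistics import mean


def mean_kra_similarity(similarities: list) -> float:
    """Arithmetic mean of sentence-to-KRA similarities (assumed aggregation)."""
    return mean(similarities)


# (similarities, reported score) copied from kra_match_roles in this run.
observed = {
    "data-engineer": ([0.6593, 0.6523, 0.6138], 0.6418),
    "svelte-frontend-developer": ([0.5869, 0.5181, 0.5145], 0.5399),
    "ai-engineer": ([0.5375, 0.4868, 0.473], 0.4991),
    "python-backend-developer": ([0.5191, 0.4966, 0.4757], 0.4971),
    "ml-ops-engineer": ([0.5263, 0.4824, 0.482], 0.4969),
}

# The displayed similarities are already rounded, so allow a small tolerance.
TOLERANCE = 1e-4

deviations = {
    slug: abs(mean_kra_similarity(sims) - reported)
    for slug, (sims, reported) in observed.items()
}
for slug, deviation in deviations.items():
    status = "consistent" if deviation < TOLERANCE else "differs"
    print(f"{slug}: {status}")
```

If the aggregation were something else (for example a weighted mean or a top-k over more than three matches), the comparison would be expected to diverge on other runs.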
API 2 — extract-details
{
"alias_matches": [],
"candidate_roles": [],
"chosen_role": {
"display_name": "Data Engineer",
"id": 2,
"rationale": "Exact alias hit on data-engineer (1.0) \u2014 no other alias at this confidence; skill_top absent does not contradict",
"role_archetype": null,
"slug": "data-engineer",
"source": "db"
},
"dimensions": [],
"input_final_skills": [
"Informatica Intelligent Cloud Services"
],
"input_llm_skills": [
"Informatica Intelligent Cloud Services"
],
"new_aliases_persisted": 0,
"run_id": "27ea4ff7-d84b-4cc3-bcf1-ed1a2c19dd1c",
"skills_detail": [
{
"aliases_in_db": [],
"canonical": null,
"dimensions": [],
"input_skill": "Informatica Intelligent Cloud Services",
"matched_via": null,
"new_alias_persisted": false,
"new_alias_text": null,
"new_skill_meta": {
"derived": {
"category": "Cloud Platforms",
"skill_nature": "PLATFORM",
"sub_category": "general",
"typical_lifespan": "MULTI_YEAR",
"version_strategy": "UNVERSIONED",
"volatility": "MEDIUM"
},
"enrichment": null,
"keep_log": [],
"locked_dimensions": [],
"merge_log": [],
"placed": null,
"relationships": null,
"skill_id": "informatica-intelligent-cloud-services",
"split_log": [],
"typed": null,
"warnings": []
},
"source_tag": "llm",
"was_in_llm_skills": true
}
],
"unmatched_skills": [
"Informatica Intelligent Cloud Services"
]
}
API 3 — final-role-output
{
"chosen_role": {
"display_name": "Data Engineer",
"id": 2,
"rationale": "Exact alias hit on data-engineer (1.0) \u2014 no other alias at this confidence; skill_top absent does not contradict",
"role_archetype": null,
"slug": "data-engineer",
"source": "db"
},
"chosen_role_resolution": "in_db",
"final_input_skills": [
{
"skill": "Informatica Intelligent Cloud Services",
"tag": "new"
}
],
"llm_cost_api1_usd": null,
"llm_cost_api2_usd": null,
"llm_cost_api3_usd": null,
"llm_cost_total_usd": null,
"persistence": {
"items": [],
"new_skills_created": 0,
"role_dimension_saved": 0,
"skill_dimension_saved": 0,
"skipped": 0
},
"planner_output": null,
"run_id": "27ea4ff7-d84b-4cc3-bcf1-ed1a2c19dd1c"
}
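The three payloads above share one `run_id`, and the skill listed in API 2's `unmatched_skills` is the one API 3 tags as "new". A small consistency check along those lines might look like the sketch below. The function `check_run_consistency` is hypothetical, and the rule that every unmatched skill should be tagged "new" is an assumption drawn from this single run.

```python
def check_run_consistency(api1: dict, api2: dict, api3: dict) -> list:
    """Return a list of human-readable problems; an empty list means consistent."""
    problems = []

    # All three responses should belong to the same pipeline run.
    run_ids = {api1.get("run_id"), api2.get("run_id"), api3.get("run_id")}
    if len(run_ids) != 1:
        problems.append("run_id mismatch: " + ", ".join(sorted(map(str, run_ids))))

    # Assumed rule: a skill with no library match in API 2 is tagged "new" in API 3.
    tags = {s["skill"]: s.get("tag") for s in api3.get("final_input_skills", [])}
    for skill in api2.get("unmatched_skills", []):
        if tags.get(skill) != "new":
            problems.append(f"unmatched skill not tagged new: {skill}")

    return problems


# Trimmed copies of the payloads from this run.
RUN_ID = "27ea4ff7-d84b-4cc3-bcf1-ed1a2c19dd1c"
SKILL = "Informatica Intelligent Cloud Services"
api1 = {"run_id": RUN_ID}
api2 = {"run_id": RUN_ID, "unmatched_skills": [SKILL]}
api3 = {"run_id": RUN_ID, "final_input_skills": [{"skill": SKILL, "tag": "new"}]}

problems = check_run_consistency(api1, api2, api3)
print(problems or "payloads are consistent")
```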
LLM Calls
Every model call made for this run, in pipeline order.