Pipeline run

e676df21-b586-432c-952d-fed47e334f3d

Pipeline LLM cost (USD)

API 1: $0.0048 · API 2: $0.0000 · API 3: $0.0000 · Total: $0.0048

Client output enrichment

v2 Skill cluster · Nature of work · AI index · Tech stack maturity · Evidence · KRA description

role baseline loaded sources · ai_index: jd · nature_of_work: jd · tech_stack_maturity: jd

Nature of work · Data transformation and modeling

Build and operate SQL/ETL data pipelines and AWS-backed data platforms, transforming disparate datasets, enforcing data quality, and troubleshooting issues for product, data, and executive stakeholders. Also present findings and support machine-learning initiatives in a production setting.

"Build processes supporting data transformation, data structures, metadata, dependency and workload management using programming languages such as Java or Python."

Tech stack maturity

Mainstream Modern · cache hit

The stack centers on AWS, Redshift, RDS, Python, PostgreSQL, and SQL, which are standard modern cloud data engineering technologies with some legacy Hadoop presence.

AI index (0 = no AI use, 5 = totally AI-dependent · v2.1)

0.50 / 5

· Title match

✓ Has AI skill

· AI skill (primary)

✓ AI skill (secondary)

· On AI team

· Builds AI products

vocab breakdown (legacy)

Assistants (×1): —

Frameworks (×2): —

Models / concepts (×3): fine-tuning, AI, ML, AI/ML, Machine Learning, Artificial Intelligence

Evidence — skills matched in JD (21)

SQL PostgreSQL Oracle ETL ELT AWS DMS SCT RDS Aurora Redshift Java Python Data Warehousing Data Visualization Data Integration Hadoop Agile Kanban Scrum Machine Learning

Skill cluster (6 dimension groups, role-scoped)

Cloud Platforms

AWS RDS Redshift

Programming Languages for Data Work

SQL Java Python

AI Governance and Model Security

Machine Learning

ETL and ELT Tooling

Hadoop

Relational Database Usage

PostgreSQL

Cross-cutting / unaligned

Oracle ETL ELT DMS SCT Aurora Data Warehousing Data Visualization Data Integration Agile Kanban Scrum
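
The grouping above can be read as a lookup from skill to dimension group, with every unmatched skill falling into the cross-cutting bucket. The following is a minimal sketch of that reading, not the pipeline's actual clustering logic: `DIMENSION_GROUPS`, `FALLBACK_GROUP`, and `cluster_skills` are hypothetical names, and the table is simply filled in from the groups listed for this run.

```python
from collections import defaultdict

# Hypothetical lookup table, populated from the groups shown for this run.
DIMENSION_GROUPS = {
    "AWS": "Cloud Platforms",
    "RDS": "Cloud Platforms",
    "Redshift": "Cloud Platforms",
    "SQL": "Programming Languages for Data Work",
    "Java": "Programming Languages for Data Work",
    "Python": "Programming Languages for Data Work",
    "Machine Learning": "AI Governance and Model Security",
    "Hadoop": "ETL and ELT Tooling",
    "PostgreSQL": "Relational Database Usage",
}
FALLBACK_GROUP = "Cross-cutting / unaligned"


def cluster_skills(skills):
    """Group skills by dimension; skills without a mapping go to the fallback bucket."""
    groups = defaultdict(list)
    for skill in skills:
        groups[DIMENSION_GROUPS.get(skill, FALLBACK_GROUP)].append(skill)
    return dict(groups)


# The 21 skills matched in the JD, in the order shown under "Evidence".
matched = [
    "SQL", "PostgreSQL", "Oracle", "ETL", "ELT", "AWS", "DMS", "SCT", "RDS",
    "Aurora", "Redshift", "Java", "Python", "Data Warehousing",
    "Data Visualization", "Data Integration", "Hadoop", "Agile", "Kanban",
    "Scrum", "Machine Learning",
]
clusters = cluster_skills(matched)
for group, members in clusters.items():
    print(f"{group}: {' '.join(members)}")
```

Under this reading, 12 of the 21 matched skills land in the fallback bucket, which matches the "Cross-cutting / unaligned" row above.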

KRA description

The role will be a core member of an AI/ML focused incubator team consisting of Product Management, Data Science, Product Design, Design Strategy, Engineering and System Architecture professionals, all focused on developing capabilities, products and services to improve investment outcomes and enrich the customer experience. You will apply your data engineering skills to vast and varied data sets on one of the industry’s leading developmental networks. Present reports and findings to senior level technical and non-technical audience. Implement new technologies in a production environment with product, IT, and data engineering teams. Acting as a catalyst for machine learning initiatives in the organization. Accountable for consistent delivery of functional software. Experience in software development practices and procedures. Passionate about data. Develops original and creative technical solutions to ongoing development efforts. Advanced working SQL knowledge and experience working with relational databases, query authoring (SQL) as well as working familiarity with a variety of databases. Solid experience using relational databases (PostgreSQL, Oracle) as well as data movement technologies (ETL/ELT) from disparate sources. Solid experience using data related services in AWS Data services like DMS, SCT, RDS, Aurora, Redshift etc. Experience in database administration. Build processes supporting data transformation, data structures, metadata, dependency and workload management using programming languages such as Java or Python. Data quality management experience, including data fine-tuning and testing to project specifications. Work with stakeholders including the Executive, Product, Data and Design teams to assist with data-related technical issues and support their data infrastructure needs. Experience performing root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement. 
A successful history of manipulating, processing and extracting value from both small and large disconnected datasets. Experience supporting and working with cross-functional teams in a dynamic environment. Strong analytical and problem-solving skills. Willingness to constantly re-iterate design and code for the best solutions. Experience with financial and legal datasets is an advantage. Experience in using big data platforms (Hadoop). Experience in working on an incubator or accelerator setup is a plus. Bachelor’s or Masters in a technology related field (e.g. Engineering, Computer Science, etc.) required. Academic background in data analysis, data modeling, computer science, engineering, finance, economics, or related discipline preferred. 9+ years of IT experience. 6+ years of data related experience across different sources of data. 6+ years of intermediate level of programming experience using either Python or Java. Demonstrated expertise across a broad range of research/analytics methodologies like data warehousing, data visualization and data integration to develop key findings for highly complex business problems. Ability to turn complex and large volumes of data into 'intelligent data' that answers critical business questions. Strong experience in Agile methodologies (Kanban and Scrum) preferred.

Signals

Skill data-engineer

0.35

Alias data-engineer

1.00

KRA data-engineer

0.60
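
The API 1 response for this run lists three KRA sentence matches for data-engineer, with similarities 0.672, 0.5823 and 0.5521, and a role score of 0.6022. That score is within rounding distance of the arithmetic mean of the three values, which suggests, but does not confirm, that the KRA signal is a mean over matched sentences. The sketch below only checks that arithmetic; `kra_signal` is a hypothetical helper, and the mean aggregation is an assumption.

```python
from statistics import mean

# Per-sentence similarities reported for data-engineer in this run's API 1 response.
kra_similarities = [0.672, 0.5823, 0.5521]


def kra_signal(similarities):
    """Assumed aggregation: arithmetic mean of the per-sentence similarities."""
    return mean(similarities) if similarities else 0.0


score = kra_signal(kra_similarities)
# The inputs are already rounded, so a small gap to the reported 0.6022 is expected.
print(f"mean similarity: {score:.4f}, gap to reported score: {abs(score - 0.6022):.4f}")
```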

Post-classification

Centroid: updated · n=521

Alias collision log: (none)

New-role queue: (none)

New skills captured: 10

New KRA captured: (none)

Captured for admin review

Oracle primary ↔ Data Engineer pending

ETL primary ↔ Data Engineer pending

ELT primary ↔ Data Engineer pending

DMS primary ↔ Data Engineer pending

SCT primary ↔ Data Engineer pending

Aurora primary ↔ Data Engineer pending

Data Warehousing primary ↔ Data Engineer pending

Data Visualization primary ↔ Data Engineer pending

Data Integration primary ↔ Data Engineer pending

Kanban primary ↔ Data Engineer pending

Status: extract_from_jd_done · Created: 2026-05-27T17:46:20.983283Z · Updated: 2026-06-07T08:00:27.712147Z

Flow · Current 3-step pipeline

1 POST /skills/extract-from-jd

2 POST /skills/extract-details

3 POST /skills/final-role-output
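
The three steps run in order. A minimal sketch of a driver for them follows, with several assumptions: `run_pipeline`, the `send` callable, and the `jd_text` payload key are hypothetical; feeding each response into the next request is a guess at how the steps chain; and stopping on a truthy `rejected` field is inferred from the `rejected` flag in the API 1 response. Only the three paths come from this page.

```python
# The three endpoints listed above, in call order.
PIPELINE_STEPS = [
    ("POST", "/skills/extract-from-jd"),
    ("POST", "/skills/extract-details"),
    ("POST", "/skills/final-role-output"),
]


def run_pipeline(send, payload):
    """Call each step in order and return the paths that were actually called.

    `send(method, path, payload)` is a caller-supplied function that performs
    the HTTP request and returns the decoded JSON response as a dict.
    """
    completed = []
    for method, path in PIPELINE_STEPS:
        response = send(method, path, payload)
        completed.append(path)
        if response.get("rejected"):
            break  # assumption: a rejected JD ends the run early
        payload = response  # assumption: each step consumes the previous output
    return completed


def fake_send(method, path, payload):
    """Stand-in transport for illustration; it makes no network call."""
    return {"run_id": payload.get("run_id", "demo"), "rejected": False}


done = run_pipeline(fake_send, {"jd_text": "..."})
print(done)
```

Passing the transport in as a function keeps the sketch independent of any particular HTTP client and of the service's base URL, neither of which is shown on this page.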

Role · Chosen role & resolution

No chosen role stored for this run.

Job description

Job Description

Job Title - Principal - Data Engineer

Fidelity Center for Applied Technology’s (FCAT) Artificial Intelligence incubator focuses on machine learning initiatives in the financial services domain. In the Artificial Intelligence incubator, we partner with business stakeholders to identify and prioritize top AI opportunities, and to design, build and deploy solutions that benefit the user and the business. We also research, experiment and publish on the latest advancements in the AI landscape.

FCAT is seeking a Principal Data Engineer with a solid data engineering background. As a member of the FCAT Artificial Intelligence Incubator, you will be responsible for data analysis and integration across multiple source repositories for a wide variety of structured and unstructured data. You will work on all phases of projects, from design to deployment into production, along with a team of Data Scientists, DevOps Engineers and Software Engineers.

The Purpose of your Role
The role will be a core member of an AI/ML focused incubator team consisting of Product Management, Data Science, Product Design, Design Strategy, Engineering and System Architecture professionals, all focused on developing capabilities, products and services to improve investment outcomes and enrich the customer experience.
You will apply your data engineering skills to vast and varied data sets on one of the industry’s leading developmental networks.
Present reports and findings to senior level technical and non-technical audience.
Implement new technologies in a production environment with product, IT, and data engineering teams.
Acting as a catalyst for machine learning initiatives in the organization.

The Value You Deliver
Accountable for consistent delivery of functional software.
Experience in software development practices and procedures.
Passionate about data.
Develops original and creative technical solutions to ongoing development efforts.

The Skills that are Key to this role - Technical & Behavioral
Advanced working SQL knowledge and experience working with relational databases, query authoring (SQL) as well as working familiarity with a variety of databases.
Solid experience using relational databases (PostgreSQL, Oracle) as well as data movement technologies (ETL/ELT) from disparate sources.
Solid experience using data related services in AWS Data services like DMS, SCT, RDS, Aurora, Redshift etc.
Experience in database administration.
Build processes supporting data transformation, data structures, metadata, dependency and workload management using programming languages such as Java or Python.
Data quality management experience, including data fine-tuning and testing to project specifications.
Work with stakeholders including the Executive, Product, Data and Design teams to assist with data-related technical issues and support their data infrastructure needs.
Experience performing root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement.
A successful history of manipulating, processing and extracting value from both small and large disconnected datasets.
Experience supporting and working with cross-functional teams in a dynamic environment.
Strong analytical and problem-solving skills.
Willingness to constantly re-iterate design and code for the best solutions.

The Skills that are Good to Have
Experience with financial and legal datasets is an advantage.
Experience in using big data platforms (Hadoop).
Experience in working on an incubator or accelerator setup is a plus.

How Your Work Impacts the Organization

At Fidelity, we are focused on making our financial expertise broadly accessible and effective in helping people live the lives they want, from the 23 million people investing their life savings, to the 20,000 businesses managing their employee benefits programs, to the 10,000 advisors and institutions needing innovative technology solutions to invest their clients’ money. To do this well, as a privately held company, we place a high degree of value in nurturing a work environment that attracts the best talent and reflects our commitment to being an employer of choice.

Underneath this stable and trusted brand is one of the most interesting innovation practices you have never heard of: the Fidelity Center for Applied Technology (“FCAT”). FCAT is a centralized function whose mandate is to catalyze innovation across the firm. Our FCAT teams prototype and pilot new businesses and capabilities that continue to distinguish our brand as the best customer experience in the financial services industry. These teams focus on exploring big industry game-changing technologies such as cryptocurrencies, artificial intelligence, virtual/augmented reality and cloud computing. Additionally, FCAT’s world class research team identifies trends and works with partners across the innovation ecosystem to “bring the outside in.” We collaborate with MIT, Harvard, Stanford and other academic institutions, and partner with Accelerators, Start Ups and other industry leaders to drive new ideas and innovation practices across the firm.

The Expertise we require
Bachelor’s or Masters in a technology related field (e.g. Engineering, Computer Science, etc.) required.
Academic background in data analysis, data modeling, computer science, engineering, finance, economics, or related discipline preferred.
9+ years of IT experience.
6+ years of data related experience across different sources of data.
6+ years of intermediate level of programming experience using either Python or Java.
Demonstrated expertise across a broad range of research/analytics methodologies like data warehousing, data visualization and data integration to develop key findings for highly complex business problems.
Ability to turn complex and large volumes of data into 'intelligent data' that answers critical business questions.
Strong experience in Agile methodologies (Kanban and Scrum) preferred.

Certifications

Company Overview

At Fidelity, we are focused on making our financial expertise broadly accessible and effective in helping people live the lives they want. We are a privately held company that places a high degree of value in creating and nurturing a work environment that attracts the best talent and reflects our commitment to our associates. We are proud of our diverse and inclusive workplace where we respect and value our associates for their unique perspectives and experiences. Fidelity India has been the Global Inhouse Center of Fidelity Investments since 2003 with offices in Bangalore and Chennai. For information about working at Fidelity, visit India.Fidelity.com.

Fidelity Investments is an equal opportunity employer.

Skills from this JD

Each row merges API 1 extraction, API 2 library match / v3 orchestration (dimensions + locked dims), and API 3 persistence tags.

SQL Primary No API 2 row (run stopped after API 1 or history missing)

PostgreSQL Primary No API 2 row (run stopped after API 1 or history missing)

Oracle Primary No API 2 row (run stopped after API 1 or history missing)

ETL Primary No API 2 row (run stopped after API 1 or history missing)

ELT Primary No API 2 row (run stopped after API 1 or history missing)

AWS Primary No API 2 row (run stopped after API 1 or history missing)

DMS Primary No API 2 row (run stopped after API 1 or history missing)

SCT Primary No API 2 row (run stopped after API 1 or history missing)

RDS Primary No API 2 row (run stopped after API 1 or history missing)

Aurora Primary No API 2 row (run stopped after API 1 or history missing)

Redshift Primary No API 2 row (run stopped after API 1 or history missing)

Java Primary No API 2 row (run stopped after API 1 or history missing)

Python Primary No API 2 row (run stopped after API 1 or history missing)

Data Warehousing Primary No API 2 row (run stopped after API 1 or history missing)

Data Visualization Primary No API 2 row (run stopped after API 1 or history missing)

Data Integration Primary No API 2 row (run stopped after API 1 or history missing)

Hadoop Primary No API 2 row (run stopped after API 1 or history missing)

Agile Primary No API 2 row (run stopped after API 1 or history missing)

Kanban Primary No API 2 row (run stopped after API 1 or history missing)

Scrum Primary No API 2 row (run stopped after API 1 or history missing)

Machine Learning Secondary No API 2 row (run stopped after API 1 or history missing)
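
The rows above behave like a left join: every API 1 skill gets a row, and a skill with no API 2 match shows the placeholder text. A minimal sketch of that merge follows, assuming API 2 results arrive as a dict keyed by skill name; that shape is hypothetical, `merge_skill_rows` is an illustrative name, and API 3 persistence tags are left out because this run has none.

```python
NO_API2_ROW = "No API 2 row (run stopped after API 1 or history missing)"


def merge_skill_rows(api1_skills, api2_rows):
    """Left-join API 1 skills onto API 2 rows by skill name.

    api1_skills: list of {"is_primary": bool, "skill_name": str}, as in the
                 API 1 `final_skills` array.
    api2_rows:   hypothetical dict mapping skill name to an API 2 detail string.
    """
    rows = []
    for skill in api1_skills:
        name = skill["skill_name"]
        level = "Primary" if skill["is_primary"] else "Secondary"
        detail = api2_rows.get(name)
        rows.append((name, level, detail if detail is not None else NO_API2_ROW))
    return rows


api1 = [
    {"is_primary": True, "skill_name": "SQL"},
    {"is_primary": False, "skill_name": "Machine Learning"},
]
# This run stopped after API 1, so there are no API 2 rows to join against.
rows = merge_skill_rows(api1, api2_rows={})
for row in rows:
    print(" ".join(row))
```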

Library artifacts (this run)

No artifact rows for this run.

nano JD Parser (gpt-4.1-nano)

Role: Principal - Data Engineer

Company: Fidelity Center for Applied Technology

Experience: 6+ years of data related experience across different sources of data

Domain: Financial Services

Location: Bangalore, India (null)

JD type: pass

Raw JSON

{
  "JD_type": "pass",
  "about_company": {
    "source_marker": {
      "first_5_words": "At Fidelity, we are focused",
      "last_5_words": "visit India.Fidelity.com."
    },
    "text": "At Fidelity, we are focused on making our financial expertise broadly accessible and effective in helping people live the lives they want. We are a privately held company that places a high degree of value in creating and nurturing a work environment that attracts the best talent and reflects our commitment to our associates. We are proud of our diverse and inclusive workplace where we respect and value our associates for their unique perspectives and experiences. Fidelity India has been the Global Inhouse Center of Fidelity Investments since 2003 with offices in Bangalore and Chennai. For information about working at Fidelity, visit India.Fidelity.com.",
    "word_count": 84
  },
  "certifications": [],
  "company_name": "Fidelity Center for Applied Technology",
  "ctc": null,
  "domain": {
    "primary": {
      "aliases": [
        "Finance",
        "Banking"
      ],
      "domain": "Financial Services"
    },
    "secondary": null
  },
  "education": [
    {
      "level": "Bachelor\u0027s",
      "qualification": "BTECH/BE/BSC - Technology Related Field",
      "raw": "Bachelor\u2019s or Masters in a technology related field (e.g. Engineering, Computer Science, etc.) required",
      "requirement": "required"
    },
    {
      "level": "Bachelor\u0027s",
      "qualification": "BTECH/BE/BSC - Data Analysis / Data Modeling / Computer Science / Engineering / Finance / Economics",
      "raw": "Academic background in data analysis, data modeling, computer science, engineering, finance, economics, or related discipline preferred",
      "requirement": "preferred"
    }
  ],
  "experience": {
    "max": null,
    "min": 6,
    "raw": "6+ years of data related experience across different sources of data"
  },
  "job_locations": [
    {
      "aliases": [
        "Bengaluru"
      ],
      "city": "Bangalore",
      "country": "India",
      "state": "Karnataka",
      "work_mode": "null"
    },
    {
      "aliases": [],
      "city": "Chennai",
      "country": "India",
      "state": "Tamil Nadu",
      "work_mode": "null"
    }
  ],
  "role": "Principal - Data Engineer",
  "role_aliases": [
    "Data Engineer",
    "Senior Data Engineer",
    "Lead Data Engineer"
  ],
  "role_archetype": "Data",
  "roles_and_responsibilities": [
    {
      "bullet_count": 0,
      "heading": "The Purpose of your Role",
      "heading_was_present": true,
      "source_marker": {
        "first_5_words": "The role will be a",
        "last_5_words": "initiatives in the organization."
      },
      "text": "The role will be a core member of an AI/ML focused incubator team consisting of Product Management, Data Science, Product Design, Design Strategy, Engineering and System Architecture professionals, all focused on developing capabilities, products and services to improve investment outcomes and enrich the customer experience. You will apply your data engineering skills to vast and varied data sets on one of the industry\u2019s leading developmental networks. Present reports and findings to senior level technical and non-technical audience. Implement new technologies in a production environment with product, IT, and data engineering teams. Acting as a catalyst for machine learning initiatives in the organization.",
      "word_count": 90
    },
    {
      "bullet_count": 0,
      "heading": "The Value You Deliver",
      "heading_was_present": true,
      "source_marker": {
        "first_5_words": "Accountable for consistent delivery",
        "last_5_words": "to ongoing development efforts."
      },
      "text": "Accountable for consistent delivery of functional software. Experience in software development practices and procedures. Passionate about data. Develops original and creative technical solutions to ongoing development efforts.",
      "word_count": 40
    },
    {
      "bullet_count": 0,
      "heading": "The Skills that are Key to this role - Technical \u0026 Behavioral",
      "heading_was_present": true,
      "source_marker": {
        "first_5_words": "Advanced working SQL knowledge and",
        "last_5_words": "design and code for the best solutions."
      },
      "text": "Advanced working SQL knowledge and experience working with relational databases, query authoring (SQL) as well as working familiarity with a variety of databases. Solid experience using relational databases (PostgreSQL, Oracle) as well as data movement technologies (ETL/ELT) from disparate sources. Solid experience using data related services in AWS Data services like DMS, SCT, RDS, Aurora, Redshift etc. Experience in database administration. Build processes supporting data transformation, data structures, metadata, dependency and workload management using programming languages such as Java or Python. Data quality management experience, including data fine-tuning and testing to project specifications. Work with stakeholders including the Executive, Product, Data and Design teams to assist with data-related technical issues and support their data infrastructure needs. Experience performing root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement. A successful history of manipulating, processing and extracting value from both small and large disconnected datasets. Experience supporting and working with cross-functional teams in a dynamic environment. Strong analytical and problem-solving skills. Willingness to constantly re-iterate design and code for the best solutions.",
      "word_count": 233
    },
    {
      "bullet_count": 0,
      "heading": "The Skills that are Good to Have",
      "heading_was_present": true,
      "source_marker": {
        "first_5_words": "Experience with financial and legal",
        "last_5_words": "or accelerator setup is a plus."
      },
      "text": "Experience with financial and legal datasets is an advantage. Experience in using big data platforms (Hadoop). Experience in working on an incubator or accelerator setup is a plus.",
      "word_count": 30
    },
    {
      "bullet_count": 0,
      "heading": "The Expertise we require",
      "heading_was_present": true,
      "source_marker": {
        "first_5_words": "Bachelor\u2019s or Masters in a",
        "last_5_words": "methodologies (Kanban and Scrum) preferred."
      },
      "text": "Bachelor\u2019s or Masters in a technology related field (e.g. Engineering, Computer Science, etc.) required. Academic background in data analysis, data modeling, computer science, engineering, finance, economics, or related discipline preferred. 9+ years of IT experience. 6+ years of data related experience across different sources of data. 6+ years of intermediate level of programming experience using either Python or Java. Demonstrated expertise across a broad range of research/analytics methodologies like data warehousing, data visualization and data integration to develop key findings for highly complex business problems. Ability to turn complex and large volumes of data into \u0027intelligent data\u0027 that answers critical business questions. Strong experience in Agile methodologies (Kanban and Scrum) preferred.",
      "word_count": 118
    }
  ],
  "urls": []
}
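
Each parsed section carries a `source_marker` with the opening and closing words of its `text`, which makes a cheap consistency check possible. The sketch below is one way to run that check, not the parser's own validation; `check_source_marker` is a hypothetical helper. It compares the markers as a prefix and a suffix rather than counting words, because the markers in this run are not always exactly five words long.

```python
def check_source_marker(section):
    """Return True if the section text starts and ends with its marker strings."""
    # Collapse runs of whitespace so line breaks in the text do not cause false mismatches.
    text = " ".join(section["text"].split())
    marker = section["source_marker"]
    return text.startswith(marker["first_5_words"]) and text.endswith(
        marker["last_5_words"]
    )


# The "The Value You Deliver" section, copied from the raw JSON above.
section = {
    "source_marker": {
        "first_5_words": "Accountable for consistent delivery",
        "last_5_words": "to ongoing development efforts.",
    },
    "text": (
        "Accountable for consistent delivery of functional software. "
        "Experience in software development practices and procedures. "
        "Passionate about data. Develops original and creative technical "
        "solutions to ongoing development efforts."
    ),
}
print(check_source_marker(section))
```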

API 1: extract-from-jd

{
  "final_skills": [
    {
      "is_primary": true,
      "skill_name": "SQL"
    },
    {
      "is_primary": true,
      "skill_name": "PostgreSQL"
    },
    {
      "is_primary": true,
      "skill_name": "Oracle"
    },
    {
      "is_primary": true,
      "skill_name": "ETL"
    },
    {
      "is_primary": true,
      "skill_name": "ELT"
    },
    {
      "is_primary": true,
      "skill_name": "AWS"
    },
    {
      "is_primary": true,
      "skill_name": "DMS"
    },
    {
      "is_primary": true,
      "skill_name": "SCT"
    },
    {
      "is_primary": true,
      "skill_name": "RDS"
    },
    {
      "is_primary": true,
      "skill_name": "Aurora"
    },
    {
      "is_primary": true,
      "skill_name": "Redshift"
    },
    {
      "is_primary": true,
      "skill_name": "Java"
    },
    {
      "is_primary": true,
      "skill_name": "Python"
    },
    {
      "is_primary": true,
      "skill_name": "Data Warehousing"
    },
    {
      "is_primary": true,
      "skill_name": "Data Visualization"
    },
    {
      "is_primary": true,
      "skill_name": "Data Integration"
    },
    {
      "is_primary": true,
      "skill_name": "Hadoop"
    },
    {
      "is_primary": true,
      "skill_name": "Agile"
    },
    {
      "is_primary": true,
      "skill_name": "Kanban"
    },
    {
      "is_primary": true,
      "skill_name": "Scrum"
    },
    {
      "is_primary": false,
      "skill_name": "Machine Learning"
    }
  ],
  "jd_role": {
    "display_name": "Principal - Data Engineer",
    "rationale": null,
    "role_aliases": [
      "Data Engineer",
      "Senior Data Engineer",
      "Lead Data Engineer"
    ],
    "role_archetype": "Data",
    "slug": ""
  },
  "nano_parsed": {
    "JD_type": "pass",
    "about_company": {
      "source_marker": {
        "first_5_words": "At Fidelity, we are focused",
        "last_5_words": "visit India.Fidelity.com."
      },
      "text": "At Fidelity, we are focused on making our financial expertise broadly accessible and effective in helping people live the lives they want. We are a privately held company that places a high degree of value in creating and nurturing a work environment that attracts the best talent and reflects our commitment to our associates. We are proud of our diverse and inclusive workplace where we respect and value our associates for their unique perspectives and experiences. Fidelity India has been the Global Inhouse Center of Fidelity Investments since 2003 with offices in Bangalore and Chennai. For information about working at Fidelity, visit India.Fidelity.com.",
      "word_count": 84
    },
    "certifications": [],
    "company_name": "Fidelity Center for Applied Technology",
    "ctc": null,
    "domain": {
      "primary": {
        "aliases": [
          "Finance",
          "Banking"
        ],
        "domain": "Financial Services"
      },
      "secondary": null
    },
    "education": [
      {
        "level": "Bachelor\u0027s",
        "qualification": "BTECH/BE/BSC - Technology Related Field",
        "raw": "Bachelor\u2019s or Masters in a technology related field (e.g. Engineering, Computer Science, etc.) required",
        "requirement": "required"
      },
      {
        "level": "Bachelor\u0027s",
        "qualification": "BTECH/BE/BSC - Data Analysis / Data Modeling / Computer Science / Engineering / Finance / Economics",
        "raw": "Academic background in data analysis, data modeling, computer science, engineering, finance, economics, or related discipline preferred",
        "requirement": "preferred"
      }
    ],
    "experience": {
      "max": null,
      "min": 6,
      "raw": "6+ years of data related experience across different sources of data"
    },
    "job_locations": [
      {
        "aliases": [
          "Bengaluru"
        ],
        "city": "Bangalore",
        "country": "India",
        "state": "Karnataka",
        "work_mode": "null"
      },
      {
        "aliases": [],
        "city": "Chennai",
        "country": "India",
        "state": "Tamil Nadu",
        "work_mode": "null"
      }
    ],
    "role": "Principal - Data Engineer",
    "role_aliases": [
      "Data Engineer",
      "Senior Data Engineer",
      "Lead Data Engineer"
    ],
    "role_archetype": "Data",
    "roles_and_responsibilities": [
      {
        "bullet_count": 0,
        "heading": "The Purpose of your Role",
        "heading_was_present": true,
        "source_marker": {
          "first_5_words": "The role will be a",
          "last_5_words": "initiatives in the organization."
        },
        "text": "The role will be a core member of an AI/ML focused incubator team consisting of Product Management, Data Science, Product Design, Design Strategy, Engineering and System Architecture professionals, all focused on developing capabilities, products and services to improve investment outcomes and enrich the customer experience. You will apply your data engineering skills to vast and varied data sets on one of the industry\u2019s leading developmental networks. Present reports and findings to senior level technical and non-technical audience. Implement new technologies in a production environment with product, IT, and data engineering teams. Acting as a catalyst for machine learning initiatives in the organization.",
        "word_count": 90
      },
      {
        "bullet_count": 0,
        "heading": "The Value You Deliver",
        "heading_was_present": true,
        "source_marker": {
          "first_5_words": "Accountable for consistent delivery",
          "last_5_words": "to ongoing development efforts."
        },
        "text": "Accountable for consistent delivery of functional software. Experience in software development practices and procedures. Passionate about data. Develops original and creative technical solutions to ongoing development efforts.",
        "word_count": 40
      },
      {
        "bullet_count": 0,
        "heading": "The Skills that are Key to this role - Technical \u0026 Behavioral",
        "heading_was_present": true,
        "source_marker": {
          "first_5_words": "Advanced working SQL knowledge and",
          "last_5_words": "design and code for the best solutions."
        },
        "text": "Advanced working SQL knowledge and experience working with relational databases, query authoring (SQL) as well as working familiarity with a variety of databases. Solid experience using relational databases (PostgreSQL, Oracle) as well as data movement technologies (ETL/ELT) from disparate sources. Solid experience using data related services in AWS Data services like DMS, SCT, RDS, Aurora, Redshift etc. Experience in database administration. Build processes supporting data transformation, data structures, metadata, dependency and workload management using programming languages such as Java or Python. Data quality management experience, including data fine-tuning and testing to project specifications. Work with stakeholders including the Executive, Product, Data and Design teams to assist with data-related technical issues and support their data infrastructure needs. Experience performing root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement. A successful history of manipulating, processing and extracting value from both small and large disconnected datasets. Experience supporting and working with cross-functional teams in a dynamic environment. Strong analytical and problem-solving skills. Willingness to constantly re-iterate design and code for the best solutions.",
        "word_count": 233
      },
      {
        "bullet_count": 0,
        "heading": "The Skills that are Good to Have",
        "heading_was_present": true,
        "source_marker": {
          "first_5_words": "Experience with financial and legal",
          "last_5_words": "or accelerator setup is a plus."
        },
        "text": "Experience with financial and legal datasets is an advantage. Experience in using big data platforms (Hadoop). Experience in working on an incubator or accelerator setup is a plus.",
        "word_count": 30
      },
      {
        "bullet_count": 0,
        "heading": "The Expertise we require",
        "heading_was_present": true,
        "source_marker": {
          "first_5_words": "Bachelor\u2019s or Masters in a",
          "last_5_words": "methodologies (Kanban and Scrum) preferred."
        },
        "text": "Bachelor\u2019s or Masters in a technology related field (e.g. Engineering, Computer Science, etc.) required. Academic background in data analysis, data modeling, computer science, engineering, finance, economics, or related discipline preferred. 9+ years of IT experience. 6+ years of data related experience across different sources of data. 6+ years of intermediate level of programming experience using either Python or Java. Demonstrated expertise across a broad range of research/analytics methodologies like data warehousing, data visualization and data integration to develop key findings for highly complex business problems. Ability to turn complex and large volumes of data into \u0027intelligent data\u0027 that answers critical business questions. Strong experience in Agile methodologies (Kanban and Scrum) preferred.",
        "word_count": 118
      }
    ],
    "urls": []
  },
  "rejected": false,
  "rejection_reason": null,
  "run_id": "e676df21-b586-432c-952d-fed47e334f3d",
  "stage3_signals": {
    "alias_found": true,
    "alias_match_roles": [
      {
        "display_name": "Data Engineer",
        "kra_matches": null,
        "matched_count": null,
        "matched_skills": null,
        "role_id": 2,
        "score": 1.0,
        "slug": "data-engineer",
        "total_count": null
      }
    ],
    "kra_match_roles": [
      {
        "display_name": "Data Engineer",
        "kra_matches": [
          {
            "kra_text": "Works with data analysts, data scientists, and business stakeholders to define data models, ingestion schedules, and data delivery requirements.",
            "sentence": "Work with stakeholders including the Executive, Product, Data and Design teams to assist with data-related technical issues and support their data infrastructure needs.",
            "similarity": 0.672
          },
          {
            "kra_text": "Implements data quality validation rules, reconciliation checks, and anomaly detection to ensure data completeness, accuracy, and consistency.",
            "sentence": "Data quality management experience, including data fine-tuning and testing to project specifications.",
            "similarity": 0.5823
          },
          {
            "kra_text": "Develops batch and real-time streaming data pipelines using Apache Spark, Apache Kafka, Apache Flink, or Airflow for data movement and processing at scale.",
            "sentence": "Build processes supporting data transformation, data structures, metadata, dependency and workload management using programming languages such as Java or Python.",
            "similarity": 0.5521
          }
        ],
        "matched_count": null,
        "matched_skills": null,
        "role_id": 2,
        "score": 0.6022,
        "slug": "data-engineer",
        "total_count": null
      },
      {
        "display_name": "Fullstack Developer",
        "kra_matches": [
          {
            "kra_text": "Designs and queries relational databases like PostgreSQL and document stores like MongoDB, writing migrations, indexes, and optimized queries.",
            "sentence": "Solid experience using relational databases (PostgreSQL, Oracle) as well as data movement technologies (ETL/ELT) from disparate sources.",
            "similarity": 0.5849
          },
          {
            "kra_text": "Works closely with product managers and UX designers to translate requirements and wireframes into working software features through iterative development.",
            "sentence": "Develops original and creative technical solutions to ongoing development efforts.",
            "similarity": 0.5308
          },
          {
            "kra_text": "Works closely with product managers and UX designers to translate requirements and wireframes into working software features through iterative development.",
            "sentence": "Work with stakeholders including the Executive, Product, Data and Design teams to assist with data-related technical issues and support their data infrastructure needs.",
            "similarity": 0.516
          }
        ],
        "matched_count": null,
        "matched_skills": null,
        "role_id": 15,
        "score": 0.5439,
        "slug": "full-stack-engineer",
        "total_count": null
      },
      {
        "display_name": "Backend Developer",
        "kra_matches": [
          {
            "kra_text": "Investigates and resolves production incidents, API bugs, and service degradation through root cause analysis, hotfixes, and post-mortems.",
            "sentence": "Experience performing root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement.",
            "similarity": 0.5782
          },
          {
            "kra_text": "Writes database access logic including SQL queries, ORM mappings, stored procedures, and migration scripts for relational databases like PostgreSQL and MySQL.",
            "sentence": "Solid experience using relational databases (PostgreSQL, Oracle) as well as data movement technologies (ETL/ELT) from disparate sources.",
            "similarity": 0.5396
          },
          {
            "kra_text": "Writes database access logic including SQL queries, ORM mappings, stored procedures, and migration scripts for relational databases like PostgreSQL and MySQL.",
            "sentence": "Advanced working SQL knowledge and experience working with relational databases, query authoring (SQL) as well as working familiarity with a variety of databases.",
            "similarity": 0.4949
          }
        ],
        "matched_count": null,
        "matched_skills": null,
        "role_id": 1,
        "score": 0.5376,
        "slug": "backend-engineer",
        "total_count": null
      },
      {
        "display_name": "Flutter Developer",
        "kra_matches": [
          {
            "kra_text": "collaborate with design, product, and backend teams",
            "sentence": "Work with stakeholders including the Executive, Product, Data and Design teams to assist with data-related technical issues and support their data infrastructure needs.",
            "similarity": 0.5466
          },
          {
            "kra_text": "collaborate with design, product, and backend teams",
            "sentence": "Implement new technologies in a production environment with product, IT, and data engineering teams.",
            "similarity": 0.5285
          },
          {
            "kra_text": "collaborate with design, product, and backend teams",
            "sentence": "Experience supporting and working with cross-functional teams in a dynamic environment.",
            "similarity": 0.4672
          }
        ],
        "matched_count": null,
        "matched_skills": null,
        "role_id": 74,
        "score": 0.5141,
        "slug": "flutter-developer",
        "total_count": null
      },
      {
        "display_name": "DevOps Engineer",
        "kra_matches": [
          {
            "kra_text": "Collaborates with development teams to improve build processes, reduce deployment friction, containerize applications, and adopt DevOps best practices.",
            "sentence": "Implement new technologies in a production environment with product, IT, and data engineering teams.",
            "similarity": 0.5437
          },
          {
            "kra_text": "Collaborates with development teams to improve build processes, reduce deployment friction, containerize applications, and adopt DevOps best practices.",
            "sentence": "Build processes supporting data transformation, data structures, metadata, dependency and workload management using programming languages such as Java or Python.",
            "similarity": 0.5057
          },
          {
            "kra_text": "Collaborates with development teams to improve build processes, reduce deployment friction, containerize applications, and adopt DevOps best practices.",
            "sentence": "Develops original and creative technical solutions to ongoing development efforts.",
            "similarity": 0.4777
          }
        ],
        "matched_count": null,
        "matched_skills": null,
        "role_id": 10,
        "score": 0.5091,
        "slug": "devops-engineer",
        "total_count": null
      }
    ],
    "skill_match_roles": [
      {
        "display_name": "Data Engineer",
        "kra_matches": null,
        "matched_count": 7,
        "matched_skills": [
          "AWS",
          "Hadoop",
          "Java",
          "Python",
          "RDS",
          "Redshift",
          "SQL"
        ],
        "role_id": 2,
        "score": 0.35,
        "slug": "data-engineer",
        "total_count": 20
      },
      {
        "display_name": "Fullstack Developer",
        "kra_matches": null,
        "matched_count": 6,
        "matched_skills": [
          "AWS",
          "Java",
          "PostgreSQL",
          "Python",
          "RDS",
          "Redshift"
        ],
        "role_id": 15,
        "score": 0.3,
        "slug": "full-stack-engineer",
        "total_count": 20
      },
      {
        "display_name": "Backend Developer",
        "kra_matches": null,
        "matched_count": 6,
        "matched_skills": [
          "AWS",
          "Java",
          "PostgreSQL",
          "Python",
          "RDS",
          "Redshift"
        ],
        "role_id": 1,
        "score": 0.3,
        "slug": "backend-engineer",
        "total_count": 20
      },
      {
        "display_name": "Engineering Manager",
        "kra_matches": null,
        "matched_count": 6,
        "matched_skills": [
          "AWS",
          "Agile",
          "Java",
          "Python",
          "SQL",
          "Scrum"
        ],
        "role_id": 121,
        "score": 0.3,
        "slug": "engineering-manager",
        "total_count": 20
      },
      {
        "display_name": "Python Backend Developer",
        "kra_matches": null,
        "matched_count": 5,
        "matched_skills": [
          "AWS",
          "PostgreSQL",
          "Python",
          "RDS",
          "Redshift"
        ],
        "role_id": 80,
        "score": 0.25,
        "slug": "python-backend-developer",
        "total_count": 20
      }
    ]
  },
  "stage4_decision": {
    "alias_collision_detected": false,
    "case": "A",
    "chosen_role": {
      "display_name": "Data Engineer",
      "kra_matches": null,
      "matched_count": null,
      "matched_skills": null,
      "role_id": 2,
      "score": 1.0,
      "slug": "data-engineer",
      "total_count": null
    },
    "confidence": 1.0,
    "is_new_role": false,
    "llm2_fired": false,
    "llm2_reasoning": null,
    "matched_dimensions": [],
    "matched_kras": [],
    "matched_skills": [],
    "new_role_display_name": null,
    "new_role_slug": null,
    "queued": false,
    "reasoning": "Exact alias hit on data-engineer (1.0) \u2014 no other alias at this confidence; skill_top data-engineer 0.35 does not contradict",
    "sub_role": null
  },
  "stage5_updates": {
    "centroid_n_after": 521,
    "centroid_updated": true,
    "collision_log_id": null,
    "new_kra_attached": null,
    "new_skills_attached": [
      {
        "is_primary": true,
        "queue_id": 25727,
        "role_display_name": "Data Engineer",
        "role_slug": "data-engineer",
        "skill_name": "Oracle",
        "status": "pending"
      },
      {
        "is_primary": true,
        "queue_id": 25728,
        "role_display_name": "Data Engineer",
        "role_slug": "data-engineer",
        "skill_name": "ETL",
        "status": "pending"
      },
      {
        "is_primary": true,
        "queue_id": 25729,
        "role_display_name": "Data Engineer",
        "role_slug": "data-engineer",
        "skill_name": "ELT",
        "status": "pending"
      },
      {
        "is_primary": true,
        "queue_id": 25730,
        "role_display_name": "Data Engineer",
        "role_slug": "data-engineer",
        "skill_name": "DMS",
        "status": "pending"
      },
      {
        "is_primary": true,
        "queue_id": 25731,
        "role_display_name": "Data Engineer",
        "role_slug": "data-engineer",
        "skill_name": "SCT",
        "status": "pending"
      },
      {
        "is_primary": true,
        "queue_id": 25732,
        "role_display_name": "Data Engineer",
        "role_slug": "data-engineer",
        "skill_name": "Aurora",
        "status": "pending"
      },
      {
        "is_primary": true,
        "queue_id": 25733,
        "role_display_name": "Data Engineer",
        "role_slug": "data-engineer",
        "skill_name": "Data Warehousing",
        "status": "pending"
      },
      {
        "is_primary": true,
        "queue_id": 25734,
        "role_display_name": "Data Engineer",
        "role_slug": "data-engineer",
        "skill_name": "Data Visualization",
        "status": "pending"
      },
      {
        "is_primary": true,
        "queue_id": 25735,
        "role_display_name": "Data Engineer",
        "role_slug": "data-engineer",
        "skill_name": "Data Integration",
        "status": "pending"
      },
      {
        "is_primary": true,
        "queue_id": 25736,
        "role_display_name": "Data Engineer",
        "role_slug": "data-engineer",
        "skill_name": "Kanban",
        "status": "pending"
      }
    ],
    "queue_entry_id": null,
    "v3_pipeline_triggered": false,
    "v3_role_slug": null,
    "v3_run_id": null
  }
}
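
Note on the skill_match_roles scores: in this run, each score equals matched_count divided by total_count (7/20 = 0.35, 6/20 = 0.3, 5/20 = 0.25). The sketch below checks that relationship against the values copied from the output above. It is only a consistency check under that assumption, not the pipeline's actual scoring code; the helper names `coverage_ratio` and `mismatches` are hypothetical.

```python
# Values copied from the skill_match_roles entries in the run output above.
skill_match_roles = [
    {"slug": "data-engineer", "matched_count": 7, "total_count": 20, "score": 0.35},
    {"slug": "full-stack-engineer", "matched_count": 6, "total_count": 20, "score": 0.3},
    {"slug": "backend-engineer", "matched_count": 6, "total_count": 20, "score": 0.3},
    {"slug": "engineering-manager", "matched_count": 6, "total_count": 20, "score": 0.3},
    {"slug": "python-backend-developer", "matched_count": 5, "total_count": 20, "score": 0.25},
]


def coverage_ratio(matched_count: int, total_count: int) -> float:
    """Hypothetical helper: matched skills as a fraction of total_count."""
    if total_count <= 0:
        raise ValueError("total_count must be positive")
    return matched_count / total_count


def mismatches(roles: list[dict], tolerance: float = 1e-4) -> list[str]:
    """Return slugs whose reported score differs from matched/total."""
    return [
        role["slug"]
        for role in roles
        if abs(coverage_ratio(role["matched_count"], role["total_count"]) - role["score"]) > tolerance
    ]


if __name__ == "__main__":
    # An empty list means every reported score matches the simple ratio.
    print(mismatches(skill_match_roles))
```

An empty result means the reported scores agree with the simple ratio for this run. It does not show how total_count is derived or whether other runs follow the same rule.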

API 2 — extract-details

{}

API 3 — final-role-output

{}