Pipeline run
9d19bc35-b551-41a5-bc0f-bbb5aec8a408
Client output enrichment
v2 Skill cluster · Nature of work · AI index · Tech stack maturity · Evidence · KRA descriptionNature of work
—
Tech stack maturity
Mainstream Modern
AI index (0 = no AI use, 5 = totally AI-dependent · v2.1)
0.50 / 5
· Title match
✓ Has AI skill
· AI skill (primary)
✓ AI skill (secondary)
· On AI team
· Builds AI products
vocab breakdown (legacy)
Assistants (×1):
—
Frameworks (×2):
pgvector
Models / concepts (×3):
AI
Evidence — skills matched in JD (15)
PostgreSQL
SQL
Elasticsearch
Redis
Python
Alembic
golang-migrate
TimescaleDB
ClickHouse
Kafka
Airflow
dbt
Metabase
Superset
pgvector
Skill cluster (0 dimension groups, role-scoped)
Status:
extract_from_jd_done
Created: 2026-05-10T13:31:35.451432Z
Updated: 2026-05-10T13:31:35.451432Z
Flow
Current 3-step pipeline
1 POST /skills/extract-from-jd
2 POST /skills/extract-details
3 POST /skills/final-role-output
Role
Chosen role & resolution
No chosen role stored for this run.
Job description
Job Summary You will design and optimize the database architecture supporting marketplace modules. You'll manage PostgreSQL (including TimescaleDB for time-series data), Elasticsearch for search, Redis for caching, and build ETL pipelines for analytics and reporting dashboards. Key Responsibilities / Duties Database Design & Management Design PostgreSQL schemas for 40+ tables across 6 domains Implement partitioning strategies for large tables (transactions, listings) Configure TimescaleDB hypertables for time-series data Write and optimize complex SQL queries Manage database migrations (Alembic/golang-migrate) Search & Analytics Design Elasticsearch indices for multi-module search Implement faceted search with aggregations Build real-time analytics pipelines using Kafka + ClickHouse Create reporting tables for dashboards (Entity Admin, Finance Manager) Performance Implement query optimization and indexing strategies Configure read replicas for analytics workloads Set up connection pooling (PgBouncer) Monitor database performance (slow query logs, pg_stat_statements) Data Governance Implement data retention policies Manage backup and recovery procedures Ensure GDPR/CCPA compliance for data deletion Implement data anonymization for M7 (Enterprise Data) Required Skills & Qualifications Must-Have (4+ years overall) SkillProficiencyNotes PostgreSQL4+ yearsPerformance tuning, partitioning, replication SQL4+ yearsComplex joins, window functions, CTEs Elasticsearch2+ yearsIndex design, aggregations, query DSL Redis2+ yearsCaching, rate limiting Python2+ yearsETL scripts Database Migrations2+ yearsAlembic or golang-migrate Preferred / Good-to-Have Skills SkillWhy It Matters TimescaleDBTime-series (price history, utilization metrics) ClickHouseReal-time analytics KafkaEvent streaming Airflow / dbtETL orchestration Metabase / SupersetDashboarding pgvectorVector search for AI features Key Tables / Data Models Transactions: tables – transactions, escrow_holds | Row growth estimate – 10M/year Listings: tables – listings, listing_versions | Row growth estimate – 5M/year Price History: tables – price_history (TimescaleDB) | Row growth estimate – 100M/year AI Conversations: tables – ai_conversations | Row growth estimate – 50M/year
Skills from this JD
Each row merges API 1 extraction, API 2 library match / v3 orchestration (dimensions + locked dims), and API 3 persistence tags.
PostgreSQL
Primary
No API 2 row (run stopped after API 1 or history missing)
SQL
Primary
No API 2 row (run stopped after API 1 or history missing)
Elasticsearch
Primary
No API 2 row (run stopped after API 1 or history missing)
Redis
Primary
No API 2 row (run stopped after API 1 or history missing)
Python
Primary
No API 2 row (run stopped after API 1 or history missing)
Alembic
Primary
No API 2 row (run stopped after API 1 or history missing)
golang-migrate
Primary
No API 2 row (run stopped after API 1 or history missing)
TimescaleDB
Secondary
No API 2 row (run stopped after API 1 or history missing)
ClickHouse
Secondary
No API 2 row (run stopped after API 1 or history missing)
Kafka
Secondary
No API 2 row (run stopped after API 1 or history missing)
Airflow
Secondary
No API 2 row (run stopped after API 1 or history missing)
dbt
Secondary
No API 2 row (run stopped after API 1 or history missing)
Metabase
Secondary
No API 2 row (run stopped after API 1 or history missing)
Superset
Secondary
No API 2 row (run stopped after API 1 or history missing)
pgvector
Secondary
No API 2 row (run stopped after API 1 or history missing)
Library artifacts (this run)
No artifact rows for this run.
API 1 — extract-from-jd click to toggle
{
"final_skills": [
{
"is_primary": true,
"skill_name": "PostgreSQL"
},
{
"is_primary": true,
"skill_name": "SQL"
},
{
"is_primary": true,
"skill_name": "Elasticsearch"
},
{
"is_primary": true,
"skill_name": "Redis"
},
{
"is_primary": true,
"skill_name": "Python"
},
{
"is_primary": true,
"skill_name": "Alembic"
},
{
"is_primary": true,
"skill_name": "golang-migrate"
},
{
"is_primary": false,
"skill_name": "TimescaleDB"
},
{
"is_primary": false,
"skill_name": "ClickHouse"
},
{
"is_primary": false,
"skill_name": "Kafka"
},
{
"is_primary": false,
"skill_name": "Airflow"
},
{
"is_primary": false,
"skill_name": "dbt"
},
{
"is_primary": false,
"skill_name": "Metabase"
},
{
"is_primary": false,
"skill_name": "Superset"
},
{
"is_primary": false,
"skill_name": "pgvector"
}
],
"run_id": null
}
API 2 — extract-details
{}
API 3 — final-role-output
{}
LLM Calls
Every model call made for this run, in pipeline order. Click a card to see the model's response.
Loading…