All roles

[Remote] Principal Data Scientist (AI)

Remote · USA Full-time New today

Note: The job is a remote job and is open to candidates in USA. Octave is a company that provides mission-critical software for organizations to make informed decisions across the asset lifecycle. They are seeking a Principal Data Scientist to build predictive models and implement Generative AI features for their compliance management platform, requiring expertise in developing and maintaining ML systems in production environments.

Responsibilities

  • Build and deploy Generative AI features using foundation models (AWS Bedrock, OpenAI, Anthropic Claude) and RAG architectures with vector databases for compliance document understanding
  • Design agentic AI systems that autonomously handle compliance workflows, document review, regulatory mapping, and multi-step reasoning tasks
  • Implement comprehensive LLM evaluation frameworks with automated pipelines, custom metrics, benchmark datasets, and safety guardrails ensuring regulatory compliance
  • Build end-to-end MLOps pipelines for model training, deployment, monitoring, versioning, and automated retraining with drift detection
  • Develop predictive models for compliance risk scoring, regulatory change impact, anomaly detection, and time-series forecasting
  • Write production-quality Python code for data processing, feature engineering, API development (FastAPI/Flask), and ETL/ELT workflows
  • Lead A/B experiments and product analytics to measure AI feature impact and drive data-driven decision-making
  • Create explainability frameworks (SHAP/LIME) and monitoring dashboards ensuring transparency and regulatory adherence
  • Collaborate with cross-functional teams to translate business needs into ML solutions and communicate insights to stakeholders

Skills

  • 7+ years in data science, ML engineering, or related roles
  • 3+ years building NLP/generative AI applications and implementing MLOps in production
  • Bachelor's or Master's degree in Data Science, Computer Science, Statistics, or related field (PhD preferred)
  • Track record of deploying ML systems processing large-scale datasets with proper monitoring and governance
  • Python (5+ years): Production-level experience with Pandas, NumPy, scikit-learn, XGBoost, TensorFlow/PyTorch, Hugging Face Transformers, FastAPI/Flask, MLflow, and pytest
  • SQL: Advanced proficiency with complex queries, window functions, and optimization
  • Machine Learning & NLP: Strong foundation in supervised/unsupervised learning, deep learning, document understanding, text classification, and semantic analysis
  • Generative AI & LLMs: Hands-on experience with foundation models (GPT, Claude, Llama), prompt engineering, RAG architectures, and vector databases (Pinecone, Weaviate, Chroma)
  • MLOps & ModelOps: End-to-end experience with ML pipelines, experiment tracking (MLflow, W&B), model versioning, feature stores, drift detection, CI/CD for ML, and Docker containerization
  • LLM Evaluation: Experience with evaluation frameworks (RAGAS, DeepEval), custom metrics, benchmark datasets, and human-in-the-loop validation
  • Cloud & AWS: Experience with AWS services including SageMaker, Bedrock, S3, Lambda, EC2, and CloudWatch
  • Statistics & Experimentation: Strong foundation in statistics, A/B testing, causal inference, and experimental design
  • Visualization: Proficiency with Tableau, Power BI, or Python visualization libraries
  • Experience with agentic AI frameworks (LangGraph, LangChain, AutoGen, CrewAI)
  • Knowledge of Life Sciences/regulated industries (FDA, EMA, ISO, GxP) and compliance management systems
  • Familiarity with big data tools (Spark, Databricks, Snowflake), orchestration (Airflow, Kubeflow), and monitoring tools (Datadog, Prometheus)
  • Experience with LLM fine-tuning, document processing libraries, multi-modal AI, or distributed training
  • Understanding of ML governance, bias detection, model risk management, and data privacy regulations (GDPR, CCPA, HIPAA)
  • Experience working in agile environments with Jira
  • AWS ML certifications or similar credentials

Company Overview

  • Octave provides mission-critical software that empowers organizations to make informed decisions across every stage of the asset lifecycle. It was founded in 1985, and is headquartered in Madison, Alabama, USA, with a workforce of 5001-10000 employees. Its website is https://www.octave.com/.
  • Apply To This Job

    Related roles

    [Remote] AI Workplace Solutions Engineer

    Remote · USA Full-time

    [Remote] MuleSoft Integration Consultant

    Remote · USA Full-time

    [Remote] Client Development Representative, Healthcare (El Paso, Texas)

    Remote · USA Full-time

    [Remote] KSOP Senior Administrator

    Remote · USA Full-time

    [Remote] Remote Clinical Psychologist - Indiana

    Remote · USA Full-time

    [Remote] Program Financial Analyst (FP&A)

    Remote · USA Full-time

    [Remote] Inside Sales Commercial Senior Consultant II

    Remote · USA Full-time

    [Remote] Cybersecurity Analyst, IT Operations

    Remote · USA Full-time

    [Remote] Business Development Manager

    Remote · USA Full-time

    [Remote] Data Center Operations Technician (Remote Hands)

    Remote · USA Full-time

    Experienced Shopify Data Entry & Order Specialist – E-commerce Operations and Customer Experience

    Remote · USA Full-time

    New Products Strategy Director

    Remote · USA Full-time

    Experienced Customer Service Tools Support Specialist – Driving Operational Efficiency and Excellence in arenaflex's Customer Service Framework

    Remote · USA Full-time

    Enterprise Account Executive - DACH

    Remote · USA Full-time

    Customer Experience Operations Analyst

    Remote · USA Full-time

    Hybrid BH Clinical Care Manager (LICSW or LMHC) - Lowell and surrounding

    Remote · USA Full-time

    Data Coordinator Assistant

    Remote · USA Full-time

    Finance Associate - Multi-Entity Accounting (Malaysia)

    Remote · USA Full-time

    Sr. Backend Software Engineer (Video Engineering)

    Remote · USA Full-time

    Entry-Level Remote Data Entry Specialist – arenaflex Product Information Management – $75,000/Year – Work‑From‑Home

    Remote · USA Full-time