Available for opportunities  ·  Oslo, Norway

AI & Machine Learning
Engineer

Building AI and machine learning systems with a focus on measurable operational outcomes. Documented results: 43.5% anomaly detection precision improvement, 98% factuality accuracy, and a production agentic architecture grounded in real North Sea field data. M.S. Applied Computer Science, UiT Arctic University of Norway.

Agentic AI / LangGraph ML & Anomaly Detection Azure & AWS Governed agents & HITL MLOps / CI Evaluation RAG & Retrieval
Md Saidul Islam — AI Engineer & Data Scientist
Open to work

Md Saidul Islam

AI & Machine Learning Engineer

Oslo, Norway

43.5% Anomaly Precision Gain
98% Factuality Accuracy
1.00 RAGAS Faithfulness

About Me

I build AI and machine learning systems with a focus on measurable operational outcomes rather than model accuracy in isolation. My work at HHS Robotics on live warehouse data produced two systems with documented results: a 43.5% improvement in anomaly detection precision with a 13.7% reduction in false positives, and a document intelligence system that achieved 98% factuality accuracy against a 53% TF-IDF baseline, with a 37.2% improvement in information extraction accuracy and a 29.8% reduction in verification errors.

I approach agent design with explicit uncertainty gates and human escalation paths because systems that act on low-confidence output without a fallback are not ready for operational environments. Current focus: Vakt — a multi-domain AI governance platform for Norwegian financial services (CFO document intelligence, AML/KYC/SAR compliance with HITL gates, and infrastructure posture on Terraform/Checkov) with append-only audit trails and tool policies enforced in code; and NorgeOps (NorthSea AgentOps), a reference architecture for agentic offshore production surveillance on real Equinor Volve field data.

I work comfortably across the boundary between technical and non-technical stakeholders, translating model behaviour and confidence levels into language that domain experts can act on. My default when building is to reuse what the platform already provides and build custom only when there is a clear gap — discipline matters more than cleverness in industrial AI delivery.

Languages: English (Full professional)  ·  Norwegian Bokmål (Beginner, actively learning)  ·  Bengali (Native)  ·  Hindi (Spoken)

43.5%Anomaly Detection Precision Gain
98%Factuality Accuracy
1.00RAGAS Faithfulness
97.1%Faster Training vs Baseline

Education

Sep 2022 – May 2025

M.S. Applied Computer Science  ·  UiT, Norway

AI & Intelligent Agents · Algorithms Design · Systems Programming · Grade: B (Thesis)

Sep 2016 – May 2020

B.S. Software Engineering  ·  Daffodil International University

Data Structures · Algorithms · Database Systems · Software Engineering

2016 – 2025

Competitive Programming  ·  Beecrowd / URI Online Judge

146+ challenges solved · 85%+ accuracy · Dynamic programming, graphs, data structures

Professional Experience

Research Assistant BEaM · UiT

UiT – The Arctic University of Norway  ·  Department of Building, Energy and Material Technology  ·  Narvik, Norway

May 2026 – Present

Ongoing computational research at UiT Narvik’s BEaM group — applying machine learning and data engineering to support sustainable energy and environmental engineering studies.

  • Build data-driven workflows for faculty-led research: literature synthesis, structured datasets, and predictive modeling
  • Apply Python-based ML and statistical methods in a cross-disciplinary research setting
  • Collaborate with supervisors on analysis, documentation, and academic outputs
PythonMachine LearningData Engineering ResearchPandasScientific Computing

Software Developer

Jobswoop AS  ·  Oslo, Norway

Nov 2025 – Feb 2026

Delivered production features for a multi-tenant B2B SaaS platform — same governance patterns applied in industrial AI contexts where consequential decisions require human approval gates.

  • Built production features on a multi-tenant B2B SaaS (NestJS, Next.js 15, TypeScript): RBAC, per-tenant data isolation, notification workflows, and document flows with traceability requirements
  • Shipped a Gemini AI-assisted drafting feature with a human approval gate before any AI-generated content was published — same governance pattern required when AI operates near consequential decisions
  • PostgreSQL schema design, Prisma migrations, encrypted asset storage on AWS S3; GitHub Actions CI/CD introduced from scratch
  • Code review and agile sprints with stakeholder demos
NestJSNext.js 15TypeScript PostgreSQLPrismaAWS S3 Gemini AIDockerGitHub Actions

Systems and AI Developer M.S. Research

HHS Robotics / UiT  ·  Narvik, Norway

Jan 2025 – Oct 2025

Full-time embedded at HHS Robotics, building AI systems on live production data from an operational warehouse robot. Research conducted on operational systems and published as M.Sc. thesis (UiT, 2025, NVA).

  • ARMADA (anomaly detection): Context-Augmented Anomaly Detection combining Isolation Forest, One-Class SVM, and LOF with fine-tuned GPT for maintenance recommendations. 43.5% improvement in detection precision, 13.7% false-positive reduction, and 97.1% faster training than Anomaly Transformer baseline on live telemetry data
  • FACTS (document intelligence): Layout-aware PDF processing with LLM-based factuality verification. 98% factuality accuracy vs 53% TF-IDF baseline; 37.2% improvement in information extraction accuracy and 29.8% reduction in verification errors across factual, procedural, list, and reasoning query types
  • Kafka-to-PostgreSQL real-time telemetry pipeline; FastAPI services; Streamlit operational dashboards; all results statistically validated (Welch's t-test, formal error bounds)
PythonFastAPIKafka PostgreSQLLLM fine-tuningLangChain DockerMLflowStreamlit

Shift Leader

Burger King  ·  Narvik, Norway

May 2023 – Jul 2025

Led shift operations for a team of 8–12 people, developing practical judgment for when to resolve independently and when to escalate — a pattern that transfers directly to cross-functional technical work.

  • Task scheduling, stock checks, and real-time coordination to maintain service quality under high-volume periods; escalated equipment and supply issues to management with clear situational summaries so decisions could be made quickly without losing context
  • Handled customer conflict resolution directly and on the spot, de-escalating situations calmly while keeping the rest of the team focused on operations
Team LeadershipOperations Escalation ManagementCross-functional Coordination

Skills & Expertise

Agentic AI & LLM Engineering

LangGraph (StateGraph)CheckpointingConditional routing Planner–Executor–Critic–ChallengerUncertainty Gate ReAct loops + tool callingMulti-agent coordination Session tool allowlistsDSPy prompt optimization Human-in-the-loop (HITL)Append-only audit events Prompt injection defense

RAG & Retrieval

Hybrid vector + BM25 (Azure AI Search)pgvector + RRF Chunk hashingField-overview injection Lexical overlap rerankingStreaming citations (SSE) RAGAS / faithfulness checksLLM-as-judge evals AWS OpenSearch Serverless

ML & Anomaly Detection

Isolation ForestOne-Class SVMLOF CUSUM state machinesZ-score rolling windows Adaptive thresholdsPersistence streaks Refractory cooldownTemporal train/test split Welch's t-testscikit-learn

MLOps & Evaluation

MLflow Model RegistryStaged promotionRollback DVC pipelinesGitHub Actions CI (OIDC) Parallel quality gatesGolden / eval datasets Agent eval runners (mock + live)Adversarial probes RAGAS + custom assertionsDSPy BootstrapFewShot

Systems & Integration

FastAPI (async + Pydantic v2)Kafka / Azure Service Bus PostgreSQL / pgvectorRow-Level Security (RLS) JWT / JWKS (OIDC)Event-driven (async queues) NestJSNext.js 14 App RouterPrisma

Cloud, Observability & Delivery

Azure Container AppsAzure Service Bus Azure OpenAIAzure AI Search Azure Blob / Key VaultManaged identity (DefaultAzureCredential) AWS BedrockOpenSearch ServerlessDynamoDB Prometheus /metricsOpenTelemetry (OTLP) structlog (JSON)Checkov / IaC scanning Kubernetes (HPA, NetworkPolicy)GrafanaDocker Terraform (azurerm)GitHub Actions (OIDC → ACR)

Core Competency Levels

Agentic AI & LLM Engineering (LangGraph, uncertainty gates)92%
RAG & Retrieval (hybrid BM25 + pgvector, RAGAS evaluation)90%
ML & Anomaly Detection (ensemble methods, CUSUM, statistical validation)88%
MLOps & CI Evaluation (MLflow, DVC, GitHub Actions quality gates)87%
Cloud & Delivery (Azure, AWS, Docker, Terraform, OIDC)85%

AI, Data & Cloud Projects

Production-grade systems across Generative AI, Machine Learning, Data Engineering, and Cloud infrastructure.

Research

Research Assistant  ·  UiT Narvik

BEaM Research Group  ·  Department of Building, Energy and Material Technology  ·  May 2026 – Present

Ongoing

Machine Learning for Sustainable Energy & Environmental Systems

UiT – The Arctic University of Norway  ·  Campus Narvik

Current role supporting computational research at the BEaM group — combining data engineering and machine learning with building, energy, and materials research. Day-to-day work spans curated datasets, predictive modeling, and research documentation in collaboration with faculty supervisors.

MLPredictive modeling & analysis
DataStructured research datasets
BEaMEnergy & environmental focus
Current Focus
  • Literature synthesis and research data curation
  • Machine learning pipelines for engineering research applications
  • Analysis, validation, and documentation for academic outputs

Specific project details are not shared publicly while research is in progress.

Master's Thesis  ·  UiT, Arctic University of Norway

Computer Science & Computational Engineering  ·  Completed May 2025  ·  Grade: B

Automated Data Analysis with Large Language Models for Warehouse Robotics Applications

Industry Partner: HHS Robotics AS

Developed two novel AI frameworks — ARMADA (Context-Augmented Anomaly Detection) and FACTS (Factual AI Contextualization & Troubleshooting System) — integrating LLMs with domain-specific processing for automated diagnostics in industrial warehouse robotics.

43.5%Anomaly detection precision improvement
97.1%Reduction in model training time
37.2%Document processing accuracy gain
98%Factuality verification accuracy
40%False-positive rate reduction
68–78%Documentation search time saved
Technical Approach
  • Context-Augmented Anomaly Detection (CAAD) — ensemble of Isolation Forest, One-Class SVM, Local Outlier Factor
  • Fine-tuned GPT models for maintenance recommendation generation
  • Multimodal document processing with layout-aware PDF analysis
  • Mathematical factuality verification using TF-IDF and named entity overlap
  • Deployed across 50+ operational warehouse robots with 99.5% uptime

Awards & Achievements

1st Runner-Up 2nd Place
AWS Hackathon November 2025

AWS GenAI Hackathon 2025 — First Runner-Up

Privacy Policy & Terms Analyzer — AI-powered browser extension using AWS Bedrock & Generative AI

Awarded first runner-up for an innovative full-stack GenAI application that automatically analyzes privacy policies, terms & conditions, and cookie agreements using AWS Bedrock (Claude Sonnet 4) — surfacing the 3–5 most harmful clauses for users and enabling contextual RAG-powered chat about any policy document.

Full-Stack End-to-end build
1,536-dim Titan Embeddings
RAG Chat Contextual Q&A
Sig V4 Custom AWS Auth
AI & Cloud Services
  • AWS Bedrock — Claude Sonnet 4 for policy analysis & harmful clause detection
  • Amazon Titan Embeddings (1,536-dimensional vectors) for semantic search
  • AWS OpenSearch Serverless — vector database powering RAG contextual chat
  • AWS DynamoDB — NoSQL storage for analyzed policies with full CRUD
Architecture & Engineering
  • Chrome/Edge browser extension (JavaScript) with custom AWS Signature V4 auth — direct browser-to-AWS API without backend proxy
  • Python FastAPI backend — CORS middleware, service layer architecture (Bedrock, DynamoDB, Vector DB), Pydantic validation
  • boto3 SDK integration across all AWS services
AWS Bedrock Claude Sonnet 4 Titan Embeddings DynamoDB OpenSearch Serverless Python / FastAPI boto3 JavaScript Browser Extension RAG AWS Sig V4

Certifications

Professional development across Azure, Data Engineering, MLOps, and Generative AI — verified credentials from LinkedIn Learning and industry platforms.

Get in Touch

Open to AI Engineer and ML Engineer roles — full-time. Based in Oslo, Norway.

Send a Message