Md Saidul Islam — AI & Machine Learning Engineer

Who I Am

About Me

I build AI and machine learning systems with a focus on measurable operational outcomes rather than model accuracy in isolation. My work at HHS Robotics on live warehouse data produced two systems with documented results: a 43.5% improvement in anomaly detection precision with a 13.7% reduction in false positives, and a document intelligence system that achieved 98% factuality accuracy against a 53% TF-IDF baseline, with a 37.2% improvement in information extraction accuracy and a 29.8% reduction in verification errors.

I approach agent design with explicit uncertainty gates and human escalation paths because systems that act on low-confidence output without a fallback are not ready for operational environments. Current focus: Vakt — a multi-domain AI governance platform for Norwegian financial services (CFO document intelligence, AML/KYC/SAR compliance with HITL gates, and infrastructure posture on Terraform/Checkov) with append-only audit trails and tool policies enforced in code; and NorgeOps (NorthSea AgentOps), a reference architecture for agentic offshore production surveillance on real Equinor Volve field data.

I work comfortably across the boundary between technical and non-technical stakeholders, translating model behaviour and confidence levels into language that domain experts can act on. My default when building is to reuse what the platform already provides and build custom only when there is a clear gap — discipline matters more than cleverness in industrial AI delivery.

Languages: English (Full professional) · Norwegian Bokmål (Beginner, actively learning) · Bengali (Native) · Hindi (Spoken)

43.5%Anomaly Detection Precision Gain

98%Factuality Accuracy

1.00RAGAS Faithfulness

97.1%Faster Training vs Baseline

Education

Sep 2022 – May 2025

M.S. Applied Computer Science · UiT, Norway

AI & Intelligent Agents · Algorithms Design · Systems Programming · Grade: B (Thesis)

Transcript

Sep 2016 – May 2020

B.S. Software Engineering · Daffodil International University

Data Structures · Algorithms · Database Systems · Software Engineering

Transcript Diploma

2016 – 2025

Competitive Programming · Beecrowd / URI Online Judge

146+ challenges solved · 85%+ accuracy · Dynamic programming, graphs, data structures

Work History

Professional Experience

Research Assistant BEaM · UiT

UiT – The Arctic University of Norway · Department of Building, Energy and Material Technology · Narvik, Norway

May 2026 – Present

Ongoing computational research at UiT Narvik’s BEaM group — applying machine learning and data engineering to support sustainable energy and environmental engineering studies.

Build data-driven workflows for faculty-led research: literature synthesis, structured datasets, and predictive modeling
Apply Python-based ML and statistical methods in a cross-disciplinary research setting
Collaborate with supervisors on analysis, documentation, and academic outputs

PythonMachine LearningData Engineering ResearchPandasScientific Computing

Software Developer

Jobswoop AS · Oslo, Norway

Nov 2025 – Feb 2026

Delivered production features for a multi-tenant B2B SaaS platform — same governance patterns applied in industrial AI contexts where consequential decisions require human approval gates.

Built production features on a multi-tenant B2B SaaS (NestJS, Next.js 15, TypeScript): RBAC, per-tenant data isolation, notification workflows, and document flows with traceability requirements
Shipped a Gemini AI-assisted drafting feature with a human approval gate before any AI-generated content was published — same governance pattern required when AI operates near consequential decisions
PostgreSQL schema design, Prisma migrations, encrypted asset storage on AWS S3; GitHub Actions CI/CD introduced from scratch
Code review and agile sprints with stakeholder demos

NestJSNext.js 15TypeScript PostgreSQLPrismaAWS S3 Gemini AIDockerGitHub Actions

Systems and AI Developer M.S. Research

HHS Robotics / UiT · Narvik, Norway

Jan 2025 – Oct 2025

Full-time embedded at HHS Robotics, building AI systems on live production data from an operational warehouse robot. Research conducted on operational systems and published as M.Sc. thesis (UiT, 2025, NVA).

ARMADA (anomaly detection): Context-Augmented Anomaly Detection combining Isolation Forest, One-Class SVM, and LOF with fine-tuned GPT for maintenance recommendations. 43.5% improvement in detection precision, 13.7% false-positive reduction, and 97.1% faster training than Anomaly Transformer baseline on live telemetry data
FACTS (document intelligence): Layout-aware PDF processing with LLM-based factuality verification. 98% factuality accuracy vs 53% TF-IDF baseline; 37.2% improvement in information extraction accuracy and 29.8% reduction in verification errors across factual, procedural, list, and reasoning query types
Kafka-to-PostgreSQL real-time telemetry pipeline; FastAPI services; Streamlit operational dashboards; all results statistically validated (Welch's t-test, formal error bounds)

PythonFastAPIKafka PostgreSQLLLM fine-tuningLangChain DockerMLflowStreamlit

Shift Leader

Burger King · Narvik, Norway

May 2023 – Jul 2025

Led shift operations for a team of 8–12 people, developing practical judgment for when to resolve independently and when to escalate — a pattern that transfers directly to cross-functional technical work.

Task scheduling, stock checks, and real-time coordination to maintain service quality under high-volume periods; escalated equipment and supply issues to management with clear situational summaries so decisions could be made quickly without losing context
Handled customer conflict resolution directly and on the spot, de-escalating situations calmly while keeping the rest of the team focused on operations

Team LeadershipOperations Escalation ManagementCross-functional Coordination

Technical Stack

Skills & Expertise

Agentic AI & LLM Engineering

LangGraph (StateGraph)CheckpointingConditional routing Planner–Executor–Critic–ChallengerUncertainty Gate ReAct loops + tool callingMulti-agent coordination Session tool allowlistsDSPy prompt optimization Human-in-the-loop (HITL)Append-only audit events Prompt injection defense

RAG & Retrieval

Hybrid vector + BM25 (Azure AI Search)pgvector + RRF Chunk hashingField-overview injection Lexical overlap rerankingStreaming citations (SSE) RAGAS / faithfulness checksLLM-as-judge evals AWS OpenSearch Serverless

ML & Anomaly Detection

Isolation ForestOne-Class SVMLOF CUSUM state machinesZ-score rolling windows Adaptive thresholdsPersistence streaks Refractory cooldownTemporal train/test split Welch's t-testscikit-learn

MLOps & Evaluation

MLflow Model RegistryStaged promotionRollback DVC pipelinesGitHub Actions CI (OIDC) Parallel quality gatesGolden / eval datasets Agent eval runners (mock + live)Adversarial probes RAGAS + custom assertionsDSPy BootstrapFewShot

Systems & Integration

FastAPI (async + Pydantic v2)Kafka / Azure Service Bus PostgreSQL / pgvectorRow-Level Security (RLS) JWT / JWKS (OIDC)Event-driven (async queues) NestJSNext.js 14 App RouterPrisma

Cloud, Observability & Delivery

Azure Container AppsAzure Service Bus Azure OpenAIAzure AI Search Azure Blob / Key VaultManaged identity (DefaultAzureCredential) AWS BedrockOpenSearch ServerlessDynamoDB Prometheus /metricsOpenTelemetry (OTLP) structlog (JSON)Checkov / IaC scanning Kubernetes (HPA, NetworkPolicy)GrafanaDocker Terraform (azurerm)GitHub Actions (OIDC → ACR)

Core Competency Levels

Agentic AI & LLM Engineering (LangGraph, uncertainty gates)92%

RAG & Retrieval (hybrid BM25 + pgvector, RAGAS evaluation)90%

ML & Anomaly Detection (ensemble methods, CUSUM, statistical validation)88%

MLOps & CI Evaluation (MLflow, DVC, GitHub Actions quality gates)87%

Cloud & Delivery (Azure, AWS, Docker, Terraform, OIDC)85%

Portfolio

AI, Data & Cloud Projects

Production-grade systems across Generative AI, Machine Learning, Data Engineering, and Cloud infrastructure.

Academic Work

Research

Research Assistant · UiT Narvik

BEaM Research Group · Department of Building, Energy and Material Technology · May 2026 – Present

Ongoing

Machine Learning for Sustainable Energy & Environmental Systems

UiT – The Arctic University of Norway · Campus Narvik

Current role supporting computational research at the BEaM group — combining data engineering and machine learning with building, energy, and materials research. Day-to-day work spans curated datasets, predictive modeling, and research documentation in collaboration with faculty supervisors.

MLPredictive modeling & analysis

DataStructured research datasets

BEaMEnergy & environmental focus

Current Focus

Literature synthesis and research data curation
Machine learning pipelines for engineering research applications
Analysis, validation, and documentation for academic outputs

Specific project details are not shared publicly while research is in progress.

Master's Thesis · UiT, Arctic University of Norway

Computer Science & Computational Engineering · Completed May 2025 · Grade: B

Automated Data Analysis with Large Language Models for Warehouse Robotics Applications

Industry Partner: HHS Robotics AS

Developed two novel AI frameworks — ARMADA (Context-Augmented Anomaly Detection) and FACTS (Factual AI Contextualization & Troubleshooting System) — integrating LLMs with domain-specific processing for automated diagnostics in industrial warehouse robotics.

43.5%Anomaly detection precision improvement

97.1%Reduction in model training time

37.2%Document processing accuracy gain

98%Factuality verification accuracy

40%False-positive rate reduction

68–78%Documentation search time saved

Technical Approach

Context-Augmented Anomaly Detection (CAAD) — ensemble of Isolation Forest, One-Class SVM, Local Outlier Factor
Fine-tuned GPT models for maintenance recommendation generation
Multimodal document processing with layout-aware PDF analysis
Mathematical factuality verification using TF-IDF and named entity overlap
Deployed across 50+ operational warehouse robots with 99.5% uptime

Recognition

Awards & Achievements

1^st Runner-Up 2nd Place

AWS Hackathon November 2025

AWS GenAI Hackathon 2025 — First Runner-Up

Privacy Policy & Terms Analyzer — AI-powered browser extension using AWS Bedrock & Generative AI

Awarded first runner-up for an innovative full-stack GenAI application that automatically analyzes privacy policies, terms & conditions, and cookie agreements using AWS Bedrock (Claude Sonnet 4) — surfacing the 3–5 most harmful clauses for users and enabling contextual RAG-powered chat about any policy document.

Full-Stack End-to-end build

1,536-dim Titan Embeddings

RAG Chat Contextual Q&A

Sig V4 Custom AWS Auth

AI & Cloud Services

AWS Bedrock — Claude Sonnet 4 for policy analysis & harmful clause detection
Amazon Titan Embeddings (1,536-dimensional vectors) for semantic search
AWS OpenSearch Serverless — vector database powering RAG contextual chat
AWS DynamoDB — NoSQL storage for analyzed policies with full CRUD

Architecture & Engineering

Chrome/Edge browser extension (JavaScript) with custom AWS Signature V4 auth — direct browser-to-AWS API without backend proxy
Python FastAPI backend — CORS middleware, service layer architecture (Bedrock, DynamoDB, Vector DB), Pydantic validation
boto3 SDK integration across all AWS services

AWS Bedrock Claude Sonnet 4 Titan Embeddings DynamoDB OpenSearch Serverless Python / FastAPI boto3 JavaScript Browser Extension RAG AWS Sig V4

Learning & Credentials

Certifications

Professional development across Azure, Data Engineering, MLOps, and Generative AI — verified credentials from LinkedIn Learning and industry platforms.

Let's Talk

Get in Touch

Open to AI Engineer and ML Engineer roles — full-time. Based in Oslo, Norway.

Contact Details

Email

saidulislambinalisayed@outlook.com

Phone

+47 462 62 219

Location

Oslo, Norway

LinkedIn GitHub Beecrowd

Send a Message

Full Name *

Email *

Subject

Message *

I agree to the Privacy Policy *