About Me
Engineering AI & Data Systems at Scale
I am Qijin Xu (Jack), a Toronto-based lead AI and machine learning engineer delivering production AI systems that combine robust data engineering with modern LLM agents to move critical business metrics.
Bio
Summary
Lead AI/ML Engineer delivering measurable impact via production AI systems. Drove 90% support reduction (525% ROI) with RAG for 50k+ users, and 50% labor reduction with LLM agents (LangGraph), drawing on full-stack expertise from data systems (Spark, Kafka), classic ML (321% fraud lift) to modern LLMs.
Expertise & Stack
Python, Java, Go, TypeScript, SQL, Azure, AWS, GCP, Spark, Kafka, Airflow, LangGraph, RAG, MCP, OpenAI API, and full-stack ownership from ingestion to production AI services.
Credentials
B.Sc. in Computer Science, York University | Microsoft Certified: Azure Solutions Architect Expert | Project Management Professional (PMP).
Beyond Code
Open-source contributor, avid gamer, and builder of assistant tools that blend computer vision, machine learning, and gameplay analytics.
Skills
AI & LLM Engineering
Backend & System Design
Cloud & DevOps
Programming Languages
Data & ML Infrastructure
Full-Stack Development
Experience
CGI Inc.
Lead AI/ML Engineer (Tech Lead), AI Center of Excellence
- LangGraph dual-state agent for a proprietary coding language: orchestrated vLLM-served LLMs with retrieval, EBNF structured outputs, and PII filtering to automate documentation, business-rule extraction, and migration readiness; targeting a 50% labor reduction for support teams.
Data Architect and Engineering Lead, Senior Consultant
- CGI Impact Award 2024: Architected and shipped an LLM-powered enterprise RAG chatbot (LangChain across Azure, AWS, GCP) delivering about 90% autonomous resolution for a 50k+ user base and 525% ROI in 12 months.
- Built an OpenLineage-based lineage tool that auto-documents extensible multi-source flows with LLM-assisted root-cause suggestions, shrinking investigation cycles across three incidents per month.
- Automated AxiomSL rule updates with an LLM copilot for RBC, cutting manual updates by 80% while preserving governance controls.
Data Engineer, Consultant
- Managed the largest automobile-insurance policy-history database in Canada (2 TB on SQL Server) with 99.9% uptime and median queries under 200 ms.
- Delivered production ETL and ELT pipelines for 22+ micro data marts (each under six tables with monthly SLAs), trimming report runtimes from 30 minutes to 5 minutes.
- Shipped a Gradient-Boost plus Logistic Regression loss-ratio model that recovered fraud losses 321% versus a top-5 bank benchmark through a Python and T-SQL pipeline.
Elder Laboratory, York University
Research Assistant, Computer Vision (Part-Time)
- Developed AttentiveVision, a computer-vision pipeline for hockey broadcast video in Python (PyTorch, TensorFlow, OpenCV), training a CNN-LSTM stack with mean average precision above 70% on event detection across 18 hours of game footage.
- Engineered a real-time player tracking system using a Kalman-filter tracker that delivered under 100 ms latency at 30 FPS, capturing pose and movement for live speed and heat-map analytics.
- Created an event-detection annotation loop (goal, timeout, penalty) with active learning that cut manual labeling effort and raised dataset reliability to Cohen's kappa of 0.92.
Nascent Digital
Full-Stack Developer Intern
- Built end-to-end features with React, TypeScript, and Node.js; contributed major components to company-wide homepage rewrite.
- Shipped TELUS product pages by partnering with UX/QA/DevOps and creating reusable React components plus CI/CD hooks; finished design-to-deploy in 7 days.
- Introduced automated visual-regression testing with Nightwatch.js and Puppeteer; removed ~90% of manual QA checks.
Education
B.Sc. in Computer Science (Specialized Honours)
York University, Toronto
- Graduated with GPA 3.9/4.0
- First Class with Distinction
Certifications
Microsoft Certified: Azure Solutions Architect Expert
Microsoft
Issued: 2022
Google Cloud Professional Cloud Developer
Issued: 2023