Felix Hirwa Nshuti

MS student in Electrical and Computer Engineering at Carnegie Mellon University (CMU).

About

I am Felix Hirwa Nshuti, an MS student in Electrical and Computer Engineering at Carnegie Mellon University (CMU). My work resides at the critical intersection of computer architecture, optimizing compilers, and theoretical machine learning. I focus on building next-generation ML infrastructure by integrating low-level systems engineering with high-level mathematical abstractions, specifically leveraging convex optimization to close the gap between algorithmic intent and architectural execution.

My research and development efforts center on compiler infrastructure for AI workloads, exploring how intermediate representations (IR) can effectively transform computation graphs into hardware-aware schedules. Additionally, I specialize in ML-driven signal processing, where I apply tensor-level optimizations to ensure predictable performance and scalability in real-time inference environments.

As an active open-source contributor and maintainer, I am dedicated to community collaboration and the development of verifiable, efficient AI systems. Previously, I completed my Bachelor's in Computer Science and Engineering at Pandit Deendayal Energy University (PDEU), focusing on programming languages, compilers, and machine learning.

Research Interests

In my free time, I play football and share memes with friends.

Research

Accepted Papers

Accepted Conference Talks

Selected Projects

Enhancing Prophet with Question-Aware Captioning for Knowledge-Based VQA

Knowledge-Based Visual Question Answering — Python, PyTorch, Transformers, NLP, CV, VQA — Aug 2025 to Dec 2025

  • Improved performance of knowledge-based VQA systems by integrating question-aware captioning.
  • Built a module that generates contextually relevant captions to improve knowledge retrieval.

Context-Aware Demand Forecasting in Pittsburgh's Bike Share System

Time Series Forecasting with Contextual Features — Python, Pandas, NumPy, XGBoost, CatBoost, statsmodels — Aug 2025 to Dec 2025

  • Incorporated temporal, weather, and event-based features to enhance prediction accuracy.
  • Applied advanced time series analysis to capture complex usage patterns.

TransISA — Lightweight CISC-to-RISC Transpiler

Compiler Design and Architecture Translation — LLVM, x86, AArch64, C++, IR, CFG — Dec 2024 to May 2025

  • Built a compiler pipeline using the LLVM C++ API to translate x86 to AArch64 via LLVM IR.
  • Extracted and analyzed CFGs to support instruction-level translation.

Scaling Deep Learning Backends with sktime

Machine Learning with Time Series — Python, PyTorch, TensorFlow, Darts, CI/CD — May 2024 to Aug 2024

  • Added new classification models to sktime using PyTorch (GRU and GRU-FCNN Classifiers).
  • Migrated classifier models from legacy sktime-dl to the sktime main repository.
  • Implemented the modular interface of darts regression models in sktime.