Publications

publications by categories in reversed chronological order. generated by jekyll-scholar.

2025

  1. ArXiv
    Ensuring Fair LLM Serving Amid Diverse Applications
    Redwan Ibne Seraj Khan, Kunal Jain, Haiying Shen, and 12 more authors
    In ArXiv, 2025
  2. Under Review
    Intelligent Heterogeneous Resource Configuration Framework for Transformer-based Inference Models
    Hadeel Albahar, Redwan Ibne Seraj Khan, Shruti Dongare, and 3 more authors
    In Review, 2025
  3. Under Review
    Reinforcement Learning-based Adaptive Scheduling for ML Workloads
    Shruti Dongare, Redwan Ibne Seraj Khan, Hadeel Albahar, and 2 more authors
    In Review, 2025
  4. Under Preparation
    Enhancing LLM training through Strategic Tensor Offloading across Multiple Memory Tiers
    Sabiha Afroz, Redwan Ibne Seraj Khan, Hadeel Albahar, and 2 more authors
    In Preparation, 2025

2024

  1. ACM SoCC 2024
    FedCaSe: Enhancing Federated Learning with Heterogeneity-aware Caching and Scheduling
    Redwan Ibne Seraj Khan, Arnab K. Paul, Xun Jian, and 2 more authors
    In ACM Symposium on Cloud Computing, Nov 2024

2023

  1. USENIX FAST’23
    SHADE: Enable Fundamental Cacheability for Distributed Deep Learning Training
    Redwan Ibne Seraj Khan, Ahmad Hossein Yazdani, Yuqi Fu, and 5 more authors
    In 21st USENIX Conference on File and Storage Technologies (FAST 23), Feb 2023

2020

  1. IEEE CLOUD’20
    On the use of containers in high performance computing environments
    Subil Abraham, Arnab K Paul, Redwan Ibne Seraj Khan, and 1 more author
    In 2020 IEEE 13th International Conference on Cloud Computing (CLOUD), Feb 2020