Frayed RoPE and Long Inputs: A Geometric Perspective. Davis Wertheimer, Aozhong Zhang, et al. 2026. ICLR 2026. Conference paper.
Universal Position Interpolation: Unified Context Scaling for Hybrid Mamba-Transformer Models. Haochen Shen, Davis Wertheimer, et al. 2026. ICLR 2026. Conference paper.
Is Finer Better? The Limits of Microscaling Formats in Large Language Models. Andrea Fasoli, Monodeep Kar, et al. 2026. ICLR 2026. Conference paper.
Advancing Fluorescence Light Detection and Ranging in Scattering Media with a Physics-Guided Mixture-of-Experts and Evidential Critics. Ismail Erbas, Ferhat Demikiran, et al. 2025. NeurIPS 2025. Workshop paper.
Accelerating LLM Inference via Dynamic KV Cache Placement in Heterogeneous Memory System. Yunhua Fang, Rui Xie, et al. 2025. IEEE Computer Architecture Letters. Paper.
Generative AI Through CAS Lens: An Integrated Overview of Algorithmic Optimizations, Architectural Advances, and Automated Designs. Chuan Zhang, You You, et al. 2025. IEEE JESTCS. Paper.
Compressed Decentralized Momentum Stochastic Gradient Methods for Nonconvex Optimization. Wei Liu, Anweshit Panda, et al. 2025. TMLR. Paper.
CLoQ: Enhancing Fine-Tuning of Quantized LLMs via Calibrated LoRA Initialization. Yanxia Deng, Aozhong Zhang, et al. 2025. TMLR. Paper.
COMQ: A Backpropagation-Free Algorithm for Post-Training Quantization. Aozhong Zhang, Zi Yang, et al. 2025. IEEE Access. Paper.
Compressing Recurrent Neural Networks for FPGA-accelerated Implementation in Fluorescence Lifetime Imaging. Ismail Erbas, Vikas Pandey, et al. 2024. NeurIPS 2024. Workshop paper.
30 Oct 2017. US9806615. On-chip DC-DC Power Converters with Fully Integrated GaN Power Switches, Silicon CMOS Transistors and Magnetic Inductors.
16 Oct 2017. US9793336. High Resistivity Iron-based, Thermally Stable Magnetic Material for On-chip Integrated Inductors.