Researchers at Tsinghua University and Z.ai built IndexCache to eliminate redundant computation in sparse attention models ...