FINCH: Prompt-guided Key-Value Cache Compression for Large Language Models

Authors

Giulio Corallo

Paolo Papotti