ESACT: An End-to-End Sparse Accelerator for Compute-Intensive Transformers via Local Similarity
The article introduces ESACT, an end-to-end sparse accelerator designed to enhance the efficiency of compute-intensive Transformers by leveraging local similarity for sparse acceleration. This approach addresses the limitations of existing accelerators that primarily focus on intra-row sparsity and often incur high computational costs.