Abstract:
In this paper, energy saving based on transformers with LeakyReLU attention mechanisms is discussed. Softmax functions in the attention mechanisms of transformers are replaced by LeakyReLU functions, which include ReLU functions as special cases. The goal is to explore possible transformer architectures with reduced computational complexity for saving electrical energy in the inference phase. Theoretical analysis based on a general-purpose computing model shows that, under the given conditions and assumptions, the worst-case time complexities for computing attention in transformers rank, from lowest to highest, as ReLU, LeakyReLU, and softmax. In particular, as shown in experimental results on language translation and deterministic network flow aggregation tasks, transformers with ReLU (LeakyReLU with negative slope 0) and LeakyReLU (negative slope 0.1) attention consume less average computation time in the inference phase than transformers with softmax attention. The theoretical and experimental results show that transformers with LeakyReLU attention may save energy in language translation and deterministic networking tasks.
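For concreteness, the following is a minimal PyTorch sketch of the substitution the abstract describes, assuming a standard scaled dot-product attention formulation; the abstract does not state whether the paper normalizes the LeakyReLU scores, so the activation is applied directly to the raw scores here.

```python
import torch
import torch.nn.functional as F

def softmax_attention(q, k, v):
    # Standard scaled dot-product attention with softmax normalization.
    scores = q @ k.transpose(-2, -1) / (q.size(-1) ** 0.5)
    return F.softmax(scores, dim=-1) @ v

def leaky_relu_attention(q, k, v, negative_slope=0.1):
    # LeakyReLU replaces softmax on the attention scores; with
    # negative_slope=0.0 this reduces to ReLU attention.
    # NOTE: any normalization of the resulting weights is an assumption
    # not specified in the abstract and is omitted here.
    scores = q @ k.transpose(-2, -1) / (q.size(-1) ** 0.5)
    weights = F.leaky_relu(scores, negative_slope=negative_slope)
    return weights @ v

# Example usage with hypothetical shapes (batch, tokens, dim).
q = k = v = torch.randn(2, 8, 16)
out = leaky_relu_attention(q, k, v)
```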
Date of Conference: 08-14 December 2023
Date Added to IEEE Xplore: 29 December 2023