Efficient Nlp

The KV Cache: Memory Usage in Transformers Efficient NLP 55,675 1 год назад
The Most Accurate Speech-to-text APIs in 2025 Efficient NLP 1,484 1 месяц назад
How is Beam Search Really Implemented? Efficient NLP 16,915 1 год назад
Speculative Decoding: When Two LLMs are Faster than One Efficient NLP 19,132 1 год назад
Rotary Positional Embeddings: Combining Absolute and Relative Efficient NLP 46,426 1 год назад
Can Whisper be used for real-time streaming ASR? Efficient NLP 19,240 11 месяцев назад
5.Modern NLP Transformers and Large Language Models, 5.5 Parameter efficient fine tuning PEFT 2024. Artificial Intelligence - All in One 14 8 месяцев назад
Efficient Language Models: finding your optimal architecture Robert Monarch 124 4 года назад
Trends in Model Size & Computational Efficiency in NLP HuggingFace 1,375 4 года назад
The Architecture of Chrome Extension Permissions Efficient NLP 977 5 месяцев назад