Tunadorable

Exploring Learning Dynamics in Concept Space

Exploring Learning Dynamics in Concept Space Tunadorable 730 2,433 11 месяцев назад

Models inside models inside models

Models inside models inside models Tunadorable 710 2,367 9 месяцев назад

Accelerated Training by Amplifying Slow Gradients

Accelerated Training by Amplifying Slow Gradients Tunadorable 10K 32,816 1 год назад

Why Are Neural Network Loss Landscapes So Weirdly Connected?

Why Are Neural Network Loss Landscapes So Weirdly Connected? Tunadorable 756 2,520 1 год назад

How to make neural networks better at learning new things

How to make neural networks better at learning new things Tunadorable 612 2,041 4 месяца назад

Let's Build Llama 3 From Scratch, in Code, Spelled Out

Let's Build Llama 3 From Scratch, in Code, Spelled Out Tunadorable 2K 7,824 1 год назад

Can LLMs Learn by Teaching Other LLMs?

Can LLMs Learn by Teaching Other LLMs? Tunadorable 493 1,642 10 месяцев назад

The Structured Task Hypothesis

The Structured Task Hypothesis Tunadorable 557 1,856 11 месяцев назад

GPT2 is AS GOOD as Neuroscientists at Predicting Research Results?!

GPT2 is AS GOOD as Neuroscientists at Predicting Research Results?! Tunadorable 453 1,509 1 год назад

what’s the definition of intelligence? #ai #intelligence https://arxiv.org/pdf/2312.09546v1.pdf

what’s the definition of intelligence? #ai #intelligence https://arxiv.org/pdf/2312.09546v1.pdf Tunadorable 117 391 1 год назад

Do we really need to use every single transformer layer?

Do we really need to use every single transformer layer? Tunadorable 695 2,317 9 месяцев назад

neurosymbolic AI is silly #ai #neurosymbolicai https://arxiv.org/pdf/2401.01040.pdf

neurosymbolic AI is silly #ai #neurosymbolicai https://arxiv.org/pdf/2401.01040.pdf Tunadorable 712 2,373 1 год назад

Training an LLM tokenizer on GPUs with a lot of data (better than Karpathy!!)

Training an LLM tokenizer on GPUs with a lot of data (better than Karpathy!!) Tunadorable 259 863 1 месяц назад

Emergent Abilities of LLMs #ai #emergence https://arxiv.org/pdf/2206.07682.pdf

Emergent Abilities of LLMs #ai #emergence https://arxiv.org/pdf/2206.07682.pdf Tunadorable 153 510 1 год назад

#ai #machinelearning #arxiv https://arxiv.org/pdf/2305.18741.pdf

#ai #machinelearning #arxiv https://arxiv.org/pdf/2305.18741.pdf Tunadorable 20 67 1 год назад

What would it mean for an AI to "understand"?

What would it mean for an AI to "understand"? Tunadorable 1K 3,774 10 месяцев назад

#ai #machinelearning #artificialintelligence https://arxiv.org/pdf/2307.06945.pdf

#ai #machinelearning #artificialintelligence https://arxiv.org/pdf/2307.06945.pdf Tunadorable 8 25 1 год назад

What does AI have to do with Plato's Allegory of the Cave?

What does AI have to do with Plato's Allegory of the Cave? Tunadorable 2K 7,179 1 год назад

MaskMoE: Forcing rare tokens to only use one expert

MaskMoE: Forcing rare tokens to only use one expert Tunadorable 361 1,202 10 месяцев назад

Let's build Google's Gemma: from scratch, in code, spelled out

Let's build Google's Gemma: from scratch, in code, spelled out Tunadorable 974 3,247 1 год назад

Tunadorable. Смотреть видео: Exploring Learning Dynamics In Concept Space, Models Inside Models Inside Models, Accelerated Training By Amplifying Slow Gradients, Why Are Neural Network Loss Landscapes So Weirdly Connected.