Tired: loss minimalists. Wired: loss maximalists.
by @sharifshameem :)

@theartofmadeline
NASA

ellievsbear

oozey mess
hello vonnie
One Nice Bug Per Day

Origami Around

Kaledo Art
$LAYYYTER
"I'm Dorothy Gale from Kansas"
RMH

Product Placement
2025 on Tumblr: Trends That Defined the Year
Mike Driver
styofa doing anything
art blog(derogatory)
I'd rather be in outer space 🛸
trying on a metaphor
Lint Roller? I Barely Know Her
cherry valley forever

seen from United States
seen from United Kingdom

seen from TĂĽrkiye
seen from Netherlands
seen from Lithuania
seen from Brazil

seen from Brazil

seen from Brazil

seen from United States
seen from United States

seen from United States

seen from United States

seen from United States

seen from China
seen from United States
seen from United States

seen from Indonesia
seen from Bangladesh

seen from Romania
seen from United States
@lossfunctions
Tired: loss minimalists. Wired: loss maximalists.
by @sharifshameem :)

Anya is live and ready to show you everything. Watch her strip, dance, and perform exclusive shows just for you. Interact in real-time and make your fantasies come true.
Free to watch • No registration required • HD streaming
Datasets with a data loader without a shuffle after each epoch? Generously contributed by @richardgalvez.
A highly amusing specimen from @_karfly . Truly baffles the mind.
“The Snek”, a gracious contribution from @TheReibel and Vicki :)
Evades diagnosis. Graciously contributed by @bleyddyn.

Anya is live and ready to show you everything. Watch her strip, dance, and perform exclusive shows just for you. Interact in real-time and make your fantasies come true.
Free to watch • No registration required • HD streaming
Another loss function contributed by Ray Zhang. Diagnosis impossible.
A heart rate or a loss function? :)
This one of a custom implementation of an RNN, graciously contributed by Ray Zhang.
Blue: baseline. Red: attempt to create a new architecture :D
Contributed by Hyun Jae Kim.
An educational post! We’re looking at the validation accuracy of a model as a function of dropout we train with. This trend is consistent with my overall experience: models with dropout train faster, but models with higher dropout win eventually. The dropout of one model is quite extreme (0.85), but it is gaining on the others! What’s going to happen as we train longer? #soexciting
Spatial Transformer Network identifying right whales, L2 reg and loss plot.
 Contributed by ‏@robibok

Anya is live and ready to show you everything. Watch her strip, dance, and perform exclusive shows just for you. Interact in real-time and make your fantasies come true.
Free to watch • No registration required • HD streaming
"the slow start", contributed by Tom White.
This RNN smoothly forgets everything it has learned. God knows what happened. Contributed by Jeremy, as seen on his blog post https://jblkacademic.wordpress.com/2015/09/02/find-your-dream-job/
Taming Spatial Transformer Networks, contributed by Diogo. For the record, it’s not supposed to look like that.
A nasty-looking plateau. Sometimes. Contributed by @Luke_Metz .
“One survivor” contributed by Taco. Beautiful overfitting curves exhibiting exotic non-U shapes

Anya is live and ready to show you everything. Watch her strip, dance, and perform exclusive shows just for you. Interact in real-time and make your fantasies come true.
Free to watch • No registration required • HD streaming
Ah, the Sharp Corner Loss (SCL). Bad initialization a prime suspect.
A beautiful rainbow of learning! This code is definitely bug free. Learning rate decay might be slightly too high.