Discussion about this post

User's avatar
Tirtharaj Mukherjee's avatar

Nice article Boris. Thanks for putting it up

Expand full comment
George's avatar

how much did you play with rwkv ? To my eyes it's so much more elegant than transformers but I've been disappointed by it on "real" issue around data extraction. It seems to be much more stochastic parroty, but maybe it boils down to the training more so than anything.

Expand full comment
2 more comments...

No posts