It has been shown that the simple pre-training task of predicting which caption goes with which image is an efficient and scalable way to learn state-of-the-art (SOTA) image representations from scratch.
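One way to make the caption-matching task concrete is the rough sketch below. The sentence above names only the task, so everything else here is an assumption for illustration: the symmetric cross-entropy loss over a similarity matrix, the `temperature` value, and the hypothetical names `matching_loss` and `predict_captions`. The embeddings are plain lists standing in for the outputs of real image and text encoders.

```python
import math


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def normalize(v):
    """Scale a vector to unit length so dot products become cosine similarities."""
    norm = math.sqrt(dot(v, v))
    return [x / norm for x in v]


def cross_entropy(logits, target):
    """Negative log-softmax of the target entry (numerically stable)."""
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(x - m) for x in logits))
    return log_sum - logits[target]


def matching_loss(image_vecs, caption_vecs, temperature=0.1):
    """Assumed objective: image k should match caption k, and vice versa.

    Averages the image-to-caption and caption-to-image cross-entropies.
    """
    images = [normalize(v) for v in image_vecs]
    captions = [normalize(v) for v in caption_vecs]
    n = len(images)
    # sims[r][c] is the scaled cosine similarity of image r and caption c.
    sims = [[dot(i, c) / temperature for c in captions] for i in images]
    img_to_cap = sum(cross_entropy(sims[k], k) for k in range(n)) / n
    cap_to_img = sum(
        cross_entropy([sims[r][k] for r in range(n)], k) for k in range(n)
    ) / n
    return (img_to_cap + cap_to_img) / 2


def predict_captions(image_vecs, caption_vecs):
    """For each image, return the index of the most similar caption."""
    images = [normalize(v) for v in image_vecs]
    captions = [normalize(v) for v in caption_vecs]
    return [
        max(range(len(captions)), key=lambda j: dot(i, captions[j]))
        for i in images
    ]


# Toy embeddings (made up): caption k points roughly the same way as image k.
toy_images = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
toy_captions = [[0.9, 0.1, 0.0], [0.1, 0.9, 0.0], [0.0, 0.1, 0.9]]
```

Calling `matching_loss(toy_images, toy_captions)` gives a lower value when the pairs are correctly aligned than when the caption list is reordered, which is the signal a training loop would minimize.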