<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[Featherless AI - recursive dev blog]]></title><description><![CDATA[Development blog for Featherless AI from the same folks working on the RWKV open source project]]></description><link>https://substack.recursal.ai</link><image><url>https://substackcdn.com/image/fetch/$s_!RY89!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F655b233b-f955-4a8e-b220-6e4f392736ef_160x160.png</url><title>Featherless AI - recursive dev blog</title><link>https://substack.recursal.ai</link></image><generator>Substack</generator><lastBuildDate>Mon, 13 Apr 2026 19:27:50 GMT</lastBuildDate><atom:link href="https://substack.recursal.ai/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Recursal AI]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[featherless@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[featherless@substack.com]]></itunes:email><itunes:name><![CDATA[Featherless AI - dev blog]]></itunes:name></itunes:owner><itunes:author><![CDATA[Featherless AI - dev blog]]></itunes:author><googleplay:owner><![CDATA[featherless@substack.com]]></googleplay:owner><googleplay:email><![CDATA[featherless@substack.com]]></googleplay:email><googleplay:author><![CDATA[Featherless AI - dev blog]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[The world's most reliable AI agent that actually works - where Claude, Gemini, and o3 fail]]></title><description><![CDATA[That will do the boring chores in life, for you]]></description><link>https://substack.recursal.ai/p/the-worlds-most-reliable-ai-agent</link><guid isPermaLink="false">https://substack.recursal.ai/p/the-worlds-most-reliable-ai-agent</guid><dc:creator><![CDATA[Eugene Cheah]]></dc:creator><pubDate>Fri, 23 May 2025 22:20:13 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c943464-192f-4899-90b6-ec3e9fcd74cd_4128x2320.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div id="youtube2-hgXiwwMtDJg" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;hgXiwwMtDJg&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/hgXiwwMtDJg?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><p>Most companies and use cases do not care about PhD-level capabilities. They just want an AI model to depend on with absolute reliability, for the simple tasks in life.</p><p>Imagine a highly competent, dependable, reliable assistant, who does all your chores&#8211;instead of the caffeine-overloaded genius, who sometimes works when it &#8220;feels like it&#8221;--but otherwise causes more harm than good half the time.</p><p>That's what we just built with the <a href="http://featherless.ai">Featherless.ai</a> Action-R1 model &amp; agent, which achieved SotA (State-of-the-Art) in the REAL (Realistic Evaluations for Agents Leaderboard) benchmark.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!MOo2!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c943464-192f-4899-90b6-ec3e9fcd74cd_4128x2320.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!MOo2!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c943464-192f-4899-90b6-ec3e9fcd74cd_4128x2320.png 424w, https://substackcdn.com/image/fetch/$s_!MOo2!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c943464-192f-4899-90b6-ec3e9fcd74cd_4128x2320.png 848w, https://substackcdn.com/image/fetch/$s_!MOo2!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c943464-192f-4899-90b6-ec3e9fcd74cd_4128x2320.png 1272w, https://substackcdn.com/image/fetch/$s_!MOo2!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c943464-192f-4899-90b6-ec3e9fcd74cd_4128x2320.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!MOo2!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c943464-192f-4899-90b6-ec3e9fcd74cd_4128x2320.png" width="1456" height="818" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3c943464-192f-4899-90b6-ec3e9fcd74cd_4128x2320.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:818,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:564424,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://substack.recursal.ai/i/164269885?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c943464-192f-4899-90b6-ec3e9fcd74cd_4128x2320.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!MOo2!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c943464-192f-4899-90b6-ec3e9fcd74cd_4128x2320.png 424w, https://substackcdn.com/image/fetch/$s_!MOo2!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c943464-192f-4899-90b6-ec3e9fcd74cd_4128x2320.png 848w, https://substackcdn.com/image/fetch/$s_!MOo2!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c943464-192f-4899-90b6-ec3e9fcd74cd_4128x2320.png 1272w, https://substackcdn.com/image/fetch/$s_!MOo2!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c943464-192f-4899-90b6-ec3e9fcd74cd_4128x2320.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">realevals.xyz score benchmark</figcaption></figure></div><p>Our <a href="https://www.realevals.xyz/details/92a67e60-bb83-4be9-9422-e93746dc13a0">AI agent (test result at link)</a> achieved a 65% success rate on the REAL benchmark. This makes it nearly 50% better than <a href="https://www.realevals.xyz/details/074c427c-18b2-4c34-8443-b569644fff72">Anthropic computer use (result link)</a>, the next best model &amp; framework by a major lab, at 42%.</p><div><hr></div><h1><strong>What is the REAL benchmark?</strong></h1><p>What is interesting about the REAL benchmark is that it tests over 110 practical real-world tasks. These are not university-level knowledge tasks, but real-world chores that reflect what people do online every day, like booking flights, organizing and replying emails, and shopping for groceries. Basically, office desk work.</p><p>These are tested using a controlled test replica, which mirrors 11 major websites, including Airbnb, Amazon, Gmail, LinkedIn, and Uber.</p><p><a href="https://arxiv.org/abs/2504.11543">The paper for the benchmark can be found here</a>.</p><div><hr></div><h1>Why does reliability matter?</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!UkEE!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F981d61e5-ee7d-434e-8cfd-b141238c335e_590x405.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!UkEE!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F981d61e5-ee7d-434e-8cfd-b141238c335e_590x405.png 424w, https://substackcdn.com/image/fetch/$s_!UkEE!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F981d61e5-ee7d-434e-8cfd-b141238c335e_590x405.png 848w, https://substackcdn.com/image/fetch/$s_!UkEE!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F981d61e5-ee7d-434e-8cfd-b141238c335e_590x405.png 1272w, https://substackcdn.com/image/fetch/$s_!UkEE!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F981d61e5-ee7d-434e-8cfd-b141238c335e_590x405.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!UkEE!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F981d61e5-ee7d-434e-8cfd-b141238c335e_590x405.png" width="590" height="405" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/981d61e5-ee7d-434e-8cfd-b141238c335e_590x405.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:405,&quot;width&quot;:590,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:35032,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://substack.recursal.ai/i/164269885?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F981d61e5-ee7d-434e-8cfd-b141238c335e_590x405.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!UkEE!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F981d61e5-ee7d-434e-8cfd-b141238c335e_590x405.png 424w, https://substackcdn.com/image/fetch/$s_!UkEE!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F981d61e5-ee7d-434e-8cfd-b141238c335e_590x405.png 848w, https://substackcdn.com/image/fetch/$s_!UkEE!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F981d61e5-ee7d-434e-8cfd-b141238c335e_590x405.png 1272w, https://substackcdn.com/image/fetch/$s_!UkEE!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F981d61e5-ee7d-434e-8cfd-b141238c335e_590x405.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Side by Side reliability comparisons for top use cases </figcaption></figure></div><p>Frontier AI agents have been plateauing at around 43% overall task completion. We hit 65%, leapfrogging it by 22 percentage points.</p><p>More importantly, on certain sites, such as the Omnizon (aka Amazon clone), our agent reached a 100% success rate, while the next best managed 60%.</p><p><strong>And the 99%+ distinction matters.</strong></p><p>Because you can now rely on this model and agent for tasks in this domain.</p><p>If a model is only at 50% success rate at a category of task, you spend more time and energy &#8220;babysitting&#8221; the models. In several cases, the model will require human intervention which can take more bandwidth than the task itself.</p><p>Raising reliability means that you can hand off tasks to the AI, eliminating the frustration many workers face with AI in the enterprise.</p><p>For the first time, we have built an AI agent you can rely on for tasks within a handful of platforms with 99%+ success. We will eventually expand to 99%+ reliability for all web platforms.</p><div><hr></div><h1><strong>How we did it</strong></h1><p>We partnered with the team at UI-licious, who automate end-to-end testing for their customers with AI at scale, building on their experience and knowledge in UI test automation and their proprietary PetaByte scale dataset which they have built up over the years on UI testing.</p><p>Together, we co-built a specialized AI action model and agent harness. This helps the AI understand instructions and how to navigate complex and dynamic UIs.</p><p>As a bonus, these AI agents are not just navigating and completing tasks. They are finding and issuing bug reports, which we will be following up with the eval maintainers on &#128521;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!mEar!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d87a3df-ce97-44b8-b59d-5b742c09f02c_5312x2986.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!mEar!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d87a3df-ce97-44b8-b59d-5b742c09f02c_5312x2986.png 424w, https://substackcdn.com/image/fetch/$s_!mEar!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d87a3df-ce97-44b8-b59d-5b742c09f02c_5312x2986.png 848w, https://substackcdn.com/image/fetch/$s_!mEar!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d87a3df-ce97-44b8-b59d-5b742c09f02c_5312x2986.png 1272w, https://substackcdn.com/image/fetch/$s_!mEar!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d87a3df-ce97-44b8-b59d-5b742c09f02c_5312x2986.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!mEar!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d87a3df-ce97-44b8-b59d-5b742c09f02c_5312x2986.png" width="1456" height="818" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7d87a3df-ce97-44b8-b59d-5b742c09f02c_5312x2986.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:818,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1244065,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://substack.recursal.ai/i/164269885?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d87a3df-ce97-44b8-b59d-5b742c09f02c_5312x2986.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!mEar!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d87a3df-ce97-44b8-b59d-5b742c09f02c_5312x2986.png 424w, https://substackcdn.com/image/fetch/$s_!mEar!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d87a3df-ce97-44b8-b59d-5b742c09f02c_5312x2986.png 848w, https://substackcdn.com/image/fetch/$s_!mEar!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d87a3df-ce97-44b8-b59d-5b742c09f02c_5312x2986.png 1272w, https://substackcdn.com/image/fetch/$s_!mEar!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d87a3df-ce97-44b8-b59d-5b742c09f02c_5312x2986.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Hopefully will be passed to another AI to fix the bugs!</figcaption></figure></div><div><hr></div><h1>The bigger picture: attacking multi-billion dollar markets</h1><p>The <a href="https://www.statista.com/statistics/1309384/worldwide-rpa-software-market-size">$3B+ Robotic Process Automation (RPA) market</a> and the <a href="https://www.datainsightsmarket.com/reports/ui-test-automation-software-1947276">$20B+ UI testing marke</a>t are dominated by brittle tools that break easily and require constant maintenance, where AI agents achieve about 60% reliability.</p><p>Today&#8217;s RPA tools like UIPath operate like glorified macros, breaking when user interfaces change or when variability is introduced. Businesses spend millions maintaining these systems, fixing broken RPA scripts.</p><p>Agentic AI is different. It learns. It adapts. It interacts with interfaces the way humans do: observing, interpreting, and acting based on context. It allows automation over dynamic interfaces.</p><p>And we&#8217;re just getting started.</p><p>As we scale our AI research in the open source space, we will be working alongside industry partners to build reliable AI Agents and models for their own industry specific use, with their own proprietary datasets.</p><p>So we will be increasing reliability, not just in automated UI testing, but in all future domains as well, with industry partners from accounting to legal.</p><p>Raising reliability means unlocking the 90% of AI projects that fail to enter production within enterprises, as the AI was just &#8220;not reliable enough&#8221; for day-to-day office tasks.</p><p>Raising reliability means bringing into production your AI agent into the market.</p><p>If you are interested in partnering with us, and making your AI more SoTA-reliable, please reach out to us.</p><div><hr></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!JA52!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c2f0d50-05ac-4b35-873c-4b82b6b6f078_782x584.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!JA52!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c2f0d50-05ac-4b35-873c-4b82b6b6f078_782x584.png 424w, https://substackcdn.com/image/fetch/$s_!JA52!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c2f0d50-05ac-4b35-873c-4b82b6b6f078_782x584.png 848w, https://substackcdn.com/image/fetch/$s_!JA52!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c2f0d50-05ac-4b35-873c-4b82b6b6f078_782x584.png 1272w, https://substackcdn.com/image/fetch/$s_!JA52!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c2f0d50-05ac-4b35-873c-4b82b6b6f078_782x584.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!JA52!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c2f0d50-05ac-4b35-873c-4b82b6b6f078_782x584.png" width="610" height="455.54987212276217" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1c2f0d50-05ac-4b35-873c-4b82b6b6f078_782x584.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:584,&quot;width&quot;:782,&quot;resizeWidth&quot;:610,&quot;bytes&quot;:74206,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://substack.recursal.ai/i/164269885?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6fd60670-1a7d-4740-9efe-75b10e6c6285_782x614.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!JA52!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c2f0d50-05ac-4b35-873c-4b82b6b6f078_782x584.png 424w, https://substackcdn.com/image/fetch/$s_!JA52!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c2f0d50-05ac-4b35-873c-4b82b6b6f078_782x584.png 848w, https://substackcdn.com/image/fetch/$s_!JA52!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c2f0d50-05ac-4b35-873c-4b82b6b6f078_782x584.png 1272w, https://substackcdn.com/image/fetch/$s_!JA52!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c2f0d50-05ac-4b35-873c-4b82b6b6f078_782x584.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h1>How can we try this new action model &amp; agent?</h1><p>Look out for a follow-up launch announcement with our design partners at UI-licious.</p><p>This AI action model and agent will be co-launched on both platforms.</p><p>For early access, you can sign up here: <br><a href="https://forms.gle/wxwQ2z12xf1KzFvPA">https://forms.gle/wxwQ2z12xf1KzFvPA</a></p><p>Priority access will be given to portfolio companies among our investors and users.</p><p></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://substack.recursal.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Featherless AI - recursive dev blog! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[RADLADS: Dropping the cost of AI architecture experiment by 250x]]></title><description><![CDATA[Unlocking and accelerating the next wave of AI architecture research]]></description><link>https://substack.recursal.ai/p/radlads-dropping-the-cost-of-ai-architecture</link><guid isPermaLink="false">https://substack.recursal.ai/p/radlads-dropping-the-cost-of-ai-architecture</guid><dc:creator><![CDATA[Eugene Cheah]]></dc:creator><pubDate>Mon, 12 May 2025 18:13:32 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/8b0ab9da-e776-4944-9881-fc754d946fa3_1024x1024.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Why do most large AI research labs swear by scaling and avoid architecture research?</p><ul><li><p><strong>What works small often fails big</strong> &#8212; Architectural innovations that show promise at 1M parameters may break down at 1B or 50B.</p></li><li><p><strong>Validating at scale is expensive</strong> &#8212; Training from scratch to test a new architecture at meaningful scale can cost at least $5&#8211;10M.</p></li><li><p><strong>High risk, uncertain reward</strong> &#8212; You&#8217;re just as likely to degrade performance as improve it&#8212;making architecture exploration financially unsustainable for most labs.</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://images.unsplash.com/photo-1554768803-2ae381da5645?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHw0fHxidXJuJTIwbW9uZXl8ZW58MHx8fHwxNzQ2OTM1MDI0fDA&amp;ixlib=rb-4.1.0&amp;q=80&amp;w=1080" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://images.unsplash.com/photo-1554768803-2ae381da5645?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHw0fHxidXJuJTIwbW9uZXl8ZW58MHx8fHwxNzQ2OTM1MDI0fDA&amp;ixlib=rb-4.1.0&amp;q=80&amp;w=1080 424w, https://images.unsplash.com/photo-1554768803-2ae381da5645?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHw0fHxidXJuJTIwbW9uZXl8ZW58MHx8fHwxNzQ2OTM1MDI0fDA&amp;ixlib=rb-4.1.0&amp;q=80&amp;w=1080 848w, https://images.unsplash.com/photo-1554768803-2ae381da5645?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHw0fHxidXJuJTIwbW9uZXl8ZW58MHx8fHwxNzQ2OTM1MDI0fDA&amp;ixlib=rb-4.1.0&amp;q=80&amp;w=1080 1272w, https://images.unsplash.com/photo-1554768803-2ae381da5645?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHw0fHxidXJuJTIwbW9uZXl8ZW58MHx8fHwxNzQ2OTM1MDI0fDA&amp;ixlib=rb-4.1.0&amp;q=80&amp;w=1080 1456w" sizes="100vw"><img src="https://images.unsplash.com/photo-1554768803-2ae381da5645?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHw0fHxidXJuJTIwbW9uZXl8ZW58MHx8fHwxNzQ2OTM1MDI0fDA&amp;ixlib=rb-4.1.0&amp;q=80&amp;w=1080" width="5149" height="2191" data-attrs="{&quot;src&quot;:&quot;https://images.unsplash.com/photo-1554768803-2ae381da5645?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHw0fHxidXJuJTIwbW9uZXl8ZW58MHx8fHwxNzQ2OTM1MDI0fDA&amp;ixlib=rb-4.1.0&amp;q=80&amp;w=1080&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:2191,&quot;width&quot;:5149,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;burning banknotes&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="burning banknotes" title="burning banknotes" srcset="https://images.unsplash.com/photo-1554768803-2ae381da5645?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHw0fHxidXJuJTIwbW9uZXl8ZW58MHx8fHwxNzQ2OTM1MDI0fDA&amp;ixlib=rb-4.1.0&amp;q=80&amp;w=1080 424w, https://images.unsplash.com/photo-1554768803-2ae381da5645?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHw0fHxidXJuJTIwbW9uZXl8ZW58MHx8fHwxNzQ2OTM1MDI0fDA&amp;ixlib=rb-4.1.0&amp;q=80&amp;w=1080 848w, https://images.unsplash.com/photo-1554768803-2ae381da5645?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHw0fHxidXJuJTIwbW9uZXl8ZW58MHx8fHwxNzQ2OTM1MDI0fDA&amp;ixlib=rb-4.1.0&amp;q=80&amp;w=1080 1272w, https://images.unsplash.com/photo-1554768803-2ae381da5645?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHw0fHxidXJuJTIwbW9uZXl8ZW58MHx8fHwxNzQ2OTM1MDI0fDA&amp;ixlib=rb-4.1.0&amp;q=80&amp;w=1080 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Money Burning Photo - by <a href="true">Jp Valery</a> on <a href="https://unsplash.com">Unsplash</a></figcaption></figure></div><p>Training a state-of-the-art language model from scratch costs roughly $5-10M&#8212;just to validate a new attention mechanism, recurrence scheme, or memory system.</p><p>From our team experience, it typically takes 20&#8211;80 architecture iterations to achieve a 10%+ improvement. We've done this four times over the past two years.</p><p>For most AI labs, that level of experimentation would cost around $250 million in research GPU time. From that perspective, it's often more rational to invest in scaling model parameters and datasets for a near-guaranteed performance gain of ~10%.</p><p>At Featherless, we believe this bottleneck in architecture validation has slowed progress&#8212;not only in capabilities but in reliability.</p><p>But what if the cost to validate an architecture dropped from $5 million to $20K?</p><p>With that same $250 million, we could run over 12,500 iterations, uncovering 100+ architecture improvements, each with 10%+ gains. Compounded, that&#8217;s a theoretical 1,378,000% improvement in performance.</p><p>That&#8217;s why we&#8217;re excited about RADLADS.</p><div><hr></div><h1><strong>Introducing RADLADS</strong></h1><p>RADLADS (Rapid Attention Distillation to Linear Attention Decoders at Scale) is a new method for converting massive transformer models (e.g., Qwen-72B) into new AI models with alternative attention mechanisms&#8212;at a fraction of the original training cost.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!XWoX!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F585341ab-6138-41ce-ab1f-2ea2578bec2c_701x461.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!XWoX!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F585341ab-6138-41ce-ab1f-2ea2578bec2c_701x461.png 424w, https://substackcdn.com/image/fetch/$s_!XWoX!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F585341ab-6138-41ce-ab1f-2ea2578bec2c_701x461.png 848w, https://substackcdn.com/image/fetch/$s_!XWoX!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F585341ab-6138-41ce-ab1f-2ea2578bec2c_701x461.png 1272w, https://substackcdn.com/image/fetch/$s_!XWoX!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F585341ab-6138-41ce-ab1f-2ea2578bec2c_701x461.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!XWoX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F585341ab-6138-41ce-ab1f-2ea2578bec2c_701x461.png" width="701" height="461" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/585341ab-6138-41ce-ab1f-2ea2578bec2c_701x461.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:461,&quot;width&quot;:701,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!XWoX!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F585341ab-6138-41ce-ab1f-2ea2578bec2c_701x461.png 424w, https://substackcdn.com/image/fetch/$s_!XWoX!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F585341ab-6138-41ce-ab1f-2ea2578bec2c_701x461.png 848w, https://substackcdn.com/image/fetch/$s_!XWoX!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F585341ab-6138-41ce-ab1f-2ea2578bec2c_701x461.png 1272w, https://substackcdn.com/image/fetch/$s_!XWoX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F585341ab-6138-41ce-ab1f-2ea2578bec2c_701x461.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><ul><li><p>Total cost: $2,000&#8211;$20,000</p></li><li><p>Tokens used: ~500 million</p></li><li><p>Training time: A few days on accessible cloud GPUs (8&#215; MI300)</p></li><li><p>Cost reduction: ~250&#215; reduction in the cost of scientific experimentation</p></li></ul><p>Instead of training from scratch, we convert existing models to new attention architectures in three steps:</p><ol><li><p>Align hidden states between the original transformer and the target attention architecture</p></li><li><p>Distill output behavior (logits) from the original model</p></li><li><p>Fine-tune for long-context performance</p></li></ol><p>You can read about the process details from our paper review on <a href="https://huggingface.co/papers/2505.03005">huggingface</a>  and <a href="https://www.arxiv.org/abs/2505.03005">arxiv</a>. This is the same technique that allowed us to train our latest 72B<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> attention-free, with only 8 GPU&#8217;s.</p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;bd373744-7aac-4094-b1f5-1042c1742031&quot;,&quot;caption&quot;:&quot;We are proud to announce the updated QRWKV-72B and 32B.&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;&#129727;QRWKV-72B and 32B : Training large attention free models, with only 8 GPU's&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:99170118,&quot;name&quot;:&quot;Eugene Cheah&quot;,&quot;bio&quot;:&quot;Builds Attention-Free Transformer AI models (http://wiki.rwkv.com) from scratch, CEO @ featherless.ai (prv recursal.ai) - Also known for k8s infra &amp; UI testing tools, webapps, and GPU.js, Hot-takes/Views are my own&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8dcb57e-6203-4be3-ae29-03732db5c5f7_460x460.jpeg&quot;,&quot;is_guest&quot;:true,&quot;bestseller_tier&quot;:null,&quot;primaryPublicationSubscribeUrl&quot;:&quot;https://substack.tech-talk-cto.com/subscribe?&quot;,&quot;primaryPublicationUrl&quot;:&quot;https://substack.tech-talk-cto.com&quot;,&quot;primaryPublicationName&quot;:&quot;Tech Talk CTO&quot;,&quot;primaryPublicationId&quot;:1004639}],&quot;post_date&quot;:&quot;2025-03-24T17:30:34.860Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/$s_!iyqS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F525f7c7c-725d-4174-b2f7-7d3c6ccc3c04_2743x1773.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://substack.recursal.ai/p/qwerky-72b-and-32b-training-large&quot;,&quot;section_name&quot;:&quot;RWKV News&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:159379897,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:14,&quot;comment_count&quot;:3,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;Featherless AI - recursive dev blog&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!RY89!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F655b233b-f955-4a8e-b220-6e4f392736ef_160x160.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!2mkT!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F56983e6d-be9a-4e39-b8c7-95b1ab746cd4_1024x1024.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!2mkT!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F56983e6d-be9a-4e39-b8c7-95b1ab746cd4_1024x1024.png 424w, https://substackcdn.com/image/fetch/$s_!2mkT!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F56983e6d-be9a-4e39-b8c7-95b1ab746cd4_1024x1024.png 848w, https://substackcdn.com/image/fetch/$s_!2mkT!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F56983e6d-be9a-4e39-b8c7-95b1ab746cd4_1024x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!2mkT!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F56983e6d-be9a-4e39-b8c7-95b1ab746cd4_1024x1024.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!2mkT!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F56983e6d-be9a-4e39-b8c7-95b1ab746cd4_1024x1024.png" width="296" height="296" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/56983e6d-be9a-4e39-b8c7-95b1ab746cd4_1024x1024.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1024,&quot;width&quot;:1024,&quot;resizeWidth&quot;:296,&quot;bytes&quot;:2410734,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://substack.recursal.ai/i/163307360?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F56983e6d-be9a-4e39-b8c7-95b1ab746cd4_1024x1024.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!2mkT!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F56983e6d-be9a-4e39-b8c7-95b1ab746cd4_1024x1024.png 424w, https://substackcdn.com/image/fetch/$s_!2mkT!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F56983e6d-be9a-4e39-b8c7-95b1ab746cd4_1024x1024.png 848w, https://substackcdn.com/image/fetch/$s_!2mkT!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F56983e6d-be9a-4e39-b8c7-95b1ab746cd4_1024x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!2mkT!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F56983e6d-be9a-4e39-b8c7-95b1ab746cd4_1024x1024.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h1>What does this mean for research?</h1><p>RADLADS is already changing how we explore AI architecture. We can now:</p><ul><li><p>Rapidly test novel attention mechanisms and hybrid designs</p></li><li><p>Iterate on model structures in days, not months</p></li><li><p>Validate alignment and interpretability hypotheses at scale</p></li></ul><p>This isn&#8217;t just about RWKV&#8212;it opens doors for advancing Transformers, State Space models, xLSTMs, and architectures yet to be imagined. Its about accelerating our pace of research.</p><p>And we&#8217;re not doing it alone. Since announcing our work, we've collaborated with other researchers to validate multiple attention mechanisms, including Transformer-based variants.</p><blockquote><p><em>Reach out to us if you have any attention alternative your research team or university lab is working on and looking to validate in collaboration.</em></p></blockquote><p>It&#8217;s all part of our mission to make personalized reliable AI &#8212; and eventually AGI &#8212; a reality</p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;bdbc8ded-382b-4c87-9cce-8cdf1701c321&quot;,&quot;caption&quot;:&quot;If you want to find out more about the latest Qwerky model, that makes all of this possible, it is recommended to read this first:&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;&#128739;&#65039; Our roadmap to Personalized AI and AGI&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:99170118,&quot;name&quot;:&quot;Eugene Cheah&quot;,&quot;bio&quot;:&quot;Builds Attention-Free Transformer AI models (http://wiki.rwkv.com) from scratch, CEO @ featherless.ai (prv recursal.ai) - Also known for k8s infra &amp; UI testing tools, webapps, and GPU.js, Hot-takes/Views are my own&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8dcb57e-6203-4be3-ae29-03732db5c5f7_460x460.jpeg&quot;,&quot;is_guest&quot;:true,&quot;bestseller_tier&quot;:null,&quot;primaryPublicationSubscribeUrl&quot;:&quot;https://substack.tech-talk-cto.com/subscribe?&quot;,&quot;primaryPublicationUrl&quot;:&quot;https://substack.tech-talk-cto.com&quot;,&quot;primaryPublicationName&quot;:&quot;Tech Talk CTO&quot;,&quot;primaryPublicationId&quot;:1004639}],&quot;post_date&quot;:&quot;2025-03-24T17:40:14.797Z&quot;,&quot;cover_image&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/139429cb-892d-41ef-9894-5ca461ef1510_1646x1058.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://substack.recursal.ai/p/our-roadmap-to-personalized-ai-and&quot;,&quot;section_name&quot;:&quot;RWKV News&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:159708830,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:4,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;Featherless AI - recursive dev blog&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F655b233b-f955-4a8e-b220-6e4f392736ef_160x160.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="pullquote"><p><strong>One more thing:</strong><br>QRWKV2, based on the RWKV architecture &amp; Qwen 3 models, is already training...</p><p><strong>Translation:</strong><br>A linear GPT-4o class text model is on its way...<br>After that its O1, and O3 class</p></div><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://substack.recursal.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Featherless AI - recursive dev blog! Subscribe for free to receive new posts and support our work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>This model was originally published as Qwerky-72B. However, due to confusion with another similar naming company/model, we have been requested to avoid using the Qwerky name, so we have renamed our models to QRWKV-72B</p><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[🛣️ Our roadmap to Personalized AI and AGI]]></title><description><![CDATA[We will not train a trillion parameter model: A <100B active parameters is all you need.]]></description><link>https://substack.recursal.ai/p/our-roadmap-to-personalized-ai-and</link><guid isPermaLink="false">https://substack.recursal.ai/p/our-roadmap-to-personalized-ai-and</guid><dc:creator><![CDATA[Eugene Cheah]]></dc:creator><pubDate>Mon, 24 Mar 2025 17:40:14 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/139429cb-892d-41ef-9894-5ca461ef1510_1646x1058.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>If you want to find out more about the latest QRWKV<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> model, that makes all of this possible, it is recommended to read this first:</p><div class="embedded-post-wrap" data-attrs="{&quot;id&quot;:159379897,&quot;url&quot;:&quot;https://substack.recursal.ai/p/qwerky-72b-and-32b-training-large&quot;,&quot;publication_id&quot;:2073186,&quot;publication_name&quot;:&quot;Featherless AI - recursive dev blog&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!RY89!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F655b233b-f955-4a8e-b220-6e4f392736ef_160x160.png&quot;,&quot;title&quot;:&quot;&#129727;QRWKV-72B and 32B : Training large attention free models, with only 8 GPU's&quot;,&quot;truncated_body_text&quot;:&quot;We are proud to announce the updated QRWKV-72B and 32B.&quot;,&quot;date&quot;:&quot;2025-03-24T17:30:34.860Z&quot;,&quot;like_count&quot;:14,&quot;comment_count&quot;:3,&quot;bylines&quot;:[{&quot;id&quot;:99170118,&quot;name&quot;:&quot;Eugene Cheah&quot;,&quot;handle&quot;:&quot;techtalkcto&quot;,&quot;previous_name&quot;:&quot;PicoCreator&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8dcb57e-6203-4be3-ae29-03732db5c5f7_460x460.jpeg&quot;,&quot;bio&quot;:&quot;Builds Attention-Free Transformer AI models (http://wiki.rwkv.com) from scratch, CEO @ featherless.ai (prv recursal.ai) - Also known for k8s infra &amp; UI testing tools, webapps, and GPU.js, Hot-takes/Views are my own&quot;,&quot;profile_set_up_at&quot;:&quot;2022-07-17T02:17:11.664Z&quot;,&quot;reader_installed_at&quot;:null,&quot;twitter_screen_name&quot;:&quot;picocreator&quot;,&quot;is_guest&quot;:true,&quot;bestseller_tier&quot;:null,&quot;primaryPublicationId&quot;:1004639,&quot;primaryPublicationName&quot;:&quot;Tech Talk CTO&quot;,&quot;primaryPublicationUrl&quot;:&quot;https://substack.tech-talk-cto.com&quot;,&quot;primaryPublicationSubscribeUrl&quot;:&quot;https://substack.tech-talk-cto.com/subscribe?&quot;}],&quot;utm_campaign&quot;:null,&quot;belowTheFold&quot;:false,&quot;type&quot;:&quot;newsletter&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="EmbeddedPostToDOM"><a class="embedded-post" native="true" href="https://substack.recursal.ai/p/qwerky-72b-and-32b-training-large?utm_source=substack&amp;utm_campaign=post_embed&amp;utm_medium=web"><div class="embedded-post-header"><img class="embedded-post-publication-logo" src="https://substackcdn.com/image/fetch/$s_!RY89!,w_56,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F655b233b-f955-4a8e-b220-6e4f392736ef_160x160.png"><span class="embedded-post-publication-name">Featherless AI - recursive dev blog</span></div><div class="embedded-post-title-wrapper"><div class="embedded-post-title">&#129727;QRWKV-72B and 32B : Training large attention free models, with only 8 GPU's</div></div><div class="embedded-post-body">We are proud to announce the updated QRWKV-72B and 32B&#8230;</div><div class="embedded-post-cta-wrapper"><span class="embedded-post-cta">Read more</span></div><div class="embedded-post-meta">a year ago &#183; 14 likes &#183; 3 comments &#183; Eugene Cheah</div></a></div><p>In the past quarter, we have seen the following breakthroughs in open-source</p><ul><li><p>GPT-4o Mini class open models (deepseek, qwen-32b) on our platform</p></li><li><p>72B Attention-Free QRWKV model built on 8 GPUs</p></li><li><p>Both with more knowledge and capability than an average human</p></li><li><p>A tipping point with RWKV v6 and v7 improvements</p></li></ul><p>As such, my prediction (Eugene Cheah) for 2025 is that we are at the inflection point. </p><ul><li><p>Where we scale towards model reliability, with better memories</p></li><li><p>Instead of scaling a trillion parameters</p></li></ul><div><hr></div><h1>Our vision for AGI in summary</h1><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;e4b0675a-c277-4f83-bdb9-80325f9b3203&quot;,&quot;duration&quot;:null}"></div><blockquote><p>The video above, is the condensed version of our vision for personalized AI &amp; AGI<br>The writing below, is the longer form with some additional details</p></blockquote><div><hr></div><h2>The scaling wall for bigger models</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!aqC1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fafec588c-225c-4e56-952b-f733448cfa72_4116x2356.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!aqC1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fafec588c-225c-4e56-952b-f733448cfa72_4116x2356.png 424w, https://substackcdn.com/image/fetch/$s_!aqC1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fafec588c-225c-4e56-952b-f733448cfa72_4116x2356.png 848w, https://substackcdn.com/image/fetch/$s_!aqC1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fafec588c-225c-4e56-952b-f733448cfa72_4116x2356.png 1272w, https://substackcdn.com/image/fetch/$s_!aqC1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fafec588c-225c-4e56-952b-f733448cfa72_4116x2356.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!aqC1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fafec588c-225c-4e56-952b-f733448cfa72_4116x2356.png" width="1456" height="833" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/afec588c-225c-4e56-952b-f733448cfa72_4116x2356.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:833,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:4482172,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://substack.recursal.ai/i/159708830?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fafec588c-225c-4e56-952b-f733448cfa72_4116x2356.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!aqC1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fafec588c-225c-4e56-952b-f733448cfa72_4116x2356.png 424w, https://substackcdn.com/image/fetch/$s_!aqC1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fafec588c-225c-4e56-952b-f733448cfa72_4116x2356.png 848w, https://substackcdn.com/image/fetch/$s_!aqC1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fafec588c-225c-4e56-952b-f733448cfa72_4116x2356.png 1272w, https://substackcdn.com/image/fetch/$s_!aqC1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fafec588c-225c-4e56-952b-f733448cfa72_4116x2356.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: <a href="https://futurism.com/ai-researchers-tech-industry-dead-end">[Futurism Link]</a>, <a href="https://techcrunch.com/2025/01/23/metas-yann-lecun-predicts-a-new-ai-architectures-paradigm-within-5-years-and-decade-of-robotics/">[Techcrunch Link]</a></figcaption></figure></div><p>The problem with scaling today is within the fundamental promise is for a step-increased improvement in capability, for every 10x in parameter size.</p><p>While it has remain somewhat true, that there is a step-up improvement in capability. <a href="https://arxiv.org/abs/2411.13055">The problem is in the diminishing returns, on both training and inference costs.</a></p><p>We are now within Billion&#8217;s of investment dollar range for the current GPT 4.5 model, with no clear answer if OpenAI Super AGI goal is achievable with another 10x, or 100x, or 1000x. After which, we are starting to talk in Trillions of dollars. Just for training. With even more dollars required to actually run the model.</p><p>While this path might still make sense, potentially for &#8220;Super AGI&#8221;.</p><p>It makes no sense for &#8220;Human-level AGI&#8221;. Because &#8230;.</p><div><hr></div><h2>Short Context AGI - is already here (sort of)</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!_udp!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb42b0738-1bab-427d-b29f-5f9b33295273_1024x529.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!_udp!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb42b0738-1bab-427d-b29f-5f9b33295273_1024x529.jpeg 424w, https://substackcdn.com/image/fetch/$s_!_udp!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb42b0738-1bab-427d-b29f-5f9b33295273_1024x529.jpeg 848w, https://substackcdn.com/image/fetch/$s_!_udp!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb42b0738-1bab-427d-b29f-5f9b33295273_1024x529.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!_udp!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb42b0738-1bab-427d-b29f-5f9b33295273_1024x529.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!_udp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb42b0738-1bab-427d-b29f-5f9b33295273_1024x529.jpeg" width="1024" height="529" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b42b0738-1bab-427d-b29f-5f9b33295273_1024x529.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:529,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!_udp!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb42b0738-1bab-427d-b29f-5f9b33295273_1024x529.jpeg 424w, https://substackcdn.com/image/fetch/$s_!_udp!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb42b0738-1bab-427d-b29f-5f9b33295273_1024x529.jpeg 848w, https://substackcdn.com/image/fetch/$s_!_udp!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb42b0738-1bab-427d-b29f-5f9b33295273_1024x529.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!_udp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb42b0738-1bab-427d-b29f-5f9b33295273_1024x529.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Character AI, showcases a vast array of unique personalities</figcaption></figure></div><p>For the vast majority of users on applications like Character.AI, within that context window of approximately 8,000 tokens.</p><p>These AI characters, within the window, is AGI, with perhaps some flaws, but a unique character experience regardless. It&#8217;s what drive the 28 Million plus users to keep engaging with the platform.</p><p>The limits are however clear</p><ul><li><p>How they are unable to reliably follow instructions within memory</p></li><li><p>Nor are they able to gracefully handle memories beyond their context length</p></li></ul><div><hr></div><h2>The lack of reliable AI memory</h2><div class="pullquote"><p>What holds back AI (and AI agents) is &#8230;<br><strong>The lack of reliable understanding, in memories</strong></p></div><p>Because here lies an irony, these models are no doubt knowledgeable and capable.</p><p>Today&#8217;s best models for both open source (ie. DeepSeek R1) and close source, for example, is no-doubt capable of doing PhD level math and physics, with some degree of reliability (let say 1-out-of-30 times)</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!QulY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff1fed6aa-cf38-4c87-b843-48dd57029510_2007x1313.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!QulY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff1fed6aa-cf38-4c87-b843-48dd57029510_2007x1313.png 424w, https://substackcdn.com/image/fetch/$s_!QulY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff1fed6aa-cf38-4c87-b843-48dd57029510_2007x1313.png 848w, https://substackcdn.com/image/fetch/$s_!QulY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff1fed6aa-cf38-4c87-b843-48dd57029510_2007x1313.png 1272w, https://substackcdn.com/image/fetch/$s_!QulY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff1fed6aa-cf38-4c87-b843-48dd57029510_2007x1313.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!QulY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff1fed6aa-cf38-4c87-b843-48dd57029510_2007x1313.png" width="593" height="388.13804945054943" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f1fed6aa-cf38-4c87-b843-48dd57029510_2007x1313.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:953,&quot;width&quot;:1456,&quot;resizeWidth&quot;:593,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!QulY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff1fed6aa-cf38-4c87-b843-48dd57029510_2007x1313.png 424w, https://substackcdn.com/image/fetch/$s_!QulY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff1fed6aa-cf38-4c87-b843-48dd57029510_2007x1313.png 848w, https://substackcdn.com/image/fetch/$s_!QulY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff1fed6aa-cf38-4c87-b843-48dd57029510_2007x1313.png 1272w, https://substackcdn.com/image/fetch/$s_!QulY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff1fed6aa-cf38-4c87-b843-48dd57029510_2007x1313.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Grok, demonstrating its ability to calculate Mars to Earth orbital transfer window</figcaption></figure></div><p>But yet they lack the reliability to do basic college-level task (30-out-of-30 times), be it as a cashier, or any simple agent needing to do a long multi step process.</p><p>This is also known, as the &#8220;Compounding AI Agent Error&#8221; problem</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!GfzF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff942a0b9-608a-4fc0-ab86-4372bfb6f6ec_1101x677.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!GfzF!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff942a0b9-608a-4fc0-ab86-4372bfb6f6ec_1101x677.png 424w, https://substackcdn.com/image/fetch/$s_!GfzF!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff942a0b9-608a-4fc0-ab86-4372bfb6f6ec_1101x677.png 848w, https://substackcdn.com/image/fetch/$s_!GfzF!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff942a0b9-608a-4fc0-ab86-4372bfb6f6ec_1101x677.png 1272w, https://substackcdn.com/image/fetch/$s_!GfzF!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff942a0b9-608a-4fc0-ab86-4372bfb6f6ec_1101x677.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!GfzF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff942a0b9-608a-4fc0-ab86-4372bfb6f6ec_1101x677.png" width="560" height="344.3415077202543" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f942a0b9-608a-4fc0-ab86-4372bfb6f6ec_1101x677.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:677,&quot;width&quot;:1101,&quot;resizeWidth&quot;:560,&quot;bytes&quot;:242164,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://substack.recursal.ai/i/159708830?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff942a0b9-608a-4fc0-ab86-4372bfb6f6ec_1101x677.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!GfzF!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff942a0b9-608a-4fc0-ab86-4372bfb6f6ec_1101x677.png 424w, https://substackcdn.com/image/fetch/$s_!GfzF!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff942a0b9-608a-4fc0-ab86-4372bfb6f6ec_1101x677.png 848w, https://substackcdn.com/image/fetch/$s_!GfzF!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff942a0b9-608a-4fc0-ab86-4372bfb6f6ec_1101x677.png 1272w, https://substackcdn.com/image/fetch/$s_!GfzF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff942a0b9-608a-4fc0-ab86-4372bfb6f6ec_1101x677.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Even Google DeepMind Founder and CEO, Demis Hassabis, was warning about the compounding AI error problem, just a few days ago at the Google Vertex Event.</figcaption></figure></div><p>And fixing the reliability problem does not need a bigger model. We already have proof of this as observed across our platform&#8230;</p><div><hr></div><h2>How reliability is being solved in production today?</h2><p>One of the interesting patterns we have observed on our platform is the workload differences between individual and scaled-up commercial use cases at featherless</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!x7Cd!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d092e0c-11b1-4153-a9d2-5dcd00be8e32_4116x1660.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!x7Cd!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d092e0c-11b1-4153-a9d2-5dcd00be8e32_4116x1660.png 424w, https://substackcdn.com/image/fetch/$s_!x7Cd!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d092e0c-11b1-4153-a9d2-5dcd00be8e32_4116x1660.png 848w, https://substackcdn.com/image/fetch/$s_!x7Cd!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d092e0c-11b1-4153-a9d2-5dcd00be8e32_4116x1660.png 1272w, https://substackcdn.com/image/fetch/$s_!x7Cd!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d092e0c-11b1-4153-a9d2-5dcd00be8e32_4116x1660.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!x7Cd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d092e0c-11b1-4153-a9d2-5dcd00be8e32_4116x1660.png" width="724" height="291.8873626373626" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8d092e0c-11b1-4153-a9d2-5dcd00be8e32_4116x1660.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:587,&quot;width&quot;:1456,&quot;resizeWidth&quot;:724,&quot;bytes&quot;:1984523,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://substack.recursal.ai/i/159708830?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d092e0c-11b1-4153-a9d2-5dcd00be8e32_4116x1660.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!x7Cd!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d092e0c-11b1-4153-a9d2-5dcd00be8e32_4116x1660.png 424w, https://substackcdn.com/image/fetch/$s_!x7Cd!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d092e0c-11b1-4153-a9d2-5dcd00be8e32_4116x1660.png 848w, https://substackcdn.com/image/fetch/$s_!x7Cd!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d092e0c-11b1-4153-a9d2-5dcd00be8e32_4116x1660.png 1272w, https://substackcdn.com/image/fetch/$s_!x7Cd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d092e0c-11b1-4153-a9d2-5dcd00be8e32_4116x1660.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Featherless.ai individual users model usage chart - by finetune, and by model class</figcaption></figure></div><p>For example, in the above, we show, how our individual users are running thousands of fine-tuned model - with a bias towards the top models like the DeepSeek R1, or LLaMA3 70B as expected.</p><p>But this dramatically changes when we see the models running by commercial users in production at scale.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!yEin!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe6947f66-40d5-41c4-8f45-9bc87f8522c6_3844x3844.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!yEin!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe6947f66-40d5-41c4-8f45-9bc87f8522c6_3844x3844.png 424w, https://substackcdn.com/image/fetch/$s_!yEin!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe6947f66-40d5-41c4-8f45-9bc87f8522c6_3844x3844.png 848w, https://substackcdn.com/image/fetch/$s_!yEin!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe6947f66-40d5-41c4-8f45-9bc87f8522c6_3844x3844.png 1272w, https://substackcdn.com/image/fetch/$s_!yEin!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe6947f66-40d5-41c4-8f45-9bc87f8522c6_3844x3844.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!yEin!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe6947f66-40d5-41c4-8f45-9bc87f8522c6_3844x3844.png" width="470" height="470" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e6947f66-40d5-41c4-8f45-9bc87f8522c6_3844x3844.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1456,&quot;width&quot;:1456,&quot;resizeWidth&quot;:470,&quot;bytes&quot;:2675503,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://substack.recursal.ai/i/159708830?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe6947f66-40d5-41c4-8f45-9bc87f8522c6_3844x3844.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!yEin!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe6947f66-40d5-41c4-8f45-9bc87f8522c6_3844x3844.png 424w, https://substackcdn.com/image/fetch/$s_!yEin!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe6947f66-40d5-41c4-8f45-9bc87f8522c6_3844x3844.png 848w, https://substackcdn.com/image/fetch/$s_!yEin!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe6947f66-40d5-41c4-8f45-9bc87f8522c6_3844x3844.png 1272w, https://substackcdn.com/image/fetch/$s_!yEin!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe6947f66-40d5-41c4-8f45-9bc87f8522c6_3844x3844.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Commercial scaled-up production workload</figcaption></figure></div><p>The graph is not a mistake, the vast majority of the production workload we see at scale (by request or token count). Is the 1-year-old mistral Nemo-12b (including both base and finetuned)</p><ul><li><p>Not the latest model</p></li><li><p>Not the biggest model</p></li><li><p>That is good enough, for predictable prompt engineering or finetuning</p></li></ul><p>This is consistent with AI systems and agents in production at scale, where either or both of the following solutions are used.</p><ul><li><p><strong>AI Engineering:</strong> large problems are scoped, and broken into smaller tasks, solvable via prompt engineering</p></li><li><p><strong>Finetuning:</strong> specialized domain-specific model is finetuned for specific tasks</p></li></ul><p>In both cases, significant engineering effort is required </p><ul><li><p>to design the AI agent, and its workflow, with prompt engineering</p></li><li><p>to calibrate the dataset for finetuning, requiring dedicated specialized talent. Due to the limitations of <strong>catastrophic forgetting</strong></p></li></ul><p>The later, is typically reserved for larger teams, due to the difficulty in scaling the resources required to &#8220;get it right&#8221;. Due to the highly trial-and-error nature of the process involved in both tasks.</p><blockquote><p><strong>Catastrophic forgetting: Is the challenge faced by all existing model, where existing knowledge and capabilities is lost. While adding in new knowledge.</strong></p></blockquote><div class="pullquote"><p>For reliability at scale, it&#8217;s less about model size &#8230;<br>And more about, designing the task, and breaking it into reliable parts.<br><br>Prompt Engineering for initial results, and PMF (Product Market Fit), <br>Finetuning for longer-term reliability.</p></div><h2>How RWKV memory module can solve this</h2><p>If all AI models are already &#8220;capable&#8221; enough to potentially handle the vast majority of commercial tasks. Especially when they are finetuned for high reliability.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!8UNC!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa25d14e7-30f3-4b0a-aa9a-31b3a7613f85_610x676.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!8UNC!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa25d14e7-30f3-4b0a-aa9a-31b3a7613f85_610x676.png 424w, https://substackcdn.com/image/fetch/$s_!8UNC!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa25d14e7-30f3-4b0a-aa9a-31b3a7613f85_610x676.png 848w, https://substackcdn.com/image/fetch/$s_!8UNC!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa25d14e7-30f3-4b0a-aa9a-31b3a7613f85_610x676.png 1272w, https://substackcdn.com/image/fetch/$s_!8UNC!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa25d14e7-30f3-4b0a-aa9a-31b3a7613f85_610x676.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!8UNC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa25d14e7-30f3-4b0a-aa9a-31b3a7613f85_610x676.png" width="402" height="445.49508196721314" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a25d14e7-30f3-4b0a-aa9a-31b3a7613f85_610x676.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:676,&quot;width&quot;:610,&quot;resizeWidth&quot;:402,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;RWKV-V4 language modeling architecture&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="RWKV-V4 language modeling architecture" title="RWKV-V4 language modeling architecture" srcset="https://substackcdn.com/image/fetch/$s_!8UNC!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa25d14e7-30f3-4b0a-aa9a-31b3a7613f85_610x676.png 424w, https://substackcdn.com/image/fetch/$s_!8UNC!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa25d14e7-30f3-4b0a-aa9a-31b3a7613f85_610x676.png 848w, https://substackcdn.com/image/fetch/$s_!8UNC!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa25d14e7-30f3-4b0a-aa9a-31b3a7613f85_610x676.png 1272w, https://substackcdn.com/image/fetch/$s_!8UNC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa25d14e7-30f3-4b0a-aa9a-31b3a7613f85_610x676.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>A recurrent model, with memories being part of its core design. Dramatically simplify the process of &#8220;finetuning&#8221;. In particular around the topic of catastrophic forgetting</p><p>For example, it is possible to &#8220;memory tune&#8221; the memory module state (Time Mix, in the diagram), which by default is left as blank in our base models. </p><p>This will not induce any <strong>Catastrophic forgetting, </strong>as none of the initial model weights is modified. A process we been testing, and implementing for a few pilot customers.</p><p>Recurrent models, also by their nature/design, is trained to handle memories, much closer to how we humans do so. Which makes it naturally more scalable for solving the reliable memory problem transformers face.</p><p>It also brings about multiple additional benefits, such as 100x lower inference cost (due to its linear scaling nature).</p><div class="pullquote"><p>But more importantly, it&#8217;s not about what it can do today.<br>But what we can build tomorrow, iterating on this technology path.</p></div><div><hr></div><h2>We are by no means perfect today (or ever), <br>it is why we iterate, more than anyone else</h2><p>The RWKV team, is arguably the only team today, who has been consistently be making step function improvements in our architecture every half a year.</p><p>This can be best seen by our model performance chart across the past 2 years, from version 4 to version 7.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!I995!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74af0ec4-4d11-4578-b7c5-9e67bb015e3a_1702x1204.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!I995!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74af0ec4-4d11-4578-b7c5-9e67bb015e3a_1702x1204.png 424w, https://substackcdn.com/image/fetch/$s_!I995!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74af0ec4-4d11-4578-b7c5-9e67bb015e3a_1702x1204.png 848w, https://substackcdn.com/image/fetch/$s_!I995!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74af0ec4-4d11-4578-b7c5-9e67bb015e3a_1702x1204.png 1272w, https://substackcdn.com/image/fetch/$s_!I995!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74af0ec4-4d11-4578-b7c5-9e67bb015e3a_1702x1204.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!I995!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74af0ec4-4d11-4578-b7c5-9e67bb015e3a_1702x1204.png" width="576" height="407.4725274725275" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/74af0ec4-4d11-4578-b7c5-9e67bb015e3a_1702x1204.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1030,&quot;width&quot;:1456,&quot;resizeWidth&quot;:576,&quot;bytes&quot;:333254,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://substack.recursal.ai/i/159708830?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74af0ec4-4d11-4578-b7c5-9e67bb015e3a_1702x1204.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!I995!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74af0ec4-4d11-4578-b7c5-9e67bb015e3a_1702x1204.png 424w, https://substackcdn.com/image/fetch/$s_!I995!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74af0ec4-4d11-4578-b7c5-9e67bb015e3a_1702x1204.png 848w, https://substackcdn.com/image/fetch/$s_!I995!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74af0ec4-4d11-4578-b7c5-9e67bb015e3a_1702x1204.png 1272w, https://substackcdn.com/image/fetch/$s_!I995!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74af0ec4-4d11-4578-b7c5-9e67bb015e3a_1702x1204.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>A process that we intend to accelerate, moving forward.</p><div><hr></div><h2>How to achieve Personalized AI </h2><p>With the appropriate resource support. We expect to iterate into a reliable version of personalized AI within the next year or two.</p><p>We define this, as being able to &#8220;Memory Tune&#8221; without <strong>Catastrophic forgetting, </strong>and with high accuracy in narrow tasks. With 100 Million tokens or less. Into production use cases.</p><p>Quickly, and without the need for highly specialized professionals.</p><p>This is not considered &#8220;AGI&#8221;, because it will be constrained up to what it was &#8220;designed to be trained on&#8221;. As it would not permanently be learning new knowledge.</p><p>However, this is not an issue for most commercial use cases, as it would become the workhorse that drives the AI agent adoption cycle.</p><div><hr></div><h2>And make it &#8594; Personalized AGI</h2><p>Once personalized AI is mastered, it would allow us to focus on the next step - to automate the process for the model to collect and reflect on their &#8220;day-to-day&#8221; interaction. For further memory tuning &#8220;in the night&#8221;.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!N0Kr!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9366943b-0f04-4780-88d5-4ab55c6fe42c_4116x2446.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!N0Kr!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9366943b-0f04-4780-88d5-4ab55c6fe42c_4116x2446.png 424w, https://substackcdn.com/image/fetch/$s_!N0Kr!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9366943b-0f04-4780-88d5-4ab55c6fe42c_4116x2446.png 848w, https://substackcdn.com/image/fetch/$s_!N0Kr!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9366943b-0f04-4780-88d5-4ab55c6fe42c_4116x2446.png 1272w, https://substackcdn.com/image/fetch/$s_!N0Kr!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9366943b-0f04-4780-88d5-4ab55c6fe42c_4116x2446.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!N0Kr!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9366943b-0f04-4780-88d5-4ab55c6fe42c_4116x2446.png" width="1456" height="865" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9366943b-0f04-4780-88d5-4ab55c6fe42c_4116x2446.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:865,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:852389,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://substack.recursal.ai/i/159708830?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9366943b-0f04-4780-88d5-4ab55c6fe42c_4116x2446.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!N0Kr!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9366943b-0f04-4780-88d5-4ab55c6fe42c_4116x2446.png 424w, https://substackcdn.com/image/fetch/$s_!N0Kr!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9366943b-0f04-4780-88d5-4ab55c6fe42c_4116x2446.png 848w, https://substackcdn.com/image/fetch/$s_!N0Kr!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9366943b-0f04-4780-88d5-4ab55c6fe42c_4116x2446.png 1272w, https://substackcdn.com/image/fetch/$s_!N0Kr!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9366943b-0f04-4780-88d5-4ab55c6fe42c_4116x2446.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>For most part, this would be done incrementally, instead of a giant leap.</p><p>This is due to how its an extension of the Personalized AI memory tuning process. </p><p>The main differentiator, however, is we would now allow the model to decide on new knowledge and information from its day to day experiences to &#8220;learn from&#8221;.</p><p>However, we do not expect this automated tuning process to be infinitely stable. Due to potential memory loss in between past a certain scale. Be it a billion or a trillion tokens.</p><p>However, what this will simply translate to, is a lifespan for a continuous Personal AGI agent. One that can span days, to weeks, months, and eventually years for each iterative improvement.</p><div><hr></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://images.unsplash.com/photo-1470252649378-9c29740c9fa8?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHw0fHxzdW5yaXNlfGVufDB8fHx8MTc0Mjc3NjMzNXww&amp;ixlib=rb-4.0.3&amp;q=80&amp;w=1080" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://images.unsplash.com/photo-1470252649378-9c29740c9fa8?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHw0fHxzdW5yaXNlfGVufDB8fHx8MTc0Mjc3NjMzNXww&amp;ixlib=rb-4.0.3&amp;q=80&amp;w=1080 424w, https://images.unsplash.com/photo-1470252649378-9c29740c9fa8?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHw0fHxzdW5yaXNlfGVufDB8fHx8MTc0Mjc3NjMzNXww&amp;ixlib=rb-4.0.3&amp;q=80&amp;w=1080 848w, https://images.unsplash.com/photo-1470252649378-9c29740c9fa8?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHw0fHxzdW5yaXNlfGVufDB8fHx8MTc0Mjc3NjMzNXww&amp;ixlib=rb-4.0.3&amp;q=80&amp;w=1080 1272w, https://images.unsplash.com/photo-1470252649378-9c29740c9fa8?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHw0fHxzdW5yaXNlfGVufDB8fHx8MTc0Mjc3NjMzNXww&amp;ixlib=rb-4.0.3&amp;q=80&amp;w=1080 1456w" sizes="100vw"><img src="https://images.unsplash.com/photo-1470252649378-9c29740c9fa8?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHw0fHxzdW5yaXNlfGVufDB8fHx8MTc0Mjc3NjMzNXww&amp;ixlib=rb-4.0.3&amp;q=80&amp;w=1080" width="4159" height="2773" data-attrs="{&quot;src&quot;:&quot;https://images.unsplash.com/photo-1470252649378-9c29740c9fa8?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHw0fHxzdW5yaXNlfGVufDB8fHx8MTc0Mjc3NjMzNXww&amp;ixlib=rb-4.0.3&amp;q=80&amp;w=1080&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:2773,&quot;width&quot;:4159,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;trees under cloudy sky during sunset&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="trees under cloudy sky during sunset" title="trees under cloudy sky during sunset" srcset="https://images.unsplash.com/photo-1470252649378-9c29740c9fa8?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHw0fHxzdW5yaXNlfGVufDB8fHx8MTc0Mjc3NjMzNXww&amp;ixlib=rb-4.0.3&amp;q=80&amp;w=1080 424w, https://images.unsplash.com/photo-1470252649378-9c29740c9fa8?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHw0fHxzdW5yaXNlfGVufDB8fHx8MTc0Mjc3NjMzNXww&amp;ixlib=rb-4.0.3&amp;q=80&amp;w=1080 848w, https://images.unsplash.com/photo-1470252649378-9c29740c9fa8?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHw0fHxzdW5yaXNlfGVufDB8fHx8MTc0Mjc3NjMzNXww&amp;ixlib=rb-4.0.3&amp;q=80&amp;w=1080 1272w, https://images.unsplash.com/photo-1470252649378-9c29740c9fa8?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHw0fHxzdW5yaXNlfGVufDB8fHx8MTc0Mjc3NjMzNXww&amp;ixlib=rb-4.0.3&amp;q=80&amp;w=1080 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Photo by <a href="true">Dawid Zawi&#322;a</a> on <a href="https://unsplash.com">Unsplash</a></figcaption></figure></div><h1>Why all of this is inevitable</h1><p>It is either a binary outcome, can recurrent models, be iterated and scaled (not just by param size), to be more reliable, and have better memories.</p><ul><li><p>We have provided evidence that the above statement is true, on a progressive trend for 2 years, despite the limited resources invested in the team.</p></li><li><p>We have provided evidence that recurrent models can scale up to match the capability of some of the largest open transformer models.</p><p></p></li></ul><p>Assuming the above is true</p><ul><li><p>The high-level roadmap is here, in public.</p></li><li><p>Better and more capable open transformer models will arrive, <br>which will speed up the process for us to iterate on. <br>(little to no training from scratch required)<br></p></li><li><p>We have enough resources to keep scaling our inference platform, <br>while slowly iterating on this roadmap, with a handful of researchers.</p></li><li><p>Even if we Featherless were to suddenly disappear. Enough resources and momentum within the open model space, will let future teams slowly eventually follow this same roadmap laid out.</p><p></p></li></ul><p>Overall, we expect the following timeline, if featherless AI is properly supported in this journey, to seize the opportunity window</p><ul><li><p>&lt; 2 Years for personalized AI</p></li><li><p>&lt; 4 Years for personalized AGI</p><p></p></li></ul><p>Left on its own, with Moore&#8217;s law level of improvement in computing, we expect the hardware requirement to iterate on this roadmap to enter the &#8220;personal computing&#8221; space in 4 years. Once that is reached, we expect a rapid innovation cycle that leads to the same result, be it within or outside the USA.</p><div class="pullquote"><p>So my question is: Do you want to support us in making this happen in 2-4 years?<br>And seize the opportunity with us. Or would you like to miss out on it when it inevitably comes</p></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>This model was originally published as Qwerky-72B. However, due to confusion with another similar naming company/model, we have been requested to avoid using the Qwerky name, so we have renamed our models to QRWKV-72B</p><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[🪿QRWKV-72B and 32B : Training large attention free models, with only 8 GPU's]]></title><description><![CDATA[&#8252;&#65039; Attention is NOT all you need &#8252;&#65039;]]></description><link>https://substack.recursal.ai/p/qwerky-72b-and-32b-training-large</link><guid isPermaLink="false">https://substack.recursal.ai/p/qwerky-72b-and-32b-training-large</guid><dc:creator><![CDATA[Eugene Cheah]]></dc:creator><pubDate>Mon, 24 Mar 2025 17:30:34 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!iyqS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F525f7c7c-725d-4174-b2f7-7d3c6ccc3c04_2743x1773.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!iyqS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F525f7c7c-725d-4174-b2f7-7d3c6ccc3c04_2743x1773.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!iyqS!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F525f7c7c-725d-4174-b2f7-7d3c6ccc3c04_2743x1773.png 424w, https://substackcdn.com/image/fetch/$s_!iyqS!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F525f7c7c-725d-4174-b2f7-7d3c6ccc3c04_2743x1773.png 848w, https://substackcdn.com/image/fetch/$s_!iyqS!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F525f7c7c-725d-4174-b2f7-7d3c6ccc3c04_2743x1773.png 1272w, https://substackcdn.com/image/fetch/$s_!iyqS!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F525f7c7c-725d-4174-b2f7-7d3c6ccc3c04_2743x1773.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!iyqS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F525f7c7c-725d-4174-b2f7-7d3c6ccc3c04_2743x1773.png" width="1456" height="941" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/525f7c7c-725d-4174-b2f7-7d3c6ccc3c04_2743x1773.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:941,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!iyqS!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F525f7c7c-725d-4174-b2f7-7d3c6ccc3c04_2743x1773.png 424w, https://substackcdn.com/image/fetch/$s_!iyqS!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F525f7c7c-725d-4174-b2f7-7d3c6ccc3c04_2743x1773.png 848w, https://substackcdn.com/image/fetch/$s_!iyqS!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F525f7c7c-725d-4174-b2f7-7d3c6ccc3c04_2743x1773.png 1272w, https://substackcdn.com/image/fetch/$s_!iyqS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F525f7c7c-725d-4174-b2f7-7d3c6ccc3c04_2743x1773.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>We are proud to announce the updated QRWKV-72B<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> and 32B.</p><p>Both models are available on huggingface and featherless.ai</p><ul><li><p>32B | <a href="https://huggingface.co/featherless-ai/Qwerky-QwQ-32B">Hugging Face Link</a> | <a href="https://featherless.ai/models/featherless-ai/Qwerky-QwQ-32B/readme">Featherless AI Link</a></p></li><li><p>72B | <a href="https://featherless.ai/models/featherless-ai/Qwerky-72B/readme">Hugging Face Link</a> | <a href="https://featherless.ai/models/featherless-ai/Qwerky-72B/readme">Featherless AI Link</a></p></li></ul><p>The largest model to date - that is not based on the transformer attention architecture. </p><p>Surpassing existing transformer models in several benchmarks, while following right behind in others.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!vJPd!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8e250c60-f42f-48e4-b56a-bb69981a0277_3975x1035.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vJPd!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8e250c60-f42f-48e4-b56a-bb69981a0277_3975x1035.png 424w, https://substackcdn.com/image/fetch/$s_!vJPd!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8e250c60-f42f-48e4-b56a-bb69981a0277_3975x1035.png 848w, https://substackcdn.com/image/fetch/$s_!vJPd!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8e250c60-f42f-48e4-b56a-bb69981a0277_3975x1035.png 1272w, https://substackcdn.com/image/fetch/$s_!vJPd!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8e250c60-f42f-48e4-b56a-bb69981a0277_3975x1035.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!vJPd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8e250c60-f42f-48e4-b56a-bb69981a0277_3975x1035.png" width="724" height="188.5132075471698" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8e250c60-f42f-48e4-b56a-bb69981a0277_3975x1035.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1035,&quot;width&quot;:3975,&quot;resizeWidth&quot;:724,&quot;bytes&quot;:783967,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://substack.recursal.ai/i/159379897?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0f937f8b-4b3b-4895-a276-033b71099d9b_4378x1438.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!vJPd!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8e250c60-f42f-48e4-b56a-bb69981a0277_3975x1035.png 424w, https://substackcdn.com/image/fetch/$s_!vJPd!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8e250c60-f42f-48e4-b56a-bb69981a0277_3975x1035.png 848w, https://substackcdn.com/image/fetch/$s_!vJPd!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8e250c60-f42f-48e4-b56a-bb69981a0277_3975x1035.png 1272w, https://substackcdn.com/image/fetch/$s_!vJPd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8e250c60-f42f-48e4-b56a-bb69981a0277_3975x1035.png 1456w" sizes="100vw"></picture><div></div></div></a></figure></div><p>This builds on our previous experiments in converting the QRWKV6, where we converted the previous Qwen 2.5 32B model to RWKV. And the previous 72B preview.</p><p>Which we applied instead for the Qwen-QwQ-32B model and the Qwen-72B model respectively.</p><p>But lets take a step back at what this means &#8230;</p><div><hr></div><h1>We now have a model far surpassing <br>GPT-3.5 turbo, without QKV attention</h1><p>While slowly closing in on GPT-4O-mini</p><p>With lower inference cost, param size, and better performance.</p><div class="pullquote"><p>In 2024: When we proposed scaling up RWKV to replace attention.<br>Many believed transformer attention, is the <em><strong>only</strong></em> viable path <br>to GPT 3.5 or better intelligence. Today this is disproven false.</p></div><h2>We need no super cluster - only a single server.</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!6tPh!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb9d6e658-6567-4250-adc4-962b71a16ee6_1260x709.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!6tPh!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb9d6e658-6567-4250-adc4-962b71a16ee6_1260x709.jpeg 424w, https://substackcdn.com/image/fetch/$s_!6tPh!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb9d6e658-6567-4250-adc4-962b71a16ee6_1260x709.jpeg 848w, https://substackcdn.com/image/fetch/$s_!6tPh!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb9d6e658-6567-4250-adc4-962b71a16ee6_1260x709.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!6tPh!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb9d6e658-6567-4250-adc4-962b71a16ee6_1260x709.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!6tPh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb9d6e658-6567-4250-adc4-962b71a16ee6_1260x709.jpeg" width="1260" height="709" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b9d6e658-6567-4250-adc4-962b71a16ee6_1260x709.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:709,&quot;width&quot;:1260,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!6tPh!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb9d6e658-6567-4250-adc4-962b71a16ee6_1260x709.jpeg 424w, https://substackcdn.com/image/fetch/$s_!6tPh!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb9d6e658-6567-4250-adc4-962b71a16ee6_1260x709.jpeg 848w, https://substackcdn.com/image/fetch/$s_!6tPh!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb9d6e658-6567-4250-adc4-962b71a16ee6_1260x709.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!6tPh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb9d6e658-6567-4250-adc4-962b71a16ee6_1260x709.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Because we were keeping most of the feed forward network layer the same.<br>We can perform the conversion, (barely) within a single server of 8 MI300 GPU&#8217;s</p><p>Requiring the full 192GB VRAM allocation per GPU</p><div><hr></div><h1>How the conversion is done: A summary</h1><p>While more details will be revealed in an upcoming paper. The core idea is similar to the previous <a href="https://substack.recursal.ai/p/q-rwkv-6-32b-instruct-preview">QRWKV6 conversion</a> , but this time we apply it to the Qwen-72B and QwQ-32B models</p><p>At a high level, you take an existing transformer model</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!4UqB!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc19f66e6-aa7d-4626-a7a6-30afaed08cfa_2330x1124.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!4UqB!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc19f66e6-aa7d-4626-a7a6-30afaed08cfa_2330x1124.png 424w, https://substackcdn.com/image/fetch/$s_!4UqB!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc19f66e6-aa7d-4626-a7a6-30afaed08cfa_2330x1124.png 848w, https://substackcdn.com/image/fetch/$s_!4UqB!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc19f66e6-aa7d-4626-a7a6-30afaed08cfa_2330x1124.png 1272w, https://substackcdn.com/image/fetch/$s_!4UqB!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc19f66e6-aa7d-4626-a7a6-30afaed08cfa_2330x1124.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!4UqB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc19f66e6-aa7d-4626-a7a6-30afaed08cfa_2330x1124.png" width="1456" height="702" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c19f66e6-aa7d-4626-a7a6-30afaed08cfa_2330x1124.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:702,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:340672,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://substack.recursal.ai/i/159379897?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc19f66e6-aa7d-4626-a7a6-30afaed08cfa_2330x1124.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!4UqB!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc19f66e6-aa7d-4626-a7a6-30afaed08cfa_2330x1124.png 424w, https://substackcdn.com/image/fetch/$s_!4UqB!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc19f66e6-aa7d-4626-a7a6-30afaed08cfa_2330x1124.png 848w, https://substackcdn.com/image/fetch/$s_!4UqB!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc19f66e6-aa7d-4626-a7a6-30afaed08cfa_2330x1124.png 1272w, https://substackcdn.com/image/fetch/$s_!4UqB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc19f66e6-aa7d-4626-a7a6-30afaed08cfa_2330x1124.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Freeze all the weights, delete the attention layer, replace it with RWKV, and train it through multiple stages</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!AD0z!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88462dde-6010-46b2-a038-4cca803cee36_2328x1116.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!AD0z!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88462dde-6010-46b2-a038-4cca803cee36_2328x1116.png 424w, https://substackcdn.com/image/fetch/$s_!AD0z!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88462dde-6010-46b2-a038-4cca803cee36_2328x1116.png 848w, https://substackcdn.com/image/fetch/$s_!AD0z!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88462dde-6010-46b2-a038-4cca803cee36_2328x1116.png 1272w, https://substackcdn.com/image/fetch/$s_!AD0z!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88462dde-6010-46b2-a038-4cca803cee36_2328x1116.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!AD0z!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88462dde-6010-46b2-a038-4cca803cee36_2328x1116.png" width="1456" height="698" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/88462dde-6010-46b2-a038-4cca803cee36_2328x1116.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:698,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:404633,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://substack.recursal.ai/i/159379897?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88462dde-6010-46b2-a038-4cca803cee36_2328x1116.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!AD0z!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88462dde-6010-46b2-a038-4cca803cee36_2328x1116.png 424w, https://substackcdn.com/image/fetch/$s_!AD0z!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88462dde-6010-46b2-a038-4cca803cee36_2328x1116.png 848w, https://substackcdn.com/image/fetch/$s_!AD0z!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88462dde-6010-46b2-a038-4cca803cee36_2328x1116.png 1272w, https://substackcdn.com/image/fetch/$s_!AD0z!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88462dde-6010-46b2-a038-4cca803cee36_2328x1116.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>All while referencing the original model logits as a &#8220;teacher model&#8221;</p><p>More specifically it would be the following</p><ul><li><p>Train the RWKV layer individually, referencing the individual teacher blocks</p></li><li><p>Train the RWKV layer, with the whole model, training on teacher logits</p><ul><li><p>At this point the model is &#8220;usable&#8221; but has much more to improve on</p></li></ul></li><li><p>Train all the layers (both FFNN and RWKV), on teacher logits</p></li><li><p>Train all the layers with longer context length</p></li></ul><p>Unfortunately, due to the limitation of VRAM, our training was limited to 8k context length. However we view this as a resource constraint, and not a method constraint.</p><div><hr></div><h1>Implication:<br>AI knowledge, is not in attention, but FFN</h1><p>Due to the limited token training of 200-500M, of the converted layers. We do not believe that the newly trained RWKV layers, is sufficiently trained for &#8220;knowledge/intelligence&#8221; at this level.</p><p>In other words, the vast majority of an AI model knowledge, is not in the attention but the matrix multiplication FFN (Feed-Forward-Network) layer.</p><p>It would be more accurate, to view the Attention mechanisms, be it transformer based, or RWKV. As a means of guiding the model to focus on &#8220;what the model thinks&#8221; about in the FFN layer.</p><div><hr></div><h1>Benefits: Ideal for large scale application</h1><p>Additionally, with the shift towards inference-time-computing.</p><p>Linear architectures represents a dramatic reduction in both compute and vram requirement cost. Allowing us to scale hundreds to thousand requests per GPU.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!r0u4!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10c34ad9-606d-4790-a1de-8748b9117965_1500x1144.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!r0u4!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10c34ad9-606d-4790-a1de-8748b9117965_1500x1144.png 424w, https://substackcdn.com/image/fetch/$s_!r0u4!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10c34ad9-606d-4790-a1de-8748b9117965_1500x1144.png 848w, https://substackcdn.com/image/fetch/$s_!r0u4!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10c34ad9-606d-4790-a1de-8748b9117965_1500x1144.png 1272w, https://substackcdn.com/image/fetch/$s_!r0u4!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10c34ad9-606d-4790-a1de-8748b9117965_1500x1144.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!r0u4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10c34ad9-606d-4790-a1de-8748b9117965_1500x1144.png" width="472" height="359.83516483516485" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/10c34ad9-606d-4790-a1de-8748b9117965_1500x1144.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1110,&quot;width&quot;:1456,&quot;resizeWidth&quot;:472,&quot;bytes&quot;:235521,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://substack.recursal.ai/i/159379897?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10c34ad9-606d-4790-a1de-8748b9117965_1500x1144.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!r0u4!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10c34ad9-606d-4790-a1de-8748b9117965_1500x1144.png 424w, https://substackcdn.com/image/fetch/$s_!r0u4!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10c34ad9-606d-4790-a1de-8748b9117965_1500x1144.png 848w, https://substackcdn.com/image/fetch/$s_!r0u4!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10c34ad9-606d-4790-a1de-8748b9117965_1500x1144.png 1272w, https://substackcdn.com/image/fetch/$s_!r0u4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10c34ad9-606d-4790-a1de-8748b9117965_1500x1144.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div><hr></div><h1>We can now rapidly iterate new RWKV architectures on &lt;100B scales</h1><p>By dramatically reducing the compute requirement for scaling and testing a new RWKV attention architecture. To a small number of GPU&#8217;s</p><p>We will be able to test, iterate, and validate newer architecture changes faster, taking experiments what previously took weeks (or even months), to days.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!WeB2!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54099f83-aee3-4afa-9c50-3d4556ea6ac3_1702x1204.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!WeB2!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54099f83-aee3-4afa-9c50-3d4556ea6ac3_1702x1204.png 424w, https://substackcdn.com/image/fetch/$s_!WeB2!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54099f83-aee3-4afa-9c50-3d4556ea6ac3_1702x1204.png 848w, https://substackcdn.com/image/fetch/$s_!WeB2!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54099f83-aee3-4afa-9c50-3d4556ea6ac3_1702x1204.png 1272w, https://substackcdn.com/image/fetch/$s_!WeB2!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54099f83-aee3-4afa-9c50-3d4556ea6ac3_1702x1204.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!WeB2!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54099f83-aee3-4afa-9c50-3d4556ea6ac3_1702x1204.png" width="530" height="374.93131868131866" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/54099f83-aee3-4afa-9c50-3d4556ea6ac3_1702x1204.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1030,&quot;width&quot;:1456,&quot;resizeWidth&quot;:530,&quot;bytes&quot;:333254,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://substack.recursal.ai/i/159379897?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54099f83-aee3-4afa-9c50-3d4556ea6ac3_1702x1204.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!WeB2!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54099f83-aee3-4afa-9c50-3d4556ea6ac3_1702x1204.png 424w, https://substackcdn.com/image/fetch/$s_!WeB2!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54099f83-aee3-4afa-9c50-3d4556ea6ac3_1702x1204.png 848w, https://substackcdn.com/image/fetch/$s_!WeB2!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54099f83-aee3-4afa-9c50-3d4556ea6ac3_1702x1204.png 1272w, https://substackcdn.com/image/fetch/$s_!WeB2!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54099f83-aee3-4afa-9c50-3d4556ea6ac3_1702x1204.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Historically, the RWKV group has been averaging 4 major versions across 2 years. With improvement to both model architecture accuracy and memories at every step.</p><p>A trend which we plan to accelerate moving forward.</p><p>As we work on our roadmap to Personalized AI and eventually Personalized AGI, which you can see more in our following article &#8230;</p><div class="embedded-post-wrap" data-attrs="{&quot;id&quot;:159708830,&quot;url&quot;:&quot;https://substack.recursal.ai/p/our-roadmap-to-personalized-ai-and&quot;,&quot;publication_id&quot;:2073186,&quot;publication_name&quot;:&quot;Featherless AI - recursive dev blog&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!RY89!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F655b233b-f955-4a8e-b220-6e4f392736ef_160x160.png&quot;,&quot;title&quot;:&quot;&#128739;&#65039; Our roadmap to Personalized AI and AGI&quot;,&quot;truncated_body_text&quot;:&quot;If you want to find out more about the latest QRWKV model, that makes all of this possible, it is recommended to read this first:&quot;,&quot;date&quot;:&quot;2025-03-24T17:40:14.797Z&quot;,&quot;like_count&quot;:4,&quot;comment_count&quot;:0,&quot;bylines&quot;:[{&quot;id&quot;:99170118,&quot;name&quot;:&quot;Eugene Cheah&quot;,&quot;handle&quot;:&quot;techtalkcto&quot;,&quot;previous_name&quot;:&quot;PicoCreator&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8dcb57e-6203-4be3-ae29-03732db5c5f7_460x460.jpeg&quot;,&quot;bio&quot;:&quot;Builds Attention-Free Transformer AI models (http://wiki.rwkv.com) from scratch, CEO @ featherless.ai (prv recursal.ai) - Also known for k8s infra &amp; UI testing tools, webapps, and GPU.js, Hot-takes/Views are my own&quot;,&quot;profile_set_up_at&quot;:&quot;2022-07-17T02:17:11.664Z&quot;,&quot;reader_installed_at&quot;:null,&quot;twitter_screen_name&quot;:&quot;picocreator&quot;,&quot;is_guest&quot;:true,&quot;bestseller_tier&quot;:null,&quot;primaryPublicationId&quot;:1004639,&quot;primaryPublicationName&quot;:&quot;Tech Talk CTO&quot;,&quot;primaryPublicationUrl&quot;:&quot;https://substack.tech-talk-cto.com&quot;,&quot;primaryPublicationSubscribeUrl&quot;:&quot;https://substack.tech-talk-cto.com/subscribe?&quot;}],&quot;utm_campaign&quot;:null,&quot;belowTheFold&quot;:true,&quot;type&quot;:&quot;newsletter&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="EmbeddedPostToDOM"><a class="embedded-post" native="true" href="https://substack.recursal.ai/p/our-roadmap-to-personalized-ai-and?utm_source=substack&amp;utm_campaign=post_embed&amp;utm_medium=web"><div class="embedded-post-header"><img class="embedded-post-publication-logo" src="https://substackcdn.com/image/fetch/$s_!RY89!,w_56,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F655b233b-f955-4a8e-b220-6e4f392736ef_160x160.png" loading="lazy"><span class="embedded-post-publication-name">Featherless AI - recursive dev blog</span></div><div class="embedded-post-title-wrapper"><div class="embedded-post-title">&#128739;&#65039; Our roadmap to Personalized AI and AGI</div></div><div class="embedded-post-body">If you want to find out more about the latest QRWKV model, that makes all of this possible, it is recommended to read this first&#8230;</div><div class="embedded-post-cta-wrapper"><span class="embedded-post-cta">Read more</span></div><div class="embedded-post-meta">a year ago &#183; 4 likes &#183; Eugene Cheah</div></a></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>This model was originally published as Qwerky-72B. However, due to confusion with another similar naming company/model, we have been requested to avoid using the Qwerky name, so we have renamed our models to QRWKV-72B</p><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[Featherless.ai Raises $5M in Seed Funding to Democratize Access to Open Source AI Models]]></title><description><![CDATA[Democratizing AI with instant access to 4,000+ open-source models through our breakthrough serverless platform]]></description><link>https://substack.recursal.ai/p/featherlessai-raises-5m-in-seed-funding</link><guid isPermaLink="false">https://substack.recursal.ai/p/featherlessai-raises-5m-in-seed-funding</guid><dc:creator><![CDATA[Darin Verheijke]]></dc:creator><pubDate>Mon, 17 Mar 2025 23:07:40 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!pk94!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d729561-541d-4079-a62b-17c9374fb1fb_1600x1066.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><strong>San Francisco - March 17, 2025</strong> - We're excited to announce that Featherless.ai has raised US$5 million in Seed funding from investors including Airbus Ventures, 500 Global, Kickstart Ventures, HF0, Panache Ventures and Oakseed Ventures. As the leading serverless AI inference platform, we're on a mission to provide instant and affordable access to the world's largest collection of open-source AI models.<br>Our Vision: AI for Everyone</p><p>"AI has the power to transform industries and empower individuals around the globe. However, accessing and utilizing a diverse range of models can be cost-prohibitive and logistically challenging for many users, particularly those in emerging markets, where there's no shortage of demand for AI solutions," notes our Founder and CEO Eugene Cheah. "Featherless.ai is changing the game by providing an affordable and scalable solution for using open source AI models in production. I don't want a future where AI is controlled by the few. I want to empower individuals globally."</p><h2>What We Offer: The World's Largest Collection of Open-Source Models</h2><p>At Featherless.ai, we provide instant and affordable access to over 4,000 open-source AI models - including popular options like DeepSeek and LLama - and we're continuously onboarding new models every week. Our flat capacity pricing model ensures cost predictability and scalability, allowing businesses to dynamically scale AI usage without worrying about unexpected charges or rate limits.</p><p>Instead of dealing with infrastructure headaches, our users can focus their energies on what matters: running experiments to develop, test, and fine-tune their models. With this new funding, we're planning to support all major AI modalities &#8212; including embeddings, vision, and speech.</p><p>For casual AI users, our serverless platform offers a low entry point of just $10 per month, providing access to a wide range of the latest open-source models without the need for expensive high-end GPUs. For businesses and enterprises, we deliver scale and significant cost efficiencies through dynamic AI workload scaling and flat capacity pricing.</p><h2>Our Breakthrough Technology</h2><p>With our new funding, we're advancing research and development into next-generation AI architectures that can dramatically lower inference costs, making AI deployment feasible on lower cost hardware rather than requiring high-end GPUs.</p><p>Our architecture enables seamless support for over 100 languages without any performance degradation. This breakthrough removes a major barrier to global AI deployment, where existing systems struggle to maintain consistency across multiple languages.</p><p>One of our proudest innovations is our proprietary hot-swapping technology, which allows AI models to be switched in under 5 seconds, compared to the 30 minutes typically required to load a new AI model onto a standard GPU. This innovation optimizes GPU utilization, eliminates costly downtime, and dramatically reduces operational costs.</p><p>We're also actively involved in the development of the open-source RWKV foundation model project, the first AI model under the Linux Foundation, which has been deployed on Windows to billions of devices<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>. This effort is part of our broader commitment to develop better and more accessible AI models across languages, and reduce the reliance on closed-source models from major tech companies.</p><h2>What Our Investors Are Saying</h2><p>"The biggest hurdle to scaling AI is compute cost. Featherless.ai tackles this head-on by providing the trifecta of efficient, effective and affordable AI deployment to businesses globally. Our portfolio companies are avid users of Featherless.ai, and we look forward to expanding that across all the regions we invest in," shares Vishal Harnal, Managing Partner at 500 Global.</p><p>Yuichiro Hikosaka, Principal at Airbus Ventures, explains: "Our research into foundational model development revealed that while much of the AI industry has focused on scaling existing transformer models, fundamental inefficiencies in the architecture have remained unaddressed. As most foundational models struggle to maintain performance across multiple languages, creating accessibility barriers in global markets, these structural inefficiencies have threatened to stifle competition and innovation across the broader global ecosystem. Now enter Featherless.ai - a serverless platform that empowers users to run, test, and fine-tune the latest models seamlessly and affordably. Featherless.ai is ready and primed to address a global market."</p><p>"We see Large Language Models being experimented and integrated into applications everywhere, Featherless.ai plays a unique role in enabling organizations to test, integrate and run any and every open source LLM easily and cost-effectively. Their innovative approach and focus on serving diverse markets make them a standout in this rapidly growing space," says Chee-We Ng, Managing Partner at Oakseed Ventures.</p><h2>Our Team</h2><p>Featherless.ai is co-founded by Eugene Cheah (CEO), Harrison Vanderbyl (CTO), and Wesley George (COO), who bring combined experience of 30 years building software and leading engineering teams.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!pk94!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d729561-541d-4079-a62b-17c9374fb1fb_1600x1066.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!pk94!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d729561-541d-4079-a62b-17c9374fb1fb_1600x1066.png 424w, https://substackcdn.com/image/fetch/$s_!pk94!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d729561-541d-4079-a62b-17c9374fb1fb_1600x1066.png 848w, https://substackcdn.com/image/fetch/$s_!pk94!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d729561-541d-4079-a62b-17c9374fb1fb_1600x1066.png 1272w, https://substackcdn.com/image/fetch/$s_!pk94!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d729561-541d-4079-a62b-17c9374fb1fb_1600x1066.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!pk94!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d729561-541d-4079-a62b-17c9374fb1fb_1600x1066.png" width="1456" height="970" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6d729561-541d-4079-a62b-17c9374fb1fb_1600x1066.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:970,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Featherless.ai team pictured in Palm Springs, California.&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Featherless.ai team pictured in Palm Springs, California." title="Featherless.ai team pictured in Palm Springs, California." srcset="https://substackcdn.com/image/fetch/$s_!pk94!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d729561-541d-4079-a62b-17c9374fb1fb_1600x1066.png 424w, https://substackcdn.com/image/fetch/$s_!pk94!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d729561-541d-4079-a62b-17c9374fb1fb_1600x1066.png 848w, https://substackcdn.com/image/fetch/$s_!pk94!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d729561-541d-4079-a62b-17c9374fb1fb_1600x1066.png 1272w, https://substackcdn.com/image/fetch/$s_!pk94!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d729561-541d-4079-a62b-17c9374fb1fb_1600x1066.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Featherless.ai team pictured in Palm Springs, California.</figcaption></figure></div><h2>Join us!</h2><p>Featherless.ai is hiring! If you're passionate about making AI open and accessible, send your cover letter to <a href="mailto:hello@featherless.ai">hello@featherless.ai</a>. For more information about our platform and services, visit <a href="http://www.featherless.ai">www.featherless.ai</a>.</p><p>Together, we can democratize access to AI and ensure that these powerful technologies benefit everyone, not just those with unlimited resources.</p><h3></h3><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p><strong>RWKV.cpp - shipping to 1.5 billion systems worldwide </strong>(https://blog.rwkv.com/p/rwkvcpp-shipping-to-half-a-billion)</p><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[QwQ-32B Now Available on Featherless.ai ]]></title><description><![CDATA[QwQ-32B: A Powerful Lightweight in the Age of Reasoning Models]]></description><link>https://substack.recursal.ai/p/qwq-32b-now-available-on-featherlessai</link><guid isPermaLink="false">https://substack.recursal.ai/p/qwq-32b-now-available-on-featherlessai</guid><dc:creator><![CDATA[Darin Verheijke]]></dc:creator><pubDate>Fri, 07 Mar 2025 10:14:55 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!Z7_s!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ec322f4-1674-4a42-ad82-6c73a84edcfc_1008x400.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h2><strong>QwQ-32B: A Powerful Lightweight in the Age of Reasoning Models</strong></h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Z7_s!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ec322f4-1674-4a42-ad82-6c73a84edcfc_1008x400.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Z7_s!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ec322f4-1674-4a42-ad82-6c73a84edcfc_1008x400.png 424w, https://substackcdn.com/image/fetch/$s_!Z7_s!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ec322f4-1674-4a42-ad82-6c73a84edcfc_1008x400.png 848w, https://substackcdn.com/image/fetch/$s_!Z7_s!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ec322f4-1674-4a42-ad82-6c73a84edcfc_1008x400.png 1272w, https://substackcdn.com/image/fetch/$s_!Z7_s!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ec322f4-1674-4a42-ad82-6c73a84edcfc_1008x400.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Z7_s!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ec322f4-1674-4a42-ad82-6c73a84edcfc_1008x400.png" width="1008" height="400" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6ec322f4-1674-4a42-ad82-6c73a84edcfc_1008x400.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:400,&quot;width&quot;:1008,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:573231,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://substack.recursal.ai/i/158576603?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ec322f4-1674-4a42-ad82-6c73a84edcfc_1008x400.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Z7_s!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ec322f4-1674-4a42-ad82-6c73a84edcfc_1008x400.png 424w, https://substackcdn.com/image/fetch/$s_!Z7_s!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ec322f4-1674-4a42-ad82-6c73a84edcfc_1008x400.png 848w, https://substackcdn.com/image/fetch/$s_!Z7_s!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ec322f4-1674-4a42-ad82-6c73a84edcfc_1008x400.png 1272w, https://substackcdn.com/image/fetch/$s_!Z7_s!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ec322f4-1674-4a42-ad82-6c73a84edcfc_1008x400.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>AI development continues to advance through diverse approaches to model design and optimization. <strong>DeepSeek-R1</strong>, with its impressive 671B parameters, has established itself as one of the most capable reasoning-focused models on the market. Its remarkable capabilities have set new benchmarks for what models in this space can achieve.</p><p>Meanwhile, <strong>efficiency and adaptability</strong> continue opening new frontiers, and this is where <strong>QwQ-32B</strong>, Qwen's latest release, makes its mark.</p><h2><strong>QwQ-32B: Efficient Reasoning Power</strong></h2><p><strong>QwQ-32B</strong> delivers <strong>high-level reasoning, problem-solving, and strong coding/math capabilities</strong> in a lightweight package. With just 32B parameters, early benchmarks show impressive performance, making it an attractive option for those looking for strong reasoning capabilities in a more efficient format.</p><p>With AI applications diversifying, the demand for models that deliver <strong>excellent performance with a smaller resource footprint</strong> continues to grow. <strong>QwQ-32B exemplifies how well-optimized models can achieve remarkable results.</strong></p><h2><strong>The Evolution of Reasoning Models</strong></h2><p>The AI field is evolving rapidly. While early models focused heavily on <strong>generative fluency and knowledge retrieval</strong>, today's most exciting breakthroughs are in <strong>models that can reason, plan, and solve complex problems</strong>. DeepSeek-R1 has been instrumental in this evolution, demonstrating the power of advanced reasoning capabilities.</p><p>Now, Qwen is expanding possibilities further, showing that <strong>reasoning power can be delivered in different formats to meet diverse needs.</strong></p><h2><strong>How Does QwQ-32B Perform?</strong></h2><p>Testing indicates that QwQ-32B:</p><ul><li><p><strong>Excels in logical reasoning tasks</strong> with impressive structured problem-solving</p></li><li><p><strong>Performs well in math and coding</strong>, key benchmarks for reasoning capability</p></li><li><p><strong>Offers strong efficiency</strong>, delivering high performance with reduced compute requirements</p></li></ul><p>For users seeking <strong>high-quality reasoning in an efficient package</strong>, QwQ-32B presents an exciting option.</p><h2><strong>What This Means For You</strong></h2><p>With both QwQ-32B and DeepSeek-R1 available on Featherless.ai, you now have multiple excellent options for advanced reasoning capabilities. Our ongoing optimization efforts ensure that you'll benefit from continuous improvements in both performance and functionality.</p><p>We're committed to making advanced open AI models accessible and practical for everyone. Each model in our lineup offers unique advantages to suit different use cases and requirements.</p><h2><strong>Try Our Models and Share Your Thoughts</strong></h2><p>QwQ-32B is now available alongside DeepSeek-R1, and we want to hear about your experiences with both models. Each excels in reasoning tasks while offering different profiles in terms of scale and efficiency.</p><p>Leave a review <strong>on the Featherless.ai model page</strong>: <a href="https://featherless.ai/models/Qwen/QwQ-32B">QwQ-32B</a></p><p>Have questions about integrating these models into your workflow? Reach out to us on <a href="https://discord.gg/7gybCMPjVA">Discord</a> or check our documentation for implementation guidelines and best practices.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://substack.recursal.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading the Featherless AI dev blog! Subscribe for free to receive new posts and support our platform.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Featherless.ai introduces QRWKV-72B-Preview: The Best Post-Transformer model yet]]></title><description><![CDATA[QRWKV-72B-Preview: Revolutionizing AI Efficiency and Accessibility with Hybrid Transformer Architecture]]></description><link>https://substack.recursal.ai/p/featherlessai-introduces-qwerky-72b</link><guid isPermaLink="false">https://substack.recursal.ai/p/featherlessai-introduces-qwerky-72b</guid><dc:creator><![CDATA[Featherless AI - dev blog]]></dc:creator><pubDate>Sat, 22 Feb 2025 20:21:00 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!B6mm!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4623b9b-924d-4e8b-b36d-fbe97aaa093c_3584x2048.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!B6mm!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4623b9b-924d-4e8b-b36d-fbe97aaa093c_3584x2048.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!B6mm!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4623b9b-924d-4e8b-b36d-fbe97aaa093c_3584x2048.png 424w, https://substackcdn.com/image/fetch/$s_!B6mm!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4623b9b-924d-4e8b-b36d-fbe97aaa093c_3584x2048.png 848w, https://substackcdn.com/image/fetch/$s_!B6mm!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4623b9b-924d-4e8b-b36d-fbe97aaa093c_3584x2048.png 1272w, https://substackcdn.com/image/fetch/$s_!B6mm!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4623b9b-924d-4e8b-b36d-fbe97aaa093c_3584x2048.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!B6mm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4623b9b-924d-4e8b-b36d-fbe97aaa093c_3584x2048.png" width="1456" height="832" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c4623b9b-924d-4e8b-b36d-fbe97aaa093c_3584x2048.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:832,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:9353933,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://substack.recursal.ai/i/157697804?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4623b9b-924d-4e8b-b36d-fbe97aaa093c_3584x2048.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!B6mm!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4623b9b-924d-4e8b-b36d-fbe97aaa093c_3584x2048.png 424w, https://substackcdn.com/image/fetch/$s_!B6mm!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4623b9b-924d-4e8b-b36d-fbe97aaa093c_3584x2048.png 848w, https://substackcdn.com/image/fetch/$s_!B6mm!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4623b9b-924d-4e8b-b36d-fbe97aaa093c_3584x2048.png 1272w, https://substackcdn.com/image/fetch/$s_!B6mm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4623b9b-924d-4e8b-b36d-fbe97aaa093c_3584x2048.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><blockquote><p>IMPORTANT NOTICE: This preview model has been replaced entirely by the final QRWKV-72B<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> model found here:</p><div class="embedded-post-wrap" data-attrs="{&quot;id&quot;:159379897,&quot;url&quot;:&quot;https://substack.recursal.ai/p/qwerky-72b-and-32b-training-large&quot;,&quot;publication_id&quot;:2073186,&quot;publication_name&quot;:&quot;Featherless AI - recursive dev blog&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!RY89!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F655b233b-f955-4a8e-b220-6e4f392736ef_160x160.png&quot;,&quot;title&quot;:&quot;&#129727;QRWKV-72B and 32B : Training large attention free models, with only 8 GPU's&quot;,&quot;truncated_body_text&quot;:&quot;We are proud to announce the updated QRWKV-72B and 32B.&quot;,&quot;date&quot;:&quot;2025-03-24T17:30:34.860Z&quot;,&quot;like_count&quot;:14,&quot;comment_count&quot;:3,&quot;bylines&quot;:[{&quot;id&quot;:99170118,&quot;name&quot;:&quot;Eugene Cheah&quot;,&quot;handle&quot;:&quot;techtalkcto&quot;,&quot;previous_name&quot;:&quot;PicoCreator&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8dcb57e-6203-4be3-ae29-03732db5c5f7_460x460.jpeg&quot;,&quot;bio&quot;:&quot;Builds Attention-Free Transformer AI models (http://wiki.rwkv.com) from scratch, CEO @ featherless.ai (prv recursal.ai) - Also known for k8s infra &amp; UI testing tools, webapps, and GPU.js, Hot-takes/Views are my own&quot;,&quot;profile_set_up_at&quot;:&quot;2022-07-17T02:17:11.664Z&quot;,&quot;reader_installed_at&quot;:null,&quot;twitter_screen_name&quot;:&quot;picocreator&quot;,&quot;is_guest&quot;:true,&quot;bestseller_tier&quot;:null,&quot;primaryPublicationId&quot;:1004639,&quot;primaryPublicationName&quot;:&quot;Tech Talk CTO&quot;,&quot;primaryPublicationUrl&quot;:&quot;https://substack.tech-talk-cto.com&quot;,&quot;primaryPublicationSubscribeUrl&quot;:&quot;https://substack.tech-talk-cto.com/subscribe?&quot;}],&quot;utm_campaign&quot;:null,&quot;belowTheFold&quot;:false,&quot;type&quot;:&quot;newsletter&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="EmbeddedPostToDOM"><a class="embedded-post" native="true" href="https://substack.recursal.ai/p/qwerky-72b-and-32b-training-large?utm_source=substack&amp;utm_campaign=post_embed&amp;utm_medium=web"><div class="embedded-post-header"><img class="embedded-post-publication-logo" src="https://substackcdn.com/image/fetch/$s_!RY89!,w_56,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F655b233b-f955-4a8e-b220-6e4f392736ef_160x160.png"><span class="embedded-post-publication-name">Featherless AI - recursive dev blog</span></div><div class="embedded-post-title-wrapper"><div class="embedded-post-title">&#129727;QRWKV-72B and 32B : Training large attention free models, with only 8 GPU's</div></div><div class="embedded-post-body">We are proud to announce the updated QRWKV-72B and 32B&#8230;</div><div class="embedded-post-cta-wrapper"><span class="embedded-post-cta">Read more</span></div><div class="embedded-post-meta">a year ago &#183; 14 likes &#183; 3 comments &#183; Eugene Cheah</div></a></div></blockquote><p>At <a href="http://featherless.ai/">Featherless.ai</a> , we&#8217;re thrilled to announce the launch of QRWKV-72B , a revolutionary hybrid model combining the computational efficiency of linear transformers with the precision of attention mechanisms. This breakthrough architecture reduces GPU compute costs by over 50% compared to traditional transformers, making it one of the most cost-effective large language models available today, costing less than $100k to build. </p><p>QRWKV-72B sets a new standard for scalability and accessibility, enabling real-time applications across industries.</p><h3><strong>Why QRWKV-72B Matters</strong></h3><p>The introduction of QRWKV-72B marks a pivotal moment in AI development. By merging the strengths of linear transformers and attention transformers, QRWKV-72B achieves unparalleled efficiency without sacrificing performance.</p><p><strong>T</strong>raditional transformer-based models require ~100GB of VRAM (excluding model weights) to handle a single 72B parameter model at a 16k context length. For each additional concurrent request, the VRAM demand increases significantly due to the attention mechanism's quadratic scaling with sequence length. </p><div class="pullquote"><p><strong>In contrast, QRWKV-72B achieves remarkable efficiency by leveraging its hybrid linear-transformer architecture. At 72B parameters, QRWKV-72B requires just 1GB of additional VRAM per request (excluding model weights). Regardless of context length.</strong> </p></div><p>These innovations unlocks several critical benefits:</p><ul><li><p><strong>Lower inference costs</strong>: Businesses can deploy advanced AI solutions at a fraction of the cost, particularly for tasks requiring long-context processing on affordable GPUs.</p></li><li><p><strong>Global Accessibility:</strong> Reduced hardware requirements enable smaller organizations and developing nations to leverage state-of-the-art AI technologies.</p></li><li><p><strong>Higher Concurrency</strong><em>:</em> Serve more users simultaneously on the same hardware.</p></li><li><p><strong>Scalability</strong><em>:</em> Handle longer context lengths without exponential increases in resource usage.</p></li></ul><p>This launch aligns with our mission to make AI accessible to everyone regardless of language or nation. QRWKV-72B runs at a fraction of the inference cost of current models, especially at larger context length, by merging the computational efficiency of linear transformers with the precision of attention mechanisms. This is a key multiplier unlock for not only making AI accessible for the world but for the recent test-time compute style models.</p><h3><strong>Real-time Voice:</strong> Instant, Natural AI Conversations</h3><p>Alongside QRWKV-72B, we&#8217;re proudly introducing <strong>Real-time</strong> Voice, an ultra-low latency AI speech solution designed for seamless human-computer interaction. Built on our serverless infrastructure, it delivers fast speech processing and generation time, minimizing delays for a more natural conversation experience. This allows individuals and businesses to build interactive voice applications that respond instantly and accurately. Whether for virtual assistants, global call centers, or interactive voice response systems, Real-time Voice ensures fast, reliable, and cost-efficient AI-powered speech applications.</p><h3>Private-cloud beta</h3><p><a href="http://featherless.ai/">Featherless.ai</a> now offers a Private Cloud solution for organizations that need full control over their AI deployments. Our dedicated, secure environments allow businesses to run open models with the ease of serverless infrastructure, zero maintenance, pay-per-use pricing, and complete data sovereignty. With Private Cloud, sensitive data stays protected while maintaining the scalability and flexibility required for modern AI applications. Whether you're an enterprise prioritizing compliance and security or a developer needing custom AI deployment, <a href="http://featherless.ai/">Featherless.ai</a>&#8217;s Private Cloud delivers seamless, cost-efficient AI hosting with full control over where and how your data is processed.</p><p>We invite you to experience the future of AI with QRWKV-72B, our real-time voice capabilities and private cloud solutions. Together, let&#8217;s make AI more accessible to everyone regardless of language or nation. Visit our website to access the model via our API or download the model directly from HuggingFace:</p><ul><li><p><a href="http://featherless.ai/models/featherless/Qwerky-72B">Featherless</a></p></li><li><p><a href="https://huggingface.co/recursal/QRWKV7-72B-Instruct-250126">Huggingface</a></p></li></ul><h2>Featherless.ai</h2><p><a href="http://Featherless.ai">Featherless.ai</a> is a serverless inference platform. Our goal is to make all AI models available for serverless inference. We provide inference via API to a continually expanding library of open-weight models, including the most popular models for role-playing, creative writing, coding assistance, and more. Our solutions enable enterprises and individuals to harness the full potential of artificial intelligence without worrying about underlying infrastructure. <a href="http://Featherless.ai">Featherless.ai</a> offers scalable, secure, and easy-to-use tools that empowers businesses and individuals alike to accelerate their AI initiatives. For more information, visit <a href="http://www.featherless.ai">www.featherless.ai</a></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>This model was originally published as Qwerky-72B. However, due to confusion with another similar naming company/model, we have been requested to avoid using the Qwerky name, so we have renamed our models to QRWKV-72B</p><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[Recursal.AI becomes Featherless.AI]]></title><description><![CDATA[A New Chapter, Same Mission]]></description><link>https://substack.recursal.ai/p/recursalai-becomes-featherlessai</link><guid isPermaLink="false">https://substack.recursal.ai/p/recursalai-becomes-featherlessai</guid><dc:creator><![CDATA[Eugene Cheah]]></dc:creator><pubDate>Tue, 11 Feb 2025 02:21:11 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!IwQW!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd59b2c67-b559-469a-8ff2-238ea8d20c25_4096x3429.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!IwQW!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd59b2c67-b559-469a-8ff2-238ea8d20c25_4096x3429.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!IwQW!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd59b2c67-b559-469a-8ff2-238ea8d20c25_4096x3429.jpeg 424w, https://substackcdn.com/image/fetch/$s_!IwQW!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd59b2c67-b559-469a-8ff2-238ea8d20c25_4096x3429.jpeg 848w, https://substackcdn.com/image/fetch/$s_!IwQW!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd59b2c67-b559-469a-8ff2-238ea8d20c25_4096x3429.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!IwQW!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd59b2c67-b559-469a-8ff2-238ea8d20c25_4096x3429.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!IwQW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd59b2c67-b559-469a-8ff2-238ea8d20c25_4096x3429.jpeg" width="1456" height="1219" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d59b2c67-b559-469a-8ff2-238ea8d20c25_4096x3429.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1219,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:4171205,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!IwQW!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd59b2c67-b559-469a-8ff2-238ea8d20c25_4096x3429.jpeg 424w, https://substackcdn.com/image/fetch/$s_!IwQW!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd59b2c67-b559-469a-8ff2-238ea8d20c25_4096x3429.jpeg 848w, https://substackcdn.com/image/fetch/$s_!IwQW!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd59b2c67-b559-469a-8ff2-238ea8d20c25_4096x3429.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!IwQW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd59b2c67-b559-469a-8ff2-238ea8d20c25_4096x3429.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Over the past year, Recursal.AI has served us well as a name and identity for our development team and our various projects. It has been a banner under which we&#8217;ve worked to push the boundaries in AI.</p><p>However overtime, we are now more commonly identified by our platform Featherless.AI, which has grown to be the biggest provider of HuggingFace LLMs, striving to make them more accessible, and free from the constraints of closed ecosystems. </p><p>It has become the identity most closely associated with our work, our vision, and the principles that drive us forward. </p><p>So today, we&#8217;re making it official: Recursal.AI is now Featherless.AI. This change isn&#8217;t just about branding&#8212;it&#8217;s a reaffirmation of our commitment to the usage of open-source AI. </p><p>We remain dedicated to bringing powerful language models to the wider world for the benefit of the many. </p><p>Our mission remains the same: to develop and support AI tools that are transparent, accessible, and unshackled from proprietary control. Here&#8217;s to the next chapter&#8212;under a name that truly reflects the community and future we&#8217;re building together. </p><p>&#8212; The Featherless.AI Team</p>]]></content:encoded></item><item><title><![CDATA[QRWKV6 and a charm of finches]]></title><description><![CDATA[And how QRWKV6 stands out among our various RWKV6 experiments]]></description><link>https://substack.recursal.ai/p/qrwkv6-and-a-charm-of-finches</link><guid isPermaLink="false">https://substack.recursal.ai/p/qrwkv6-and-a-charm-of-finches</guid><dc:creator><![CDATA[Eugene Cheah]]></dc:creator><pubDate>Wed, 11 Dec 2024 10:59:04 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!g60H!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf7b2317-9d3e-4cad-82f2-89a283866771_3840x2304.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!g60H!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf7b2317-9d3e-4cad-82f2-89a283866771_3840x2304.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!g60H!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf7b2317-9d3e-4cad-82f2-89a283866771_3840x2304.png 424w, https://substackcdn.com/image/fetch/$s_!g60H!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf7b2317-9d3e-4cad-82f2-89a283866771_3840x2304.png 848w, https://substackcdn.com/image/fetch/$s_!g60H!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf7b2317-9d3e-4cad-82f2-89a283866771_3840x2304.png 1272w, https://substackcdn.com/image/fetch/$s_!g60H!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf7b2317-9d3e-4cad-82f2-89a283866771_3840x2304.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!g60H!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf7b2317-9d3e-4cad-82f2-89a283866771_3840x2304.png" width="1456" height="874" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bf7b2317-9d3e-4cad-82f2-89a283866771_3840x2304.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:874,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:14930437,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!g60H!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf7b2317-9d3e-4cad-82f2-89a283866771_3840x2304.png 424w, https://substackcdn.com/image/fetch/$s_!g60H!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf7b2317-9d3e-4cad-82f2-89a283866771_3840x2304.png 848w, https://substackcdn.com/image/fetch/$s_!g60H!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf7b2317-9d3e-4cad-82f2-89a283866771_3840x2304.png 1272w, https://substackcdn.com/image/fetch/$s_!g60H!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf7b2317-9d3e-4cad-82f2-89a283866771_3840x2304.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Happy December Neurips,</p><p>We are proud to announce the triple model weights release of a charm of finches</p><h2>Q-RWKV-6 32B Instruct Preview</h2><p>Our latest frontier model. </p><p>A variant of RWKV-6, converted from an existing Qwen 32B model.</p><p>This is our strongest linear model to date, beating out all previous RWKV, State Space and Liquid AI models, smashing all previous key english benchmarks and evals.</p><p>Excitingly, this unlocks the option of converting existing transformer models to more efficient RWKV linear architecture.</p><p>Its limitation however, is how it inherits its knowledge training, and tokenizer, from the parent model. Which in this case is limited to approximately 30 languages (compared to RWKV 100+ languages)</p><p>See more info: <a href="https://substack.recursal.ai/p/q-rwkv-6-32b-instruct-preview">Announcement article</a><br>Try the model on our: <a href="https://huggingface.co/recursal/QRWKV6-32B-Instruct-Preview-v0.1">Featherless.ai inference</a></p><h2>RWKV-6 Finch MoE 37B</h2><p>Our first RWKV MoE model, for RWKV-6, with 11B out of 37B active parameters. Currently provides one of strongest multi-lingual model</p><p>See more info: <a href="https://substack.recursal.ai/p/flock-of-finches-rwkv-6-mixture-of">Announcement article</a></p><h2>RWKV-6 Finch 7B World 3</h2><p>An overall multi-lingual upgrade of our v6 7B base models, that is a major bump up from our previous 7B models for multi-lingual and mixed use cases.</p><p>This was developed and released under the RWKV foundation. With various contributors from Eleuther AI and RWKV open source group.</p><p>See more info: <a href="https://blog.rwkv.com/p/rwkv-6-finch-7b-world-3-now-with">Announcement article</a></p>]]></content:encoded></item><item><title><![CDATA[QRWKV6 32B Instruct Preview]]></title><description><![CDATA[The strongest, and largest RWKV model variant to date: QRWKV6 32B Instruct Preview]]></description><link>https://substack.recursal.ai/p/q-rwkv-6-32b-instruct-preview</link><guid isPermaLink="false">https://substack.recursal.ai/p/q-rwkv-6-32b-instruct-preview</guid><dc:creator><![CDATA[Eugene Cheah]]></dc:creator><pubDate>Wed, 11 Dec 2024 10:51:45 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!_aR2!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb979eaed-e3c9-4e44-8b68-e226d2ed88ce_3840x2304.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!_aR2!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb979eaed-e3c9-4e44-8b68-e226d2ed88ce_3840x2304.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!_aR2!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb979eaed-e3c9-4e44-8b68-e226d2ed88ce_3840x2304.png 424w, https://substackcdn.com/image/fetch/$s_!_aR2!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb979eaed-e3c9-4e44-8b68-e226d2ed88ce_3840x2304.png 848w, https://substackcdn.com/image/fetch/$s_!_aR2!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb979eaed-e3c9-4e44-8b68-e226d2ed88ce_3840x2304.png 1272w, https://substackcdn.com/image/fetch/$s_!_aR2!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb979eaed-e3c9-4e44-8b68-e226d2ed88ce_3840x2304.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!_aR2!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb979eaed-e3c9-4e44-8b68-e226d2ed88ce_3840x2304.png" width="1456" height="874" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b979eaed-e3c9-4e44-8b68-e226d2ed88ce_3840x2304.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:874,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:12945638,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!_aR2!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb979eaed-e3c9-4e44-8b68-e226d2ed88ce_3840x2304.png 424w, https://substackcdn.com/image/fetch/$s_!_aR2!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb979eaed-e3c9-4e44-8b68-e226d2ed88ce_3840x2304.png 848w, https://substackcdn.com/image/fetch/$s_!_aR2!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb979eaed-e3c9-4e44-8b68-e226d2ed88ce_3840x2304.png 1272w, https://substackcdn.com/image/fetch/$s_!_aR2!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb979eaed-e3c9-4e44-8b68-e226d2ed88ce_3840x2304.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The strongest linear model to date, beating out all previous RWKV, State Space and Liquid AI models, smashing all previous key english benchmarks and evals.</p><p>You can find this model available on both</p><ul><li><p>Hugging Face: <br><a href="https://huggingface.co/recursal/QRWKV6-32B-Instruct-Preview-v0.1">https://huggingface.co/recursal/QRWKV6-32B-Instruct-Preview-v0.1</a></p></li><li><p>Featherless.ai: <a href="https://featherless.ai/models/recursal/QRWKV6-32B-Instruct-Preview-v0.1">https://featherless.ai/models/recursal/QRWKV6-32B-Instruct-Preview-v0.1</a></p></li></ul><p>Note: that as an instruction preview, the model is not considered final</p><p>Trained by converting the weights of the Qwen 32B Instruct model, into a customized QRWKV6 architecture. We were successfully able to replace the existing transformer attention heads with RWKV-V6 attention heads, through a groundbreaking new conversion training process.</p><p>This unique training process was developed by the team at Recursal AI, in joint collaboration with the RWKV and EleutherAI open source community.</p><h3><strong>Benchmarks</strong></h3><p>We compared QRWKV6 against existing open weights models, both transformer based and linear-architecture based.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!fC9I!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0051c55b-b57a-430b-a0b2-6f1ec9ec129f_2212x468.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!fC9I!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0051c55b-b57a-430b-a0b2-6f1ec9ec129f_2212x468.png 424w, https://substackcdn.com/image/fetch/$s_!fC9I!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0051c55b-b57a-430b-a0b2-6f1ec9ec129f_2212x468.png 848w, https://substackcdn.com/image/fetch/$s_!fC9I!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0051c55b-b57a-430b-a0b2-6f1ec9ec129f_2212x468.png 1272w, https://substackcdn.com/image/fetch/$s_!fC9I!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0051c55b-b57a-430b-a0b2-6f1ec9ec129f_2212x468.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!fC9I!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0051c55b-b57a-430b-a0b2-6f1ec9ec129f_2212x468.png" width="1456" height="308" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0051c55b-b57a-430b-a0b2-6f1ec9ec129f_2212x468.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:308,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:216374,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!fC9I!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0051c55b-b57a-430b-a0b2-6f1ec9ec129f_2212x468.png 424w, https://substackcdn.com/image/fetch/$s_!fC9I!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0051c55b-b57a-430b-a0b2-6f1ec9ec129f_2212x468.png 848w, https://substackcdn.com/image/fetch/$s_!fC9I!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0051c55b-b57a-430b-a0b2-6f1ec9ec129f_2212x468.png 1272w, https://substackcdn.com/image/fetch/$s_!fC9I!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0051c55b-b57a-430b-a0b2-6f1ec9ec129f_2212x468.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>In overall, what is most exciting is how the QRWKV6 converted model, perform similarly to its original 32B model.</p><h3><strong>GPU Sponsor: <a href="https://tensorwave.com">TensorWave</a></strong></h3><p>The conversion process for QRWKV6, was done on 16 AMD MI300X GPUs kindly donated by TensorWave. Each MI300X comes with a whopping 192GB of VRAM, while having comparable H100 level of compute performance.</p><p>This allowed us to reduce the minimum number of nodes required for our training process, and simplify our overall training and conversion process.</p><p>The conversion process took about 8 hours</p><h3><strong>The Exciting</strong></h3><p>Linear models hold promise in substantially lower compute cost at scale. Delivering over a 1000x compute efficiency in inference cost, especially over large context length. A key multiplier unlock for both O1 style inference time thinking, and making AI more accessible for the world.</p><p>This technique is also scalable to larger transformer based models. Which we have since started.</p><h3><strong>The Good</strong></h3><p>The benefit of this process is that we are able to convert any previously trained QKV Attention based model, such as Qwen and LLaMA based models, into a variant of RWKV. Without needing to retrain the model from scratch.</p><p>This allows us to quickly test and prove out the significantly more efficient RWKV Linear attention mechanic at a larger scale, with a much smaller budget, without training from scratch. Proving out the architecture design and scalability of RWKV.</p><p>Once again proving, QKV attention is not all you need. <br>( Someone ping @jefrankle and @srush_nlp )</p><h3><strong>The Bad</strong></h3><p>The disadvantage of this process is that the model inherent knowledge and dataset training, is based on its &#8220;parent&#8221; model. Meaning unlike previous RWKV models trained on over 100+ languages. The QRWKV model is limited to the approximate 30 languages supported by the Qwen line of models.</p><p>Additionally, instead of RWKV based channel mix and feedforward network layers, we retain the &#8220;parent&#8221; model feed forward network architecture design. This means there will be incompatibility with existing RWKV inference code.</p><p>Separately, due to our compute budget, we were only able to do the conversion process up to 16k context length. While the model does exhibit stability beyond the given context length, the following model may need additional training to accurately support larger context length</p><h3><strong>Future Followups</strong></h3><p>Currently Q-RWKV-6 72B Instruct model is being trained</p><p>Additionally with the finalization of RWKV-7 architecture happening soon, we intend to repeat the process and provide a full line up of</p><ul><li><p>Q-RWKV-7 32B</p></li><li><p>LLaMA-RWKV-7 70B</p></li></ul><p>We intend to provide more details on the conversion process, along with our paper after the subsequent model release.</p><div><hr></div><h3><strong>References</strong></h3><ul><li><p><a href="https://huggingface.co/recursal/QRWKV6-32B-Instruct-Preview-v0.1">Model weights and code here</a></p></li></ul><h3>Acknowledgements</h3><ul><li><p>Special thanks to TensorWave and AMD for sponsoring the MI300X training</p></li><li><p>EleutherAI for support and guidance, especially on benchmarks and publishing research papers about the RWKV architecture</p></li><li><p>Linux Foundation AI &amp; Data group for supporting and hosting the RWKV project</p></li><li><p>Recursal AI for its commitment to providing resources and development for the RWKV ecosystem - you can use their<a href="https://featherless.ai/"> featherless.ai</a> platform to easily run RWKV and compare to it other language models</p></li></ul><p>And of course a huge thank you to the many developers around the world working hard to improve the RWKV ecosystem and provide environmentally friendly open source AI for all.</p>]]></content:encoded></item><item><title><![CDATA[Flock of Finches: RWKV-6 Mixture of Experts]]></title><description><![CDATA[The largest RWKV MoE model yet!]]></description><link>https://substack.recursal.ai/p/flock-of-finches-rwkv-6-mixture-of</link><guid isPermaLink="false">https://substack.recursal.ai/p/flock-of-finches-rwkv-6-mixture-of</guid><dc:creator><![CDATA[Eugene Cheah]]></dc:creator><pubDate>Wed, 11 Dec 2024 07:38:22 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!7WCL!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75968f5b-a14e-411c-949d-1b24dedad3e4_1024x1024.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!7WCL!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75968f5b-a14e-411c-949d-1b24dedad3e4_1024x1024.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!7WCL!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75968f5b-a14e-411c-949d-1b24dedad3e4_1024x1024.jpeg 424w, https://substackcdn.com/image/fetch/$s_!7WCL!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75968f5b-a14e-411c-949d-1b24dedad3e4_1024x1024.jpeg 848w, https://substackcdn.com/image/fetch/$s_!7WCL!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75968f5b-a14e-411c-949d-1b24dedad3e4_1024x1024.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!7WCL!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75968f5b-a14e-411c-949d-1b24dedad3e4_1024x1024.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!7WCL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75968f5b-a14e-411c-949d-1b24dedad3e4_1024x1024.jpeg" width="1024" height="1024" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/75968f5b-a14e-411c-949d-1b24dedad3e4_1024x1024.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1024,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:152444,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!7WCL!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75968f5b-a14e-411c-949d-1b24dedad3e4_1024x1024.jpeg 424w, https://substackcdn.com/image/fetch/$s_!7WCL!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75968f5b-a14e-411c-949d-1b24dedad3e4_1024x1024.jpeg 848w, https://substackcdn.com/image/fetch/$s_!7WCL!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75968f5b-a14e-411c-949d-1b24dedad3e4_1024x1024.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!7WCL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75968f5b-a14e-411c-949d-1b24dedad3e4_1024x1024.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>We&#8217;re excited to release the latest addition to the RWKV family of model releases: Flock of Finches 37B-A11B v0.1! </p><p>This is an experimental model that uses 11 billion active parameters, and despite our new flock having been trained on only 109 billion tokens, roughly matches our recently released Finch 14B model on common benchmark evaluation scores. You can find the model and code at <a href="https://huggingface.co/recursal/Finch-MoE-37B-A11B-v0.1-HF">huggingface here</a> , or try it on <a href="https://featherless.ai/models/recursal/Finch-MoE-37B-A11B-v0.1-HF">featherless AI platform here</a></p><p>We leveraged an efficient Sparse Mixture of Experts (MoE) method to supply a higher total parameter count while activating only a fraction of those parameters for any given token. This saves time and uses less compute during both training and inference. As with most architectural choices, there is a tradeoff; increased efficiency comes in exchange for higher VRAM usage. </p><p>From our perspective, the ability to inexpensively train and run a model with greater powers seems very much worth that cost.</p><h1><strong>GPU Sponsor: <a href="https://tensorwave.com/">TensorWave</a></strong></h1><p>We trained Flock of Finches on 16 AMD MI300X GPUs kindly donated by <a href="https://tensorwave.com/">TensorWave</a>, over a period of nearly four weeks. Each MI300X comes with a whopping 192GB of VRAM, which easily accommodated the added VRAM requirements we had for MoE. </p><p>This allowed us to use our limited time efficiently, finding the best hyper-parameters and doing training instead of spending days or weeks developing software workarounds.</p><h1><strong>MoE Overview</strong></h1><p>A large part of the knowledge and intelligence of LLMs come from a component known as the Feed Forward Network (FFN), sometimes called the Channel Mixer. We added a flock of eight new Feed-Forward Network (&#8220;FFN&#8221;) &#8220;experts&#8221; to a Finch 7B checkpoint that had been trained on around 2 trillion tokens, then continued training it for only 109 billion more. </p><p>The original Finch FFN in Flock of Finches is always evaluated like usual, acting like the leader of the flock, and we call it the &#8220;shared expert&#8221;. Alongside this shared expert one additional expert from the flock is chosen for each token, and the results are added together. </p><p>This forms the mathematical equivalent of a double-width dynamically chosen FFN. The shared expert contributes the shared intelligence learned during the original 2 trillion tokens of training, while the new experts in the flock selectively contribute new information depending on context.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!KMuQ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe93e3a95-3fa9-40bf-b2bd-43d03e637059_1150x629.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!KMuQ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe93e3a95-3fa9-40bf-b2bd-43d03e637059_1150x629.png 424w, https://substackcdn.com/image/fetch/$s_!KMuQ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe93e3a95-3fa9-40bf-b2bd-43d03e637059_1150x629.png 848w, https://substackcdn.com/image/fetch/$s_!KMuQ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe93e3a95-3fa9-40bf-b2bd-43d03e637059_1150x629.png 1272w, https://substackcdn.com/image/fetch/$s_!KMuQ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe93e3a95-3fa9-40bf-b2bd-43d03e637059_1150x629.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!KMuQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe93e3a95-3fa9-40bf-b2bd-43d03e637059_1150x629.png" width="1150" height="629" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e93e3a95-3fa9-40bf-b2bd-43d03e637059_1150x629.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:629,&quot;width&quot;:1150,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:111650,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!KMuQ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe93e3a95-3fa9-40bf-b2bd-43d03e637059_1150x629.png 424w, https://substackcdn.com/image/fetch/$s_!KMuQ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe93e3a95-3fa9-40bf-b2bd-43d03e637059_1150x629.png 848w, https://substackcdn.com/image/fetch/$s_!KMuQ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe93e3a95-3fa9-40bf-b2bd-43d03e637059_1150x629.png 1272w, https://substackcdn.com/image/fetch/$s_!KMuQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe93e3a95-3fa9-40bf-b2bd-43d03e637059_1150x629.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h1><strong>MoE Shared Expert</strong></h1><p>A few choices we made were unusual, and make Flock of Finches a bit different from other MoE architectures you may have encountered in the wild. One such choice was to use a shared expert and add eight fresh experts, instead of replacing the original FFN with eight cloned copies and continuing training from there. </p><p>We found that this setup learned much faster, even when accounting for the extra width and therefore computation it adds. We also discovered that with this setup we were able to use an extremely high initial learning rate for the new experts, eventually annealing it down to the original model&#8217;s learning rate as training progressed.</p><h1><strong>MoE Hash Routing</strong></h1><p>Another unusual choice we made was to use hash routing instead of a trained top-k gated router. We chose this partly for simplicity and speed, but also because it gives us a naturally even token-to-expert routing distribution, which we hope will improve inference efficiency. Hash routing is extremely simple; we take the token index fed into the model plus a prime number and use that result modulo eight as the index of the expert to which that token is sent for processing. Many other MoE models use a learned gating function, which is trained instead of being fixed in advance of training.</p><p>And one final very RWKV-specific quirk was our use of token-shift with these new experts. Ordinarily, RWKV does a unique kind of 1D convolution as part of its FFN called token-shift, which mixes parts of the current and prior token together. This allows the model to perform some kinds of operations in a single layer that a traditional transformer would require two layers to accomplish. We tried various ways of applying token-shift to our new experts, and in the end we found that the most efficient way was to perform the same shift on the input that goes to both the shared and new experts. The gate applied to FFN outputs is also generated from a single token-shift and applied uniformly to the combined output.</p><h1><strong>Benchmark</strong></h1><p>We evaluated Flock of Finches across a range of common industry standard benchmarks using EleutherAI&#8217;s lm-eval-harness. While some benchmarks got higher scores and some lower, it was generally around the same level as our recently released Finch 14B model. This is an interesting result for us, as this model has significantly fewer active parameters (11B versus 14B), and those parameters are more concentrated in the Feed Forward Network portion of the model.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!8OeU!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fecc362eb-34df-4887-9ac2-b72c0efa12be_497x183.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!8OeU!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fecc362eb-34df-4887-9ac2-b72c0efa12be_497x183.png 424w, https://substackcdn.com/image/fetch/$s_!8OeU!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fecc362eb-34df-4887-9ac2-b72c0efa12be_497x183.png 848w, https://substackcdn.com/image/fetch/$s_!8OeU!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fecc362eb-34df-4887-9ac2-b72c0efa12be_497x183.png 1272w, https://substackcdn.com/image/fetch/$s_!8OeU!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fecc362eb-34df-4887-9ac2-b72c0efa12be_497x183.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!8OeU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fecc362eb-34df-4887-9ac2-b72c0efa12be_497x183.png" width="497" height="183" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ecc362eb-34df-4887-9ac2-b72c0efa12be_497x183.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:183,&quot;width&quot;:497,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:8114,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!8OeU!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fecc362eb-34df-4887-9ac2-b72c0efa12be_497x183.png 424w, https://substackcdn.com/image/fetch/$s_!8OeU!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fecc362eb-34df-4887-9ac2-b72c0efa12be_497x183.png 848w, https://substackcdn.com/image/fetch/$s_!8OeU!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fecc362eb-34df-4887-9ac2-b72c0efa12be_497x183.png 1272w, https://substackcdn.com/image/fetch/$s_!8OeU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fecc362eb-34df-4887-9ac2-b72c0efa12be_497x183.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><h1><strong>The Takeaway</strong></h1><p>Flock of Finches 37B-A11B features a new Mixture of Experts RWKV-6 architecture with 11 billion active parameters and 37 billion parameters total. It&#8217;s the largest RWKV MoE model yet, but it&#8217;s just our first step combining MoE with the RWKV architecture. We&#8217;re excited to expand the use of MoE to the Time Mixer portion of RWKV, and to try more complex MoE ideas like employing expert parameter sharing across breadth and depth, and combining a larger number of narrower experts.</p><p>We hope you&#8217;ll give Flock of Finches a try, and see how the RWKV ecosystem is growing with new more powerful models.</p><h1><strong>References</strong></h1><ul><li><p>Weights &amp; Code: <a href="https://huggingface.co/recursal/Finch-MoE-37B-A11B-v0.1-HF">https://huggingface.co/recursal/Finch-MoE-37B-A11B-v0.1-HF</a></p></li></ul><h1><strong>Acknowledgements</strong></h1><ul><li><p>Special thanks to TensorWave and AMD for sponsoring the Flock of Finches MI300X training run</p></li><li><p>Recursal AI for its commitment to providing resources and development for the RWKV ecosystem - you can use their<a href="https://featherless.ai"> featherless.ai</a> platform to easily run RWKV and compare to it other language models</p></li><li><p>EleutherAI for support and guidance, especially on benchmarks and publishing research papers about the RWKV architecture</p></li><li><p>Linux Foundation AI &amp; Data group for supporting and hosting the RWKV project</p></li></ul><p>And of course a huge thank you to the many developers around the world working hard to improve the RWKV ecosystem and provide environmentally friendly open source AI for all.</p>]]></content:encoded></item><item><title><![CDATA[Featherless Feud: a dip into LLM-powered game development]]></title><description><![CDATA[rebuilding a TV classic]]></description><link>https://substack.recursal.ai/p/featherless-feud-a-dip-into-llm-powered</link><guid isPermaLink="false">https://substack.recursal.ai/p/featherless-feud-a-dip-into-llm-powered</guid><dc:creator><![CDATA[Erik Cadieux]]></dc:creator><pubDate>Tue, 20 Aug 2024 04:29:55 GMT</pubDate><enclosure url="https://substackcdn.com/image/youtube/w_728,c_limit/aIPf7nsvrGM" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>It takes time to develop a sense of how to apply LLMs to concrete problems. But a great way to do this is to build with them; in this post, we&#8217;ll work through building a web version of the TV classic Family Feud.</p><div id="youtube2-aIPf7nsvrGM" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;aIPf7nsvrGM&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/aIPf7nsvrGM?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><p>This is a game with elements that are quite challenging for &#8220;traditional&#8221; software development that are remarkably easy with an LLM. If you want to try what we built before reading further, you can do so <a href="https://huggingface.co/spaces/Darok/Featherless-Feud">here</a>!</p><h1>The Challenges</h1><p>Building a game requires solving lots of problems around input, output, UI, visuals, latency, the list goes on. We&#8217;ll be focussing on the <em>content </em>related challenges that are not easy for the &#8220;rules-based&#8221; nature of typical software.</p><p>When we think about building Family Feud - or Featherless Feud as we call this version - we hit two problems:</p><ol><li><p>question and answer generation</p></li><li><p>fuzzy answer matching</p></li></ol><p>i.e. how do come up with quiz questions, and when given answers that aren&#8217;t exactly on the list, how do we match them?</p><p>For the actual show, the question bank is built with the help of surveys of the general public, meanwhile identifying variation of official answers are done by show staff live, a natural language task which is simple for humans, but challenging for software.</p><p>In both cases, we&#8217;ll be prompting an LLM to generate a certain kind of text and interpreting the results, but beyond just prompting, how we run the inference also matters. Let&#8217;s dive in.</p><h1><strong>Part 1 - Question/Answers Generation</strong></h1><p>To generate questions, we might prompt an LLM like so</p><pre><code>You are the producer of the game show Family Feud.

Your job is to devise a question and a list of common answers to this question. For each answer, output a number between 0 and 100 which is how common you think the answer would be when given by a member of the general public. The sum of the scores should not exceed 100.</code></pre><p>Now that content has structure (e.g. question versus answers) that we need to know about so that the various pieces can be handled appropriately by the different parts of our app.</p><p>The LLM &#8220;knows&#8221; about the relationship between the questions and answers in an output, so we can ask it to include the annotations in the output. We really could ask the LLM to output any kind of structured format but the most common format to request output in is JSON.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!dxsY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F253df487-e755-4a6a-9fca-d72c2488a306_1792x1024.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!dxsY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F253df487-e755-4a6a-9fca-d72c2488a306_1792x1024.webp 424w, https://substackcdn.com/image/fetch/$s_!dxsY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F253df487-e755-4a6a-9fca-d72c2488a306_1792x1024.webp 848w, https://substackcdn.com/image/fetch/$s_!dxsY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F253df487-e755-4a6a-9fca-d72c2488a306_1792x1024.webp 1272w, https://substackcdn.com/image/fetch/$s_!dxsY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F253df487-e755-4a6a-9fca-d72c2488a306_1792x1024.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!dxsY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F253df487-e755-4a6a-9fca-d72c2488a306_1792x1024.webp" width="1456" height="832" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/253df487-e755-4a6a-9fca-d72c2488a306_1792x1024.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:832,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;A modern, sleek, and minimalistic banner that visually represents JSON data parsing in a game development context. The background is dark, with abstract digital lines and code snippets flowing from left to right, symbolizing data parsing and AI integration. In the center, a simplified JSON structure glows, with key-value pairs like 'question' and 'answers' prominently displayed. Around this structure, subtle hints of a Family Feud-style game show, such as silhouettes of a game board or abstract players, are faintly visible. The overall color scheme should include dark tones with glowing accents, creating a high-tech and focused atmosphere.&quot;,&quot;title&quot;:&quot;A modern, sleek, and minimalistic banner that visually represents JSON data parsing in a game development context. The background is dark, with abstract digital lines and code snippets flowing from left to right, symbolizing data parsing and AI integration. In the center, a simplified JSON structure glows, with key-value pairs like 'question' and 'answers' prominently displayed. Around this structure, subtle hints of a Family Feud-style game show, such as silhouettes of a game board or abstract players, are faintly visible. The overall color scheme should include dark tones with glowing accents, creating a high-tech and focused atmosphere.&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="A modern, sleek, and minimalistic banner that visually represents JSON data parsing in a game development context. The background is dark, with abstract digital lines and code snippets flowing from left to right, symbolizing data parsing and AI integration. In the center, a simplified JSON structure glows, with key-value pairs like 'question' and 'answers' prominently displayed. Around this structure, subtle hints of a Family Feud-style game show, such as silhouettes of a game board or abstract players, are faintly visible. The overall color scheme should include dark tones with glowing accents, creating a high-tech and focused atmosphere." title="A modern, sleek, and minimalistic banner that visually represents JSON data parsing in a game development context. The background is dark, with abstract digital lines and code snippets flowing from left to right, symbolizing data parsing and AI integration. In the center, a simplified JSON structure glows, with key-value pairs like 'question' and 'answers' prominently displayed. Around this structure, subtle hints of a Family Feud-style game show, such as silhouettes of a game board or abstract players, are faintly visible. The overall color scheme should include dark tones with glowing accents, creating a high-tech and focused atmosphere." srcset="https://substackcdn.com/image/fetch/$s_!dxsY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F253df487-e755-4a6a-9fca-d72c2488a306_1792x1024.webp 424w, https://substackcdn.com/image/fetch/$s_!dxsY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F253df487-e755-4a6a-9fca-d72c2488a306_1792x1024.webp 848w, https://substackcdn.com/image/fetch/$s_!dxsY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F253df487-e755-4a6a-9fca-d72c2488a306_1792x1024.webp 1272w, https://substackcdn.com/image/fetch/$s_!dxsY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F253df487-e755-4a6a-9fca-d72c2488a306_1792x1024.webp 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>As for getting a model to formulate it&#8217;s output as JSON there are typically two parts:</p><ol><li><p>prompting for the answer to be in JSON with a description of desired structure</p></li><li><p>requesting JSON from your inference API call</p></li></ol><p>For 1., you might update your prompt as follows</p><pre><code>You are the producer of the game show Family Feud.

Your job is to devise a question and a list of common answers to this question. For each answer, output a number between 0 and 100 which is how common you think the answer would be when given by a member of the general public. The sum of the scores should not exceed 100.

Be very concise, present only the question and answers, and don't add anything else.

Output valid JSON with the following format: two keys, `question`, which should be a string, and `answers` should be an array of objects. Each object in the `answers` array should have two keys, `answer` and `score`, each of which should be a string.

e.g.
```json
{
  "question": "Name something that might be wobbly",
  "answers": [
    { "answer": "Furniture" , score: 64 },
    { "answer": "Person/A Drunk", score: 15 },
    { "answer": "Spinning toy/top", score: 5 },
    { "answer": "Shopping Cart", score: 3 }
  ]
}
```

The sum of the scores should not exceed 100.</code></pre><p>Being prescriptive in structure is important; if the output description isn&#8217;t clear enough to the model, you&#8217;ll get variation in structure and keys used, which likely cause an exception when the part of your software operating on the JSON output tries to separate the question from the answers.</p><p>Though even with an explicit description of output structure, the model to output syntactically valid JSON is it is unlikely that if you prompting only will have your output will be valid JSON. Here&#8217;s a sample of the output from above:</p><pre><code>Here is a question with a list of answers and scores:

{
  "question": "Name something you might find in a purse or wallet",
  "answers": [
    { "answer": "Money", score: 70 },
    { "answer": "Phone", score: 20 },
    { "answer": "Credit Card", score: 5 },
    { "answer": "Candy", score: 2 },
    { "answer": "Makeup", score: 3 }
  ]
}</code></pre><p>Pretty good right?  It came up with a question, and in the format as requested (and needed for the game). It also outputted the `score` values as Numbers even though I asked, in the prompt, for their input to be strings! The model overrode my instructions, but for the better. However despite the request to give <em>nothing</em> but the output, there still is some pre-amble which requires it&#8217;s own intelligence to parse. How do we get rid of this?</p><p>The other half the equation is an inference technique known as &#8220;guided decoding&#8221; in it&#8217;s most general form, but known as JSON mode. Here we guarantee the output is syntactically valid JSON by manipulating token probabilities during sampling to prevent syntactically invalid JSON from appearing (e.g. the string {&#8216;&#8216;} is <em>not </em>valid JSON).</p><p>Of course these two modes are in tension with another - if you don&#8217;t ask for JSON in your prompt (or specify structure clearly enough), and put on JSON mode, it&#8217;s possible that your output bears little relationship to your input. In an early version of the prompt above, I got a number of questions.</p><p>How you request JSON mode varies by your API provider (this is a specific type of  guided decoding which is a big topic), but many (most?) providers that have an API will accept the same format to the request as OpenAI, which is to include a</p><pre><code>"request_format": { "type": "json_object" }</code></pre><p>in the body of the inference POST request. Once this is included, my output is exactly how I hoped: exactly the JSON and only the JSON.</p><pre><code>{
  "question": "Name something you might find in a garage",
  "answers": [
    { "answer": "Tools", "score": 40 },
    { "answer": "Bike", "score": 20 },
    { "answer": "Work Bench", "score": 12 },
    { "answer": "Vacuum", "score": 8 },
    { "answer": "Lawn Mower", "score": 5 },
    { "answer": "Storage Containers", "score": 5 }
  ]
}</code></pre><p>Note generation <em>takes longer</em> when JSON mode is enabled. This is a consequence of how that guided decoding works, which is worth a blog post in it&#8217;s own right.</p><p>We haven&#8217;t talked about the Featherless API we&#8217;ve been using for inference, but hold on to that thought - more on that in a bit.</p><h1><strong>Part 2 - Fuzzy Answer Matching</strong></h1><p>The other challenge is matching players answers to the list of official answers, as they don&#8217;t always match, in our running example from the video above, the question &#8220;Name something that might be wobbly&#8221; had the following answers</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!3Mmg!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff159847e-d4be-4fd9-907e-dc9b16d993f5_1388x788.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!3Mmg!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff159847e-d4be-4fd9-907e-dc9b16d993f5_1388x788.png 424w, https://substackcdn.com/image/fetch/$s_!3Mmg!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff159847e-d4be-4fd9-907e-dc9b16d993f5_1388x788.png 848w, https://substackcdn.com/image/fetch/$s_!3Mmg!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff159847e-d4be-4fd9-907e-dc9b16d993f5_1388x788.png 1272w, https://substackcdn.com/image/fetch/$s_!3Mmg!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff159847e-d4be-4fd9-907e-dc9b16d993f5_1388x788.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!3Mmg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff159847e-d4be-4fd9-907e-dc9b16d993f5_1388x788.png" width="1388" height="788" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f159847e-d4be-4fd9-907e-dc9b16d993f5_1388x788.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:788,&quot;width&quot;:1388,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1837675,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!3Mmg!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff159847e-d4be-4fd9-907e-dc9b16d993f5_1388x788.png 424w, https://substackcdn.com/image/fetch/$s_!3Mmg!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff159847e-d4be-4fd9-907e-dc9b16d993f5_1388x788.png 848w, https://substackcdn.com/image/fetch/$s_!3Mmg!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff159847e-d4be-4fd9-907e-dc9b16d993f5_1388x788.png 1272w, https://substackcdn.com/image/fetch/$s_!3Mmg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff159847e-d4be-4fd9-907e-dc9b16d993f5_1388x788.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Name something that might be wobbly - &#8220;A stool&#8221; and &#8220;A baby&#8221; were both accepted answers</figcaption></figure></div><p>Where &#8220;A stool&#8221; was accepted in place of &#8220;Furniture&#8221; and &#8220;A baby&#8221; was accepted as a version of &#8220;Person / a drunk&#8221;.</p><p>It&#8217;s relatively easy to devise a prompt and post-processing that captures the gist of the idea. Here&#8217;s a short python program we used during testing to let you iterate through prompt and model choice by letting an answer you supply on the command line is judged.</p><div class="github-gist" data-attrs="{&quot;innerHTML&quot;:&quot;<div id=\&quot;gist132134060\&quot; class=\&quot;gist\&quot;>\n    <div class=\&quot;gist-file\&quot; translate=\&quot;no\&quot; data-color-mode=\&quot;light\&quot; data-light-theme=\&quot;light\&quot;>\n      <div class=\&quot;gist-data\&quot;>\n        <div class=\&quot;js-gist-file-update-container js-task-list-container\&quot;>\n  <div id=\&quot;file-family-feud-judging-prompt-py\&quot; class=\&quot;file my-2\&quot;>\n    \n    <div itemprop=\&quot;text\&quot; class=\&quot;Box-body p-0 blob-wrapper data type-python  \&quot;>\n\n        \n<div class=\&quot;js-check-bidi js-blob-code-container blob-code-content\&quot;>\n\n  <template class=\&quot;js-file-alert-template\&quot;>\n  <div data-view-component=\&quot;true\&quot; class=\&quot;flash flash-warn flash-full d-flex flex-items-center\&quot;>\n  <svg aria-hidden=\&quot;true\&quot; height=\&quot;16\&quot; viewBox=\&quot;0 0 16 16\&quot; version=\&quot;1.1\&quot; width=\&quot;16\&quot; data-view-component=\&quot;true\&quot; class=\&quot;octicon octicon-alert\&quot;>\n    <path d=\&quot;M6.457 1.047c.659-1.234 2.427-1.234 3.086 0l6.082 11.378A1.75 1.75 0 0 1 14.082 15H1.918a1.75 1.75 0 0 1-1.543-2.575Zm1.763.707a.25.25 0 0 0-.44 0L1.698 13.132a.25.25 0 0 0 .22.368h12.164a.25.25 0 0 0 .22-.368Zm.53 3.996v2.5a.75.75 0 0 1-1.5 0v-2.5a.75.75 0 0 1 1.5 0ZM9 11a1 1 0 1 1-2 0 1 1 0 0 1 2 0Z\&quot;></path>\n</svg>\n    <span>\n      This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.\n      <a class=\&quot;Link--inTextBlock\&quot; href=\&quot;https://github.co/hiddenchars\&quot; target=\&quot;_blank\&quot;>Learn more about bidirectional Unicode characters</a>\n    </span>\n\n\n  <div data-view-component=\&quot;true\&quot; class=\&quot;flash-action\&quot;>        <a href=\&quot;{{ revealButtonHref }}\&quot; data-view-component=\&quot;true\&quot; class=\&quot;btn-sm btn\&quot;>    Show hidden characters\n</a>\n</div>\n</div></template>\n<template class=\&quot;js-line-alert-template\&quot;>\n  <span aria-label=\&quot;This line has hidden Unicode characters\&quot; data-view-component=\&quot;true\&quot; class=\&quot;line-alert tooltipped tooltipped-e\&quot;>\n    <svg aria-hidden=\&quot;true\&quot; height=\&quot;16\&quot; viewBox=\&quot;0 0 16 16\&quot; version=\&quot;1.1\&quot; width=\&quot;16\&quot; data-view-component=\&quot;true\&quot; class=\&quot;octicon octicon-alert\&quot;>\n    <path d=\&quot;M6.457 1.047c.659-1.234 2.427-1.234 3.086 0l6.082 11.378A1.75 1.75 0 0 1 14.082 15H1.918a1.75 1.75 0 0 1-1.543-2.575Zm1.763.707a.25.25 0 0 0-.44 0L1.698 13.132a.25.25 0 0 0 .22.368h12.164a.25.25 0 0 0 .22-.368Zm.53 3.996v2.5a.75.75 0 0 1-1.5 0v-2.5a.75.75 0 0 1 1.5 0ZM9 11a1 1 0 1 1-2 0 1 1 0 0 1 2 0Z\&quot;></path>\n</svg>\n</span></template>\n\n  <table data-hpc class=\&quot;highlight tab-size js-file-line-container js-code-nav-container js-tagsearch-file\&quot; data-tab-size=\&quot;8\&quot; data-paste-markdown-skip data-tagsearch-lang=\&quot;Python\&quot; data-tagsearch-path=\&quot;family-feud-judging-prompt.py\&quot;>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L1\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;1\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC1\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span class=pl-k>from</span> <span class=pl-s1>openai</span> <span class=pl-k>import</span> <span class=pl-v>OpenAI</span></td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L2\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;2\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC2\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span class=pl-k>import</span> <span class=pl-s1>os</span></td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L3\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;3\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC3\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span class=pl-k>import</span> <span class=pl-s1>click</span></td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L4\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;4\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC4\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>\n</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L5\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;5\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC5\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span class=pl-s1>client</span> <span class=pl-c1>=</span> <span class=pl-v>OpenAI</span>(</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L6\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;6\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC6\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>  <span class=pl-s1>base_url</span><span class=pl-c1>=</span><span class=pl-s>&amp;quot;https://api.featherless.ai/v1&amp;quot;</span>,</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L7\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;7\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC7\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>  <span class=pl-s1>api_key</span><span class=pl-c1>=</span><span class=pl-s1>os</span>.<span class=pl-s1>environ</span>.<span class=pl-en>get</span>(<span class=pl-s>&amp;#39;FEATHERLESS_API_KEY&amp;#39;</span>)</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L8\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;8\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC8\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>)</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L9\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;9\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC9\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>\n</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L10\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;10\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC10\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span class=pl-en>@<span class=pl-s1>click</span>.<span class=pl-en>command</span>()</span></td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L11\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;11\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC11\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span class=pl-en>@<span class=pl-s1>click</span>.<span class=pl-en>argument</span>(<span class=pl-s>&amp;#39;answer&amp;#39;</span>)</span></td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L12\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;12\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC12\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span class=pl-en>@<span class=pl-s1>click</span>.<span class=pl-en>option</span>(<span class=pl-s>&amp;#39;--model&amp;#39;</span>, <span class=pl-s1>default</span><span class=pl-c1>=</span><span class=pl-s>&amp;#39;meta-llama/Meta-Llama-3-8B-Instruct&amp;#39;</span>)</span></td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L13\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;13\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC13\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span class=pl-k>def</span> <span class=pl-en>quiz</span>(<span class=pl-s1>answer</span>, <span class=pl-s1>model</span>):</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L14\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;14\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC14\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>  <span class=pl-s1>question</span> <span class=pl-c1>=</span> <span class=pl-s>&amp;quot;Name something that might be wobbly&amp;quot;</span></td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L15\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;15\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC15\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>  <span class=pl-s1>official_answers</span> <span class=pl-c1>=</span> [</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L16\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;16\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC16\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>    <span class=pl-s>&amp;quot;Furniture&amp;quot;</span>,</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L17\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;17\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC17\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>    <span class=pl-s>&amp;quot;Person/A Drunk&amp;quot;</span>,</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L18\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;18\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC18\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>    <span class=pl-s>&amp;quot;Spinning toy/top&amp;quot;</span>,</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L19\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;19\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC19\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>    <span class=pl-s>&amp;quot;Shopping Cart&amp;quot;</span>,</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L20\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;20\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC20\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>  ]</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L21\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;21\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC21\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>\n</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L22\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;22\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC22\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>  <span class=pl-s1>system_prompt</span> <span class=pl-c1>=</span> <span class=pl-s>f&amp;quot;&amp;quot;&amp;quot;You are a judge on the show Family Feud.</span></td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L23\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;23\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC23\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span class=pl-s>You&amp;#39;re going to receive a guess from a contestant. That guess is a guess at one of the answers to the question</span></td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L24\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;24\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC24\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span class=pl-s></span></td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L25\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;25\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC25\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span class=pl-s>Question is &amp;quot;<span class=pl-s1><span class=pl-kos>{</span><span class=pl-s1>question</span><span class=pl-kos>}</span></span>&amp;quot;</span></td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L26\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;26\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC26\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span class=pl-s></span></td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L27\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;27\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC27\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span class=pl-s>The official answers are</span></td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L28\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;28\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC28\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span class=pl-s><span class=pl-s1><span class=pl-kos>{</span><span class=pl-s>&amp;#39;<span class=pl-cce>\\n</span>&amp;#39;</span>.<span class=pl-en>join</span>([ <span class=pl-s>f&amp;quot;* <span class=pl-s1><span class=pl-kos>{</span><span class=pl-s1>a</span><span class=pl-kos>}</span></span>&amp;quot;</span> <span class=pl-k>for</span> <span class=pl-s1>a</span> <span class=pl-c1>in</span> <span class=pl-s1>official_answers</span> ])<span class=pl-kos>}</span></span></span></td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L29\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;29\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC29\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span class=pl-s></span></td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L30\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;30\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC30\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span class=pl-s>If the candidate&amp;#39;s guess is a version of an official answer, please respond the wording of the official answer.</span></td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L31\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;31\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC31\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span class=pl-s></span></td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L32\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;32\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC32\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span class=pl-s>If the candidate&amp;#39;s guess is not a version of any official answer, respond with &amp;quot;Survey says ... AERR!!&amp;quot;</span></td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L33\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;33\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC33\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span class=pl-s>&amp;quot;&amp;quot;&amp;quot;</span></td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L34\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;34\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC34\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>\n</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L35\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;35\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC35\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>  <span class=pl-en>print</span>(<span class=pl-s1>model</span>)</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L36\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;36\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC36\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>  <span class=pl-en>print</span>(<span class=pl-s1>system_prompt</span>)</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L37\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;37\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC37\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>\n</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L38\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;38\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC38\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>  <span class=pl-s1>chat_completions_response</span> <span class=pl-c1>=</span> <span class=pl-s1>client</span>.<span class=pl-s1>chat</span>.<span class=pl-s1>completions</span>.<span class=pl-en>create</span>(</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L39\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;39\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC39\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>    <span class=pl-s1>model</span><span class=pl-c1>=</span><span class=pl-s1>model</span>,</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L40\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;40\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC40\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>    <span class=pl-s1>messages</span><span class=pl-c1>=</span>[</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L41\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;41\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC41\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>      { <span class=pl-s>&amp;quot;role&amp;quot;</span>: <span class=pl-s>&amp;quot;system&amp;quot;</span>, <span class=pl-s>&amp;quot;content&amp;quot;</span>: <span class=pl-s1>system_prompt</span>},</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L42\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;42\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC42\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>      { <span class=pl-s>&amp;quot;role&amp;quot;</span>: <span class=pl-s>&amp;quot;user&amp;quot;</span>, <span class=pl-s>&amp;quot;content&amp;quot;</span>: <span class=pl-s1>answer</span> }</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L43\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;43\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC43\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>    ],</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L44\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;44\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC44\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>    <span class=pl-s1>max_tokens</span><span class=pl-c1>=</span><span class=pl-c1>250</span>,</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L45\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;45\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC45\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>  )</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L46\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;46\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC46\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>  <span class=pl-en>print</span>(<span class=pl-s1>chat_completions_response</span>.<span class=pl-s1>choices</span>[<span class=pl-c1>0</span>].<span class=pl-s1>message</span>.<span class=pl-s1>content</span>)</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L47\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;47\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC47\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>\n</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L48\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;48\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC48\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span class=pl-k>if</span> <span class=pl-s1>__name__</span> <span class=pl-c1>==</span> <span class=pl-s>&amp;quot;__main__&amp;quot;</span>:</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L49\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;49\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC49\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>  <span class=pl-en>quiz</span>()</td>\n        </tr>\n  </table>\n</div>\n\n\n    </div>\n\n  </div>\n</div>\n\n      </div>\n      <div class=\&quot;gist-meta\&quot;>\n        <a href=\&quot;https://gist.github.com/wxgeorge/4d8f812c381e62dc91c006296d9b0561/raw/8beabc223d1578a0eee77c2e7bfe02a4af378992/family-feud-judging-prompt.py\&quot; style=\&quot;float:right\&quot; class=\&quot;Link--inTextBlock\&quot;>view raw</a>\n        <a href=\&quot;https://gist.github.com/wxgeorge/4d8f812c381e62dc91c006296d9b0561#file-family-feud-judging-prompt-py\&quot; class=\&quot;Link--inTextBlock\&quot;>\n          family-feud-judging-prompt.py\n        </a>\n        hosted with &amp;#10084; by <a class=\&quot;Link--inTextBlock\&quot; href=\&quot;https://github.com\&quot;>GitHub</a>\n      </div>\n    </div>\n</div>\n&quot;,&quot;stylesheet&quot;:&quot;https://github.githubassets.com/assets/gist-embed-3575177cfe1a.css&quot;}" data-component-name="GitgistToDOM"><link rel="stylesheet" href="https://github.githubassets.com/assets/gist-embed-3575177cfe1a.css"><div id="gist132134060" class="gist">
    <div class="gist-file" data-color-mode="light" data-light-theme="light">
      <div class="gist-data">
        <div class="js-gist-file-update-container js-task-list-container">
  <div id="file-family-feud-judging-prompt-py" class="file my-2">
    
    <div itemprop="text" class="Box-body p-0 blob-wrapper data type-python  ">

        
<div class="js-check-bidi js-blob-code-container blob-code-content">

  
  <div data-view-component="true" class="flash flash-warn flash-full d-flex flex-items-center">
  
    

    <span>
      This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
      <a class="Link--inTextBlock" href="https://github.co/hiddenchars" target="_blank">Learn more about bidirectional Unicode characters</a>
    </span>


  <div data-view-component="true" class="flash-action">        <a href="{{ revealButtonHref }}" data-view-component="true" class="btn-sm btn">    Show hidden characters
</a>
</div>
</div>

  <span data-view-component="true" class="line-alert tooltipped tooltipped-e">
    
    

</span>

  <table data-hpc="" class="highlight tab-size js-file-line-container js-code-nav-container js-tagsearch-file" data-tab-size="8" data-paste-markdown-skip="" data-tagsearch-lang="Python" data-tagsearch-path="family-feud-judging-prompt.py">
        <tbody><tr>
          <td id="file-family-feud-judging-prompt-py-L1" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="1"></td>
          <td id="file-family-feud-judging-prompt-py-LC1" class="blob-code blob-code-inner js-file-line"><span class="pl-k">from</span> <span class="pl-s1">openai</span> <span class="pl-k">import</span> <span class="pl-v">OpenAI</span></td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L2" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="2"></td>
          <td id="file-family-feud-judging-prompt-py-LC2" class="blob-code blob-code-inner js-file-line"><span class="pl-k">import</span> <span class="pl-s1">os</span></td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L3" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="3"></td>
          <td id="file-family-feud-judging-prompt-py-LC3" class="blob-code blob-code-inner js-file-line"><span class="pl-k">import</span> <span class="pl-s1">click</span></td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L4" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="4"></td>
          <td id="file-family-feud-judging-prompt-py-LC4" class="blob-code blob-code-inner js-file-line">
</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L5" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="5"></td>
          <td id="file-family-feud-judging-prompt-py-LC5" class="blob-code blob-code-inner js-file-line"><span class="pl-s1">client</span> <span class="pl-c1">=</span> <span class="pl-v">OpenAI</span>(</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L6" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="6"></td>
          <td id="file-family-feud-judging-prompt-py-LC6" class="blob-code blob-code-inner js-file-line">  <span class="pl-s1">base_url</span><span class="pl-c1">=</span><span class="pl-s">"https://api.featherless.ai/v1"</span>,</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L7" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="7"></td>
          <td id="file-family-feud-judging-prompt-py-LC7" class="blob-code blob-code-inner js-file-line">  <span class="pl-s1">api_key</span><span class="pl-c1">=</span><span class="pl-s1">os</span>.<span class="pl-s1">environ</span>.<span class="pl-en">get</span>(<span class="pl-s">'FEATHERLESS_API_KEY'</span>)</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L8" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="8"></td>
          <td id="file-family-feud-judging-prompt-py-LC8" class="blob-code blob-code-inner js-file-line">)</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L9" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="9"></td>
          <td id="file-family-feud-judging-prompt-py-LC9" class="blob-code blob-code-inner js-file-line">
</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L10" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="10"></td>
          <td id="file-family-feud-judging-prompt-py-LC10" class="blob-code blob-code-inner js-file-line"><span class="pl-en">@<span class="pl-s1">click</span>.<span class="pl-en">command</span>()</span></td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L11" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="11"></td>
          <td id="file-family-feud-judging-prompt-py-LC11" class="blob-code blob-code-inner js-file-line"><span class="pl-en">@<span class="pl-s1">click</span>.<span class="pl-en">argument</span>(<span class="pl-s">'answer'</span>)</span></td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L12" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="12"></td>
          <td id="file-family-feud-judging-prompt-py-LC12" class="blob-code blob-code-inner js-file-line"><span class="pl-en">@<span class="pl-s1">click</span>.<span class="pl-en">option</span>(<span class="pl-s">'--model'</span>, <span class="pl-s1">default</span><span class="pl-c1">=</span><span class="pl-s">'meta-llama/Meta-Llama-3-8B-Instruct'</span>)</span></td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L13" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="13"></td>
          <td id="file-family-feud-judging-prompt-py-LC13" class="blob-code blob-code-inner js-file-line"><span class="pl-k">def</span> <span class="pl-en">quiz</span>(<span class="pl-s1">answer</span>, <span class="pl-s1">model</span>):</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L14" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="14"></td>
          <td id="file-family-feud-judging-prompt-py-LC14" class="blob-code blob-code-inner js-file-line">  <span class="pl-s1">question</span> <span class="pl-c1">=</span> <span class="pl-s">"Name something that might be wobbly"</span></td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L15" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="15"></td>
          <td id="file-family-feud-judging-prompt-py-LC15" class="blob-code blob-code-inner js-file-line">  <span class="pl-s1">official_answers</span> <span class="pl-c1">=</span> [</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L16" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="16"></td>
          <td id="file-family-feud-judging-prompt-py-LC16" class="blob-code blob-code-inner js-file-line">    <span class="pl-s">"Furniture"</span>,</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L17" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="17"></td>
          <td id="file-family-feud-judging-prompt-py-LC17" class="blob-code blob-code-inner js-file-line">    <span class="pl-s">"Person/A Drunk"</span>,</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L18" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="18"></td>
          <td id="file-family-feud-judging-prompt-py-LC18" class="blob-code blob-code-inner js-file-line">    <span class="pl-s">"Spinning toy/top"</span>,</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L19" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="19"></td>
          <td id="file-family-feud-judging-prompt-py-LC19" class="blob-code blob-code-inner js-file-line">    <span class="pl-s">"Shopping Cart"</span>,</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L20" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="20"></td>
          <td id="file-family-feud-judging-prompt-py-LC20" class="blob-code blob-code-inner js-file-line">  ]</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L21" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="21"></td>
          <td id="file-family-feud-judging-prompt-py-LC21" class="blob-code blob-code-inner js-file-line">
</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L22" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="22"></td>
          <td id="file-family-feud-judging-prompt-py-LC22" class="blob-code blob-code-inner js-file-line">  <span class="pl-s1">system_prompt</span> <span class="pl-c1">=</span> <span class="pl-s">f"""You are a judge on the show Family Feud.</span></td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L23" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="23"></td>
          <td id="file-family-feud-judging-prompt-py-LC23" class="blob-code blob-code-inner js-file-line"><span class="pl-s">You're going to receive a guess from a contestant. That guess is a guess at one of the answers to the question</span></td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L24" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="24"></td>
          <td id="file-family-feud-judging-prompt-py-LC24" class="blob-code blob-code-inner js-file-line"><span class="pl-s"></span></td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L25" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="25"></td>
          <td id="file-family-feud-judging-prompt-py-LC25" class="blob-code blob-code-inner js-file-line"><span class="pl-s">Question is "<span class="pl-s1"><span class="pl-kos">{</span><span class="pl-s1">question</span><span class="pl-kos">}</span></span>"</span></td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L26" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="26"></td>
          <td id="file-family-feud-judging-prompt-py-LC26" class="blob-code blob-code-inner js-file-line"><span class="pl-s"></span></td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L27" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="27"></td>
          <td id="file-family-feud-judging-prompt-py-LC27" class="blob-code blob-code-inner js-file-line"><span class="pl-s">The official answers are</span></td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L28" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="28"></td>
          <td id="file-family-feud-judging-prompt-py-LC28" class="blob-code blob-code-inner js-file-line"><span class="pl-s"><span class="pl-s1"><span class="pl-kos">{</span><span class="pl-s">'<span class="pl-cce">\n</span>'</span>.<span class="pl-en">join</span>([ <span class="pl-s">f"* <span class="pl-s1"><span class="pl-kos">{</span><span class="pl-s1">a</span><span class="pl-kos">}</span></span>"</span> <span class="pl-k">for</span> <span class="pl-s1">a</span> <span class="pl-c1">in</span> <span class="pl-s1">official_answers</span> ])<span class="pl-kos">}</span></span></span></td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L29" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="29"></td>
          <td id="file-family-feud-judging-prompt-py-LC29" class="blob-code blob-code-inner js-file-line"><span class="pl-s"></span></td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L30" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="30"></td>
          <td id="file-family-feud-judging-prompt-py-LC30" class="blob-code blob-code-inner js-file-line"><span class="pl-s">If the candidate's guess is a version of an official answer, please respond the wording of the official answer.</span></td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L31" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="31"></td>
          <td id="file-family-feud-judging-prompt-py-LC31" class="blob-code blob-code-inner js-file-line"><span class="pl-s"></span></td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L32" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="32"></td>
          <td id="file-family-feud-judging-prompt-py-LC32" class="blob-code blob-code-inner js-file-line"><span class="pl-s">If the candidate's guess is not a version of any official answer, respond with "Survey says ... AERR!!"</span></td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L33" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="33"></td>
          <td id="file-family-feud-judging-prompt-py-LC33" class="blob-code blob-code-inner js-file-line"><span class="pl-s">"""</span></td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L34" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="34"></td>
          <td id="file-family-feud-judging-prompt-py-LC34" class="blob-code blob-code-inner js-file-line">
</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L35" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="35"></td>
          <td id="file-family-feud-judging-prompt-py-LC35" class="blob-code blob-code-inner js-file-line">  <span class="pl-en">print</span>(<span class="pl-s1">model</span>)</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L36" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="36"></td>
          <td id="file-family-feud-judging-prompt-py-LC36" class="blob-code blob-code-inner js-file-line">  <span class="pl-en">print</span>(<span class="pl-s1">system_prompt</span>)</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L37" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="37"></td>
          <td id="file-family-feud-judging-prompt-py-LC37" class="blob-code blob-code-inner js-file-line">
</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L38" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="38"></td>
          <td id="file-family-feud-judging-prompt-py-LC38" class="blob-code blob-code-inner js-file-line">  <span class="pl-s1">chat_completions_response</span> <span class="pl-c1">=</span> <span class="pl-s1">client</span>.<span class="pl-s1">chat</span>.<span class="pl-s1">completions</span>.<span class="pl-en">create</span>(</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L39" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="39"></td>
          <td id="file-family-feud-judging-prompt-py-LC39" class="blob-code blob-code-inner js-file-line">    <span class="pl-s1">model</span><span class="pl-c1">=</span><span class="pl-s1">model</span>,</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L40" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="40"></td>
          <td id="file-family-feud-judging-prompt-py-LC40" class="blob-code blob-code-inner js-file-line">    <span class="pl-s1">messages</span><span class="pl-c1">=</span>[</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L41" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="41"></td>
          <td id="file-family-feud-judging-prompt-py-LC41" class="blob-code blob-code-inner js-file-line">      { <span class="pl-s">"role"</span>: <span class="pl-s">"system"</span>, <span class="pl-s">"content"</span>: <span class="pl-s1">system_prompt</span>},</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L42" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="42"></td>
          <td id="file-family-feud-judging-prompt-py-LC42" class="blob-code blob-code-inner js-file-line">      { <span class="pl-s">"role"</span>: <span class="pl-s">"user"</span>, <span class="pl-s">"content"</span>: <span class="pl-s1">answer</span> }</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L43" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="43"></td>
          <td id="file-family-feud-judging-prompt-py-LC43" class="blob-code blob-code-inner js-file-line">    ],</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L44" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="44"></td>
          <td id="file-family-feud-judging-prompt-py-LC44" class="blob-code blob-code-inner js-file-line">    <span class="pl-s1">max_tokens</span><span class="pl-c1">=</span><span class="pl-c1">250</span>,</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L45" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="45"></td>
          <td id="file-family-feud-judging-prompt-py-LC45" class="blob-code blob-code-inner js-file-line">  )</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L46" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="46"></td>
          <td id="file-family-feud-judging-prompt-py-LC46" class="blob-code blob-code-inner js-file-line">  <span class="pl-en">print</span>(<span class="pl-s1">chat_completions_response</span>.<span class="pl-s1">choices</span>[<span class="pl-c1">0</span>].<span class="pl-s1">message</span>.<span class="pl-s1">content</span>)</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L47" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="47"></td>
          <td id="file-family-feud-judging-prompt-py-LC47" class="blob-code blob-code-inner js-file-line">
</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L48" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="48"></td>
          <td id="file-family-feud-judging-prompt-py-LC48" class="blob-code blob-code-inner js-file-line"><span class="pl-k">if</span> <span class="pl-s1">__name__</span> <span class="pl-c1">==</span> <span class="pl-s">"__main__"</span>:</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L49" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="49"></td>
          <td id="file-family-feud-judging-prompt-py-LC49" class="blob-code blob-code-inner js-file-line">  <span class="pl-en">quiz</span>()</td>
        </tr>
  </tbody></table>
</div>


    </div>

  </div>
</div>

      </div>
      <div class="gist-meta">
        <a href="https://gist.github.com/wxgeorge/4d8f812c381e62dc91c006296d9b0561/raw/8beabc223d1578a0eee77c2e7bfe02a4af378992/family-feud-judging-prompt.py" style="float:right" class="Link--inTextBlock">view raw</a>
        <a href="https://gist.github.com/wxgeorge/4d8f812c381e62dc91c006296d9b0561#file-family-feud-judging-prompt-py" class="Link--inTextBlock">
          family-feud-judging-prompt.py
        </a>
        hosted with &#10084; by <a class="Link--inTextBlock" href="https://github.com">GitHub</a>
      </div>
    </div>
</div>
</div><p>But catching the comedic subtlety of the game is something else entirely. Despite many prompt iterations and many different models I tried (including Llama 3.1 405B!), I couldn&#8217;t get the judge to accept &#8220;A baby&#8221; as a variation of the answer &#8220;Person / A drunk&#8221;. If you have a different result, I&#8217;d love to hear it in the comments!</p><h1>Choosing an inference provider</h1><p>There is a growing list of inference providers to choose from. Every foundation model company provides API access to their models (OpenAI, Anthropic, etc.), companies focussed just on inference provision (friendli.ai, replicate.ai, together.ai), and aggregators (e.g. openrouter.ai).</p><p>What&#8217;s unique about <a href="https://featherless.ai">featherless.ai</a> is the number of models available <em>serverlessly, </em>i.e. without paying for dedicated GPUs. With the exception of featherless, all other serverless providers only make the most popular models are available, since, behind the scenes, there is dedicated infrastructure to those models, those costs are just being amortized across a large enough group of consumers. Featherless is different. <em>Every</em> fine-tune of a large class of models (i.e. all fine-tunes of a specific set of base architectures) is available serverlessly. Our goal is to make every public model on HuggingFace available for <em>serverless </em>inference and we currently have over 2k models available.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!7SEz!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a13354d-e4bb-45e1-964d-5625cc49bf30_2860x1452.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!7SEz!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a13354d-e4bb-45e1-964d-5625cc49bf30_2860x1452.png 424w, https://substackcdn.com/image/fetch/$s_!7SEz!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a13354d-e4bb-45e1-964d-5625cc49bf30_2860x1452.png 848w, https://substackcdn.com/image/fetch/$s_!7SEz!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a13354d-e4bb-45e1-964d-5625cc49bf30_2860x1452.png 1272w, https://substackcdn.com/image/fetch/$s_!7SEz!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a13354d-e4bb-45e1-964d-5625cc49bf30_2860x1452.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!7SEz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a13354d-e4bb-45e1-964d-5625cc49bf30_2860x1452.png" width="1456" height="739" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6a13354d-e4bb-45e1-964d-5625cc49bf30_2860x1452.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:739,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:305513,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!7SEz!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a13354d-e4bb-45e1-964d-5625cc49bf30_2860x1452.png 424w, https://substackcdn.com/image/fetch/$s_!7SEz!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a13354d-e4bb-45e1-964d-5625cc49bf30_2860x1452.png 848w, https://substackcdn.com/image/fetch/$s_!7SEz!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a13354d-e4bb-45e1-964d-5625cc49bf30_2860x1452.png 1272w, https://substackcdn.com/image/fetch/$s_!7SEz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a13354d-e4bb-45e1-964d-5625cc49bf30_2860x1452.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">https://featherless.ai/models</figcaption></figure></div><p>The next closest service in model count is openrouter.ai, which aggregates providers, and has less than 10% of the available models.</p><h2><strong>Model Selection and Testing</strong></h2><p>Play testing is an important part of any game development, and as we were playing with Featherless Feud, we tried a number of different models. The primary issues were a lack of diverse answers or answers that were too ambiguous for the game format. We&#8217;ve left the utility box that overrides the model in the lower-right hand corner of the game; this showcases some of the strength of the featherless platform: take any of our 2k models from our models page, drop it into that selector, and the game is instantly updated. When contrasted that just model download times are tens of minutes for small models, you might appreciate this feat.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://huggingface.co/spaces/Darok/Featherless-Feud" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!CWkG!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d194cd-3655-478d-ae13-e60b406e58fc_2065x1162.png 424w, https://substackcdn.com/image/fetch/$s_!CWkG!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d194cd-3655-478d-ae13-e60b406e58fc_2065x1162.png 848w, https://substackcdn.com/image/fetch/$s_!CWkG!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d194cd-3655-478d-ae13-e60b406e58fc_2065x1162.png 1272w, https://substackcdn.com/image/fetch/$s_!CWkG!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d194cd-3655-478d-ae13-e60b406e58fc_2065x1162.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!CWkG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d194cd-3655-478d-ae13-e60b406e58fc_2065x1162.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f9d194cd-3655-478d-ae13-e60b406e58fc_2065x1162.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://huggingface.co/spaces/Darok/Featherless-Feud&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!CWkG!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d194cd-3655-478d-ae13-e60b406e58fc_2065x1162.png 424w, https://substackcdn.com/image/fetch/$s_!CWkG!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d194cd-3655-478d-ae13-e60b406e58fc_2065x1162.png 848w, https://substackcdn.com/image/fetch/$s_!CWkG!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d194cd-3655-478d-ae13-e60b406e58fc_2065x1162.png 1272w, https://substackcdn.com/image/fetch/$s_!CWkG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d194cd-3655-478d-ae13-e60b406e58fc_2065x1162.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">the text box in the lower right</figcaption></figure></div><p>We tested a range of model architectures, sizes, and data-sets, including</p><ul><li><p><a href="https://featherless.ai/models/failspy/Meta-Llama-3-8B-Instruct-abliterated-v3">failspy/Meta-Llama-3-8B-Instruct-abliterated-v3</a> - we chose this model to explore if the abilteration approach would increase the variety of questions in an interesting way (inconclusive)</p></li><li><p><a href="https://featherless.ai/models/Qwen/Qwen1.5-32B">qwen/qwen1.5-32b</a> - The Qwen models feel very different than the Llama models - maybe that&#8217;s the culture of Alibaba versus Meta coming through &#8230;</p></li><li><p><a href="https://featherless.ai/models/meta-llama/Meta-Llama-3.1-70B-Instruct">meta-llama/Meta-Llama-3.1-70B-Instruct</a><strong>:</strong> While this 70B model offered superior judgment in evaluating user answers, its slower response time made it less suitable for rapid iteration during gameplay. However, it was useful in scenarios where answer accuracy was critical.</p></li><li><p><a href="https://featherless.ai/models/alpindale/magnum-72b-v1">alpindale/magnum-72b-v1</a><strong>:</strong> Initially, this 72B model seemed promising due to its creative output. However, its tendency for erratic responses made it unsuitable for the structured requirements of Featherless Feud.</p></li><li><p><a href="https://featherless.ai/models/Kooten/Mistral-Nemo-Instruct-2407-norefuse-OAS">Kooten/Mistral-Nemo-Instruct-2407-norefuse-OAS</a></p></li><li><p><strong><a href="https://featherless.ai/models/Sao10K/L3-8B-Stheno-v3.2">Sao10K/L3-8B-Stheno-v3.2</a>:</strong> This 8B model was selected for its optimal balance between speed and output quality. It consistently produced valid JSON responses, making it a reliable choice for generating game content efficiently.</p></li><li><p><a href="https://featherless.ai/models/anthracite-org/magnum-32b-v2">anthracite-org/magnum-32b-v2</a> - a qwen2-32b fine-tune intended for role-play</p></li></ul><p>At a high-level, the more RP-focussed models tended to generate more variety of questions, and the bigger models didn&#8217;t generate content with a sufficient jump in quality to justify the longer running time. So we settled on Stheno as it provided a balance of variety of output with fastest inference time.</p><p>Try running <a href="https://huggingface.co/spaces/Darok/Featherless-Feud">the game</a> with some of the models and let us know in the comments what you differences you notice!</p><h1>Wrapping up</h1><p>We learned a ton in this journey, and hope you did too. Perhaps most surprising is that capturing the comedic subtlety of Family Feud escaped us even when using the more capable (i.e. larger) open-source LLMs.</p><p>Thanks for reading! Let us know how you enjoyed this article, either in the comments, or with an email to hello@featherless.ai, and head on over to <a href="https://featherless.ai">featherless.ai</a> to experiment with the models listed here and more.</p><p>And again, <a href="https://huggingface.co/spaces/Darok/Featherless-Feud">check out the running game here</a></p>]]></content:encoded></item><item><title><![CDATA[minmodmon: A quickstart to local RWKV]]></title><description><![CDATA[In April we launched our RWKV-based model, EagleX v2. EagleX goes toe-to-toe with modern transformers on performance, while being much cheaper to run, and with an infinite context limit. The most common question I have personally seen about EagleX since then however has been, "How do I run it?".]]></description><link>https://substack.recursal.ai/p/minmodmon-a-quickstart-to-local-rwkv</link><guid isPermaLink="false">https://substack.recursal.ai/p/minmodmon-a-quickstart-to-local-rwkv</guid><dc:creator><![CDATA[Layl Bongers]]></dc:creator><pubDate>Wed, 14 Aug 2024 13:01:36 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!rqLR!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab64c050-2157-436d-b345-c7520fe63c73_1280x640.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!rqLR!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab64c050-2157-436d-b345-c7520fe63c73_1280x640.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!rqLR!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab64c050-2157-436d-b345-c7520fe63c73_1280x640.png 424w, https://substackcdn.com/image/fetch/$s_!rqLR!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab64c050-2157-436d-b345-c7520fe63c73_1280x640.png 848w, https://substackcdn.com/image/fetch/$s_!rqLR!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab64c050-2157-436d-b345-c7520fe63c73_1280x640.png 1272w, https://substackcdn.com/image/fetch/$s_!rqLR!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab64c050-2157-436d-b345-c7520fe63c73_1280x640.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!rqLR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab64c050-2157-436d-b345-c7520fe63c73_1280x640.png" width="1280" height="640" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ab64c050-2157-436d-b345-c7520fe63c73_1280x640.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:640,&quot;width&quot;:1280,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:151215,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!rqLR!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab64c050-2157-436d-b345-c7520fe63c73_1280x640.png 424w, https://substackcdn.com/image/fetch/$s_!rqLR!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab64c050-2157-436d-b345-c7520fe63c73_1280x640.png 848w, https://substackcdn.com/image/fetch/$s_!rqLR!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab64c050-2157-436d-b345-c7520fe63c73_1280x640.png 1272w, https://substackcdn.com/image/fetch/$s_!rqLR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab64c050-2157-436d-b345-c7520fe63c73_1280x640.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>In April we launched our RWKV-based model, <a href="https://substack.recursal.ai/cp/143699561">EagleX v2</a>. EagleX goes toe-to-toe with modern transformers on performance, while being much cheaper to run, and with an infinite context limit. The most common question I have personally seen about EagleX since then however has been, "How do I run it?".</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://substack.recursal.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Recursal AI development blog! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p><a href="https://github.com/recursal/minmodmon">Minmodmon</a> is a small self-contained tool that lets you easily and quickly run RWKV-based models on Windows, on your GPU, <strong>locally</strong>. No dependencies required! Just <a href="https://github.com/recursal/minmodmon/releases">download the latest release ZIP</a> and run it!</p><h1>AI for everyone</h1><p>Our number 1 goal at Recursal is to make sure *everyone* receives the benefits of AI. We don't want you to have to be a technology expert to start running EagleX. There are already plenty of ways to run RWKV, but these tend to be aimed at more expert users.</p><p>Minmodmon was made to be used by anyone wanting to try out AI models, regardless of computer skill. It runs on Windows, requires no separate installs (no python, pip, etc), and needs no command-line expertise.</p><p>However, by design minmodmon is very limited. For a more feature-complete setup, the library <a href="https://github.com/cryscan/web-rwkv">web-rwkv</a> that is used by minmodmon is also used in the excellent project <a href="https://github.com/Ai00-X/ai00_server">ai00_server</a>.</p><h1>What you can do with it</h1><p>Minmodmon is for use with <strong>other</strong> applications. You can't talk to a model directly through its web interface, but it integrates with common standards.</p><p>In particular, I recommend trying out minmodmon with <a href="https://docs.sillytavern.app/">SillyTavern</a>. SillyTavern is an amazing AI chat application that lets you load in AI personas to talk with locally, completely free and open source.</p><p>This is still an early release. If you encounter any issues head over to <a href="https://github.com/recursal/minmodmon/issues">our issue tracker</a> and report them to us!</p><h1>What's next</h1><p>This release is just one step in our plans. We want both local and remote AI to be a seamless experience, putting you in control of what you use and where your data goes. The user experience still leaves much to be desired, but we have big plans in the works.</p><p>If you do not have a powerful GPU necessary to run local models, or just want better performance and larger models, we also recently launched a privacy-focused remote AI service, <a href="https://featherless.ai/">Featherless</a>.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://substack.recursal.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Recursal AI development blog! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Model Support Summary + new this week]]></title><description><![CDATA[Nemo 12B and Qwen2 32B latest additions for a total of 12 model families]]></description><link>https://substack.recursal.ai/p/model-support-summary-new-this-week</link><guid isPermaLink="false">https://substack.recursal.ai/p/model-support-summary-new-this-week</guid><dc:creator><![CDATA[Wesley George]]></dc:creator><pubDate>Sat, 03 Aug 2024 02:34:25 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!dpMl!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1eacd009-a430-4bba-9f62-7aba44a59ac5_1118x570.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Featherless is a new kind of inference provider: we are building serverless inference for <em>all </em>of hugging face. We&#8217;re working through this one <em>architecture</em> at a time <em>(</em>e.g. Llama 3.1 8B).</p><p>Since our initial launch in June, we&#8217;ve been adding architectures, with fanfare only in our discord. But with Mistral Nemo 12B and Qwen2 32B becoming supported this week, taking the # of supported architectures to 12 and the total inferencible model count* to nearly 2k (1,922 at time of writing), this post seems overdue.</p><p>The full list of supported architectures is available on <a href="https://featherless.ai/about">our about page</a>, but the timeline is this</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!dpMl!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1eacd009-a430-4bba-9f62-7aba44a59ac5_1118x570.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!dpMl!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1eacd009-a430-4bba-9f62-7aba44a59ac5_1118x570.png 424w, https://substackcdn.com/image/fetch/$s_!dpMl!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1eacd009-a430-4bba-9f62-7aba44a59ac5_1118x570.png 848w, https://substackcdn.com/image/fetch/$s_!dpMl!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1eacd009-a430-4bba-9f62-7aba44a59ac5_1118x570.png 1272w, https://substackcdn.com/image/fetch/$s_!dpMl!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1eacd009-a430-4bba-9f62-7aba44a59ac5_1118x570.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!dpMl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1eacd009-a430-4bba-9f62-7aba44a59ac5_1118x570.png" width="1118" height="570" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1eacd009-a430-4bba-9f62-7aba44a59ac5_1118x570.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:570,&quot;width&quot;:1118,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:112730,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!dpMl!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1eacd009-a430-4bba-9f62-7aba44a59ac5_1118x570.png 424w, https://substackcdn.com/image/fetch/$s_!dpMl!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1eacd009-a430-4bba-9f62-7aba44a59ac5_1118x570.png 848w, https://substackcdn.com/image/fetch/$s_!dpMl!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1eacd009-a430-4bba-9f62-7aba44a59ac5_1118x570.png 1272w, https://substackcdn.com/image/fetch/$s_!dpMl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1eacd009-a430-4bba-9f62-7aba44a59ac5_1118x570.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>When you break that model registration out over time, it looks something like this</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!fqyB!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6289878-ff73-4ab8-abe4-1cd5e4f37278_600x371.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!fqyB!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6289878-ff73-4ab8-abe4-1cd5e4f37278_600x371.png 424w, https://substackcdn.com/image/fetch/$s_!fqyB!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6289878-ff73-4ab8-abe4-1cd5e4f37278_600x371.png 848w, https://substackcdn.com/image/fetch/$s_!fqyB!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6289878-ff73-4ab8-abe4-1cd5e4f37278_600x371.png 1272w, https://substackcdn.com/image/fetch/$s_!fqyB!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6289878-ff73-4ab8-abe4-1cd5e4f37278_600x371.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!fqyB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6289878-ff73-4ab8-abe4-1cd5e4f37278_600x371.png" width="600" height="371" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d6289878-ff73-4ab8-abe4-1cd5e4f37278_600x371.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:371,&quot;width&quot;:600,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:24852,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!fqyB!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6289878-ff73-4ab8-abe4-1cd5e4f37278_600x371.png 424w, https://substackcdn.com/image/fetch/$s_!fqyB!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6289878-ff73-4ab8-abe4-1cd5e4f37278_600x371.png 848w, https://substackcdn.com/image/fetch/$s_!fqyB!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6289878-ff73-4ab8-abe4-1cd5e4f37278_600x371.png 1272w, https://substackcdn.com/image/fetch/$s_!fqyB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6289878-ff73-4ab8-abe4-1cd5e4f37278_600x371.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>That smaller models are easier (take less time and money) to fine-tune accounts for a larger proportion of 7B and 8Bs (versus 70B, 72B). Likewise the older the model is, the more time folks have had to fine tune (hence many Llama 2 tunes, but few Nemos and Qwens).</p><p>If you want to weigh in on what model architecture we&#8217;re supporting next, <a href="https://discord.gg/5Fw9dFH65S">join our discord</a>.</p><p>Also a plug for fine-tuners: we&#8217;re working on a set of features that will be of benefit to model creators. If you are doing some fine-tuning, we&#8217;d love to connect with you for feedback on these upcoming features.</p>]]></content:encoded></item><item><title><![CDATA[🪶 Featherless.ai referral program]]></title><description><![CDATA[Invite a friend, and if they sign up for a plan, its $10 off both of your next bill]]></description><link>https://substack.recursal.ai/p/featherlessai-referral-program</link><guid isPermaLink="false">https://substack.recursal.ai/p/featherlessai-referral-program</guid><dc:creator><![CDATA[Wesley George]]></dc:creator><pubDate>Mon, 24 Jun 2024 07:18:29 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!w8-w!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff3e5da1-6cdf-40e2-a4da-8f8a1c1c3a95_2428x1814.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!w8-w!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff3e5da1-6cdf-40e2-a4da-8f8a1c1c3a95_2428x1814.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!w8-w!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff3e5da1-6cdf-40e2-a4da-8f8a1c1c3a95_2428x1814.png 424w, https://substackcdn.com/image/fetch/$s_!w8-w!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff3e5da1-6cdf-40e2-a4da-8f8a1c1c3a95_2428x1814.png 848w, https://substackcdn.com/image/fetch/$s_!w8-w!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff3e5da1-6cdf-40e2-a4da-8f8a1c1c3a95_2428x1814.png 1272w, https://substackcdn.com/image/fetch/$s_!w8-w!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff3e5da1-6cdf-40e2-a4da-8f8a1c1c3a95_2428x1814.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!w8-w!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff3e5da1-6cdf-40e2-a4da-8f8a1c1c3a95_2428x1814.png" width="1456" height="1088" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ff3e5da1-6cdf-40e2-a4da-8f8a1c1c3a95_2428x1814.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1088,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:639946,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!w8-w!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff3e5da1-6cdf-40e2-a4da-8f8a1c1c3a95_2428x1814.png 424w, https://substackcdn.com/image/fetch/$s_!w8-w!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff3e5da1-6cdf-40e2-a4da-8f8a1c1c3a95_2428x1814.png 848w, https://substackcdn.com/image/fetch/$s_!w8-w!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff3e5da1-6cdf-40e2-a4da-8f8a1c1c3a95_2428x1814.png 1272w, https://substackcdn.com/image/fetch/$s_!w8-w!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff3e5da1-6cdf-40e2-a4da-8f8a1c1c3a95_2428x1814.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Like being able to access over 450+ models from huggingface and know someone who would love to try? </p><p>Refer them to <a href="https://featherless.ai">featherless.ai</a> and both of you get the <strong>$10 OFF</strong> your next monthly bill!</p><p><strong>Refer 12</strong> of your friends and you can have a <strong>full year</strong> off our basic plan! (The discount stacks!)</p><p>Enjoy &#129303; your models with <a href="https://featherless.ai">featherless.ai</a> !</p><blockquote><p>Reminder: All our models do not log any of your messages prompt or completion &#128521;</p></blockquote>]]></content:encoded></item><item><title><![CDATA[🚀 Launching 🪶 Featherless.AI ]]></title><description><![CDATA[Run any &#129433; model from Hugging Face, instantly.]]></description><link>https://substack.recursal.ai/p/launching-featherlessai</link><guid isPermaLink="false">https://substack.recursal.ai/p/launching-featherlessai</guid><dc:creator><![CDATA[Eugene Cheah]]></dc:creator><pubDate>Mon, 24 Jun 2024 07:17:49 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!DODQ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50695813-0645-4727-a42c-beacb3ea84aa_1216x832.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!DODQ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50695813-0645-4727-a42c-beacb3ea84aa_1216x832.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!DODQ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50695813-0645-4727-a42c-beacb3ea84aa_1216x832.png 424w, https://substackcdn.com/image/fetch/$s_!DODQ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50695813-0645-4727-a42c-beacb3ea84aa_1216x832.png 848w, https://substackcdn.com/image/fetch/$s_!DODQ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50695813-0645-4727-a42c-beacb3ea84aa_1216x832.png 1272w, https://substackcdn.com/image/fetch/$s_!DODQ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50695813-0645-4727-a42c-beacb3ea84aa_1216x832.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!DODQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50695813-0645-4727-a42c-beacb3ea84aa_1216x832.png" width="471" height="322.2631578947368" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/50695813-0645-4727-a42c-beacb3ea84aa_1216x832.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:832,&quot;width&quot;:1216,&quot;resizeWidth&quot;:471,&quot;bytes&quot;:1423173,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!DODQ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50695813-0645-4727-a42c-beacb3ea84aa_1216x832.png 424w, https://substackcdn.com/image/fetch/$s_!DODQ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50695813-0645-4727-a42c-beacb3ea84aa_1216x832.png 848w, https://substackcdn.com/image/fetch/$s_!DODQ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50695813-0645-4727-a42c-beacb3ea84aa_1216x832.png 1272w, https://substackcdn.com/image/fetch/$s_!DODQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50695813-0645-4727-a42c-beacb3ea84aa_1216x832.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Featherless</figcaption></figure></div><p>Earlier this year, we took the world by storm when we announced that<a href="https://substack.recursal.ai/cp/143699561"> our Eagle model had beaten Meta&#8217;s Llama-2</a> while taking less training time, being the world&#8217;s most efficient model. </p><p>While Eagle still packs a powerful punch, and has been helping diverse use-cases from multi-lingual, to content moderation, gaming, and role-play, we&#8217;ve been working on something new, to bring our insights on efficiency to a much broader realm.</p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;d3479942-797f-49ec-95a0-94bd4081f821&quot;,&quot;duration&quot;:null}"></div><p>Just this Friday, we launched<a href="https://featherless.ai"> Featherless AI</a>, which enables <em>serverless</em> inference of <em>every </em>Llama-3 8B and 70B model on Hugging Face we grabbed our hands. </p><p>That&#8217;s over 475 models. With many more being added daily.</p><p>Allowing anyone to quickly experiment, try, and choose the latest and best models, from huggingface. Starting from $10 / month.</p><p>Previously, to use the even the smallest fine-tunes requires dedicated hardware, which translates to real hosting costs, whether you&#8217;re experimenting with a model or ramping up production use. This is a barrier to a host of use cases <em>particularly agents</em> where each step in the agent computation might benefit from a particular model.</p><p>The goal of featherless is to make every model on HuggingFace available serverless and with these Llama &amp; RWKV based models, we&#8217;re a big step of the way there. </p><p>With featherless, you can experiment with an entirely new range of models at completely different economics.</p><ul><li><p><a href="https://featherless.ai">Check out the site: featherless.ai</a></p></li><li><p><a href="https://twitter.com/picocreator/status/1804197546472149117">Retweet our launch tweet</a>, </p></li><li><p><a href="https://www.producthunt.com/posts/featherless-llm">Support our product hunt launch</a></p></li></ul>]]></content:encoded></item><item><title><![CDATA[Featherless: an introduction]]></title><description><![CDATA[making every hugging face model available for inference and why it matters]]></description><link>https://substack.recursal.ai/p/featherless-an-introduction</link><guid isPermaLink="false">https://substack.recursal.ai/p/featherless-an-introduction</guid><dc:creator><![CDATA[Wesley George]]></dc:creator><pubDate>Fri, 31 May 2024 20:00:00 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/39692c80-a092-44cd-8fb3-840a3dbdedea_741x661.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h1>There&#8217;s a custom model for that</h1><p>There are more than 100,000 distinct language models on <a href="https://huggingface.co/models?pipeline_tag=text-generation&amp;sort=trending">the hugging face hub</a>.</p><p>This is the output of an enormous amount of creative energy: built by over 10k AI enthusiasts, these models include impressive attempts to improve upon the best known language models like ChatGPT.</p><p>A lot of airtime goes to innovations on <em>technical</em> elements of language models (e.g. context length). And while important, that a huge part of what left with a collection there are a great many of domain specific LLMs like</p><ul><li><p>for specific languages (e.g. for <a href="https://huggingface.co/VAGOsolutions/Llama-3-SauerkrautLM-8b-Instruct">German</a>, <a href="https://huggingface.co/IlyaGusev/saiga_llama3_8b">Russian</a> or <a href="https://huggingface.co/shenzhi-wang/Llama3-8B-Chinese-Chat">Chinese</a>)</p></li><li><p>for creative-writing (e.g tdrussell&#8217;s <a href="https://huggingface.co/tdrussell/Llama-3-70B-Instruct-Storywriter">Llama-3-70b-Instruct-Storywriter</a>)</p></li><li><p>with detailed medical knowledge (e.g. <a href="https://huggingface.co/BioMistral/BioMistral-7B">BioMistral/BioMistral-7B</a>)</p></li><li><p>can understand SEC filings, (e.g. <a href="https://huggingface.co/arcee-ai/Llama-3-SEC-Base">arcee-ai/Llama-3-SEC</a>)</p></li><li><p>legal (e.g. <a href="https://huggingface.co/umarbutler/open-australian-legal-llm">umarbutler/open-australian-legal-llm</a> - a model trained on a dataset of Australian Law curated by the Australian Attorney General&#8217;s office!)</p></li><li><p>novelty / character (e.g. <a href="https://huggingface.co/failspy/Llama-3-8B-Instruct-MopeyMule">MopeyMule</a>)</p></li></ul><h1>but it&#8217;s hard to use</h1><p>So how do you use these things?</p><p>Despite that HuggingFace is the defacto place to <em>host </em>models, you&#8217;re hard pressed to <em><strong>use</strong></em> them there. If you&#8217;ve spent time on the site, you may have forgotten that there is a specific part of the model card designed to let you test the model: it&#8217;s typically disabled for models 8B and up, which is the vast majority of models.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!3J4g!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6053efd7-9a2a-4390-b141-a3fa64040c53_649x163.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!3J4g!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6053efd7-9a2a-4390-b141-a3fa64040c53_649x163.png 424w, https://substackcdn.com/image/fetch/$s_!3J4g!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6053efd7-9a2a-4390-b141-a3fa64040c53_649x163.png 848w, https://substackcdn.com/image/fetch/$s_!3J4g!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6053efd7-9a2a-4390-b141-a3fa64040c53_649x163.png 1272w, https://substackcdn.com/image/fetch/$s_!3J4g!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6053efd7-9a2a-4390-b141-a3fa64040c53_649x163.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!3J4g!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6053efd7-9a2a-4390-b141-a3fa64040c53_649x163.png" width="649" height="163" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6053efd7-9a2a-4390-b141-a3fa64040c53_649x163.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:163,&quot;width&quot;:649,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:17213,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!3J4g!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6053efd7-9a2a-4390-b141-a3fa64040c53_649x163.png 424w, https://substackcdn.com/image/fetch/$s_!3J4g!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6053efd7-9a2a-4390-b141-a3fa64040c53_649x163.png 848w, https://substackcdn.com/image/fetch/$s_!3J4g!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6053efd7-9a2a-4390-b141-a3fa64040c53_649x163.png 1272w, https://substackcdn.com/image/fetch/$s_!3J4g!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6053efd7-9a2a-4390-b141-a3fa64040c53_649x163.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p>This kind of UX is a fact of that running these models requires operating expensive hardware (i.e. GPUs). You can rent these GPUs, but you&#8217;re looking at at least $2 / hour, and that would only cover you for the smaller models.</p><p>If you have a budget to experiment with, you can try launching the model on a dedicated service. However this will also require your patience; the most natural service to do this is HuggingFace&#8217;s inference endpoints service. Which I haven&#8217;t gotten it to work, despite the suggestion I should be able to launch a model in a few clicks.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!eMSC!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35067964-2224-4610-b606-8d011e821c9f_621x132.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!eMSC!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35067964-2224-4610-b606-8d011e821c9f_621x132.png 424w, https://substackcdn.com/image/fetch/$s_!eMSC!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35067964-2224-4610-b606-8d011e821c9f_621x132.png 848w, https://substackcdn.com/image/fetch/$s_!eMSC!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35067964-2224-4610-b606-8d011e821c9f_621x132.png 1272w, https://substackcdn.com/image/fetch/$s_!eMSC!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35067964-2224-4610-b606-8d011e821c9f_621x132.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!eMSC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35067964-2224-4610-b606-8d011e821c9f_621x132.png" width="621" height="132" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/35067964-2224-4610-b606-8d011e821c9f_621x132.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:132,&quot;width&quot;:621,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:17896,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!eMSC!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35067964-2224-4610-b606-8d011e821c9f_621x132.png 424w, https://substackcdn.com/image/fetch/$s_!eMSC!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35067964-2224-4610-b606-8d011e821c9f_621x132.png 848w, https://substackcdn.com/image/fetch/$s_!eMSC!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35067964-2224-4610-b606-8d011e821c9f_621x132.png 1272w, https://substackcdn.com/image/fetch/$s_!eMSC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35067964-2224-4610-b606-8d011e821c9f_621x132.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p><a href="http://openrouter.ai">Openrouter</a> is probably the cloud service with the most options and offers per-token pricing. but it doesn&#8217;t have any of the models listed above. Nor does it let you bring your own model.</p><p>You can try and run it locally (and there are a host of tools that have significantly simplify the process - <a href="https://www.nomic.ai/gpt4all">gpt4all</a>, <a href="http://ollama.com">ollama</a> and <a href="http://cortex.so">cortex</a> are the more popular. But they still require technical orientation, patience, and, most importantly, powerful computing hardware.</p><h1>Experiment Faster with Featherless</h1><p>The goal of featherless is to make every LLM on hugging face available serverlessly. Right now, our collection is up to 1,501 models, making it the largest collection of models available for inference from any inference provider.</p><p>You can test any of the models on the site, but we&#8217;re expecting that you&#8217;ll plug this directly into a client, whether that&#8217;s to chat with it as a human e.g. via <a href="https://www.typingmind.com/">Typing Mind</a>, <a href="http://jan.ai">Jan</a>, or <a href="http://sillytavern.app">Silly Tavern</a>, or you&#8217;ll use the API directly, e.g. in a raw python program, or in some higher-level framework like <a href="https://www.langchain.com/">Lang Chain</a> or <a href="https://www.llamaindex.ai/">Llama Index</a></p><p>Checkout our <a href="http://Terms">terms of service</a> and <a href="https://featherless.ai/privacy">privacy policy.</a> </p>]]></content:encoded></item><item><title><![CDATA[🦅 EagleX v1 : Soaring past LLaMA 7B 2T in both English and Multi-lang evals (RWKV-v5)]]></title><description><![CDATA[A linear transformer has just cross the gold standard in transformer models, LLaMA 7B, with less tokens trained in both English and multi-lingual evals. A historical first.]]></description><link>https://substack.recursal.ai/p/eaglex-17t-soaring-past-llama-7b</link><guid isPermaLink="false">https://substack.recursal.ai/p/eaglex-17t-soaring-past-llama-7b</guid><dc:creator><![CDATA[Eugene Cheah]]></dc:creator><pubDate>Sat, 16 Mar 2024 08:33:56 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!nmho!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10cf7fd1-6c72-4a99-84c2-794fb7bc52b3_2432x1664.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!nmho!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10cf7fd1-6c72-4a99-84c2-794fb7bc52b3_2432x1664.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!nmho!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10cf7fd1-6c72-4a99-84c2-794fb7bc52b3_2432x1664.png 424w, https://substackcdn.com/image/fetch/$s_!nmho!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10cf7fd1-6c72-4a99-84c2-794fb7bc52b3_2432x1664.png 848w, https://substackcdn.com/image/fetch/$s_!nmho!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10cf7fd1-6c72-4a99-84c2-794fb7bc52b3_2432x1664.png 1272w, https://substackcdn.com/image/fetch/$s_!nmho!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10cf7fd1-6c72-4a99-84c2-794fb7bc52b3_2432x1664.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!nmho!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10cf7fd1-6c72-4a99-84c2-794fb7bc52b3_2432x1664.png" width="1200" height="820.8791208791209" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/10cf7fd1-6c72-4a99-84c2-794fb7bc52b3_2432x1664.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;large&quot;,&quot;height&quot;:996,&quot;width&quot;:1456,&quot;resizeWidth&quot;:1200,&quot;bytes&quot;:5727069,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-large" alt="" srcset="https://substackcdn.com/image/fetch/$s_!nmho!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10cf7fd1-6c72-4a99-84c2-794fb7bc52b3_2432x1664.png 424w, https://substackcdn.com/image/fetch/$s_!nmho!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10cf7fd1-6c72-4a99-84c2-794fb7bc52b3_2432x1664.png 848w, https://substackcdn.com/image/fetch/$s_!nmho!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10cf7fd1-6c72-4a99-84c2-794fb7bc52b3_2432x1664.png 1272w, https://substackcdn.com/image/fetch/$s_!nmho!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10cf7fd1-6c72-4a99-84c2-794fb7bc52b3_2432x1664.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">An eagle, flying past llama</figcaption></figure></div><blockquote><p>If you are fine-tuning, we recommend waiting for the full EagleX 2T model coming out later this month instead, unless you are doing so for research purpose. <br><br>This model is released for research purposes, as it represents the major checkpoint that surpasses LLaMA2 7B, as part of our current training to 2T tokens and beyond.</p></blockquote><h1>EagleX 1.7T - in short</h1><p>EagleX 1.7T is a early research release of our 7.52B parameter model training that:</p><ul><li><p>Is part of a larger 2T model training</p></li><li><p>Is built on the <a href="https://wiki.rwkv.com">RWKV-v5 architecture</a><br>(a linear transformer with 10-100x+ lower inference cost)</p></li><li><p><a href="https://blog.rwkv.com/p/eagle-7b-soaring-past-transformers">Is continuation based on the original Eagle 7B model</a></p></li><li><p><a href="https://blog.rwkv.com/p/the-worlds-greenest-ai-model-rwkvs">Ranks as the world&#8217;s greenest 7B model (per token)</a></p></li><li><p>Trained on 1.7 Trillion tokens across 100+ languages</p></li><li><p>Outperforms all 7B class models in multi-lingual benchmarks</p></li><li><p>Passes LLaMA2 (2T) in multiple English evals, approaches Mistral (&gt;2T?)</p></li><li><p><a href="https://www.isattentionallyouneed.com/">All while being an &#8220;Attention-Free Transformer&#8221;</a></p></li></ul><p>We are releasing RWKV-v5 EagleX 1.7T, <a href="https://blog.rwkv.com/p/rwkv-joins-the-linux-foundation-as">licensed under Apache 2.0</a>, which can be used personally or commercially without restrictions. </p><ul><li><p><a href="https://huggingface.co/recursal/EagleX_1-7T">Download from HuggingFace </a></p></li><li><p>Try it online today on </p><ul><li><p><a href="https://huggingface.co/spaces/recursal/EagleX-7B-1.7T-Gradio-Demo">our hugging face</a></p></li><li><p><a href="https://recursal.ai/">our new cloud platform</a></p></li></ul></li><li><p>Use our reference <a href="https://pypi.org/project/rwkv/">pip inference package</a>, or any other community inference options (<a href="https://github.com/josStorer/RWKV-Runner">Desktop App</a>, <a href="https://github.com/saharNooby/rwkv.cpp">RWKV.cpp</a>, <a href="https://wiki.rwkv.com/basic/play.html">etc</a>) , and use it anywhere (even locally)</p></li><li><p><a href="https://github.com/RWKV/RWKV-infctx-trainer">Fine-tune using our Infctx trainer</a></p></li><li><p><a href="https://github.com/huggingface/transformers/pull/26963">[Pending PR] Get support merged into Huggingface transformers!</a></p></li><li><p><a href="https://docs.google.com/spreadsheets/d/1PFELH3u8yQlr-bGs9D5lBYXCXqSFZw2O0vfW084jbgI/edit?usp=sharing">All eval data can be found in the google sheet here</a></p></li></ul><h1>What does it mean to fly past LLaMA 7B?</h1><p>It is a definitely a very big claim to say you have caught up and pass the &#8220;Gold Standard&#8221; of the 7B weight class from scratch, which nearly every other major open access model is built on (allegedly even Mistral). Even more so given that this is done with a comparatively lower dataset token count of 1.7 trillion token (vs. 2 trillion tokens).</p><h1>Going big on eval data</h1><p>As this is a entirely different model, trained from scratch, there will be evals that we win and we lose, which we are fully transparent about, in showing how we are ahead of LLaMA 7B on average.</p><p>Instead of simply cherry picking 14 different evals which we won and calling it a day with a victory, we ran ALL the benchmarks in EleutherAI `<a href="https://github.com/EleutherAI/lm-evaluation-harness">lm-eval-harness</a>`, at commit `f78e2da` that we could do, with the following limitations:</p><ul><li><p>It has to complete in under 30 minutes on 8x4090 (we were running lots of evals)</p><ul><li><p>This rules out some of the rather more expensive long chain of thought evals</p></li></ul></li><li><p>We excluded all the personality / alignment evals</p></li><li><p>Eval has to be executable across a wide variety of models, via lm-eval-harness</p></li><li><p>All evals are 0 shot (no 5 shot-ing an MCQ question)</p></li><li><p>We limited comparison to other models within the 7B weight class</p></li></ul><p>These resulted into running 60+ major eval groups, which generated over 1,000+ data points per model. A data point count so high, that we had to drop standard error deviations, just to ensure the raw CSV file can be loaded in MacOS numbers.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!mRg-!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2550c6fd-9cb6-4cf2-a379-279dc4c1e2f4_3612x564.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!mRg-!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2550c6fd-9cb6-4cf2-a379-279dc4c1e2f4_3612x564.png 424w, https://substackcdn.com/image/fetch/$s_!mRg-!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2550c6fd-9cb6-4cf2-a379-279dc4c1e2f4_3612x564.png 848w, https://substackcdn.com/image/fetch/$s_!mRg-!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2550c6fd-9cb6-4cf2-a379-279dc4c1e2f4_3612x564.png 1272w, https://substackcdn.com/image/fetch/$s_!mRg-!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2550c6fd-9cb6-4cf2-a379-279dc4c1e2f4_3612x564.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!mRg-!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2550c6fd-9cb6-4cf2-a379-279dc4c1e2f4_3612x564.png" width="1200" height="187.0879120879121" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2550c6fd-9cb6-4cf2-a379-279dc4c1e2f4_3612x564.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;large&quot;,&quot;height&quot;:227,&quot;width&quot;:1456,&quot;resizeWidth&quot;:1200,&quot;bytes&quot;:240303,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-large" alt="" srcset="https://substackcdn.com/image/fetch/$s_!mRg-!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2550c6fd-9cb6-4cf2-a379-279dc4c1e2f4_3612x564.png 424w, https://substackcdn.com/image/fetch/$s_!mRg-!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2550c6fd-9cb6-4cf2-a379-279dc4c1e2f4_3612x564.png 848w, https://substackcdn.com/image/fetch/$s_!mRg-!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2550c6fd-9cb6-4cf2-a379-279dc4c1e2f4_3612x564.png 1272w, https://substackcdn.com/image/fetch/$s_!mRg-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2550c6fd-9cb6-4cf2-a379-279dc4c1e2f4_3612x564.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">What it takes to fit 184 english eval data point onto the screen.</figcaption></figure></div><p>Whew, that&#8217;s a crazy number of data points to digest. Let me break it down to more digestible parts:</p><ul><li><p>English perplexity</p></li><li><p>Multi lingual performance</p></li><li><p>21 English Eval Focus</p></li><li><p>183 English Evals</p></li></ul><p>All data shown here is made available in the Google Sheet over here:</p><blockquote><p>We included explanations of what several of the evals mean, which you can keep in mind in future eval results you see (demystify what those numbers mean!)</p></blockquote><div><hr></div><h1>Improved English Perplexity </h1><p>We start with basics: Perplexity. This is the loss value against the test dataset (lower score = better), i.e. how good the model is with next token prediction.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!oxcX!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F484d5b6b-1e4f-41ba-902a-2577020e6e87_2512x893.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!oxcX!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F484d5b6b-1e4f-41ba-902a-2577020e6e87_2512x893.png 424w, https://substackcdn.com/image/fetch/$s_!oxcX!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F484d5b6b-1e4f-41ba-902a-2577020e6e87_2512x893.png 848w, https://substackcdn.com/image/fetch/$s_!oxcX!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F484d5b6b-1e4f-41ba-902a-2577020e6e87_2512x893.png 1272w, https://substackcdn.com/image/fetch/$s_!oxcX!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F484d5b6b-1e4f-41ba-902a-2577020e6e87_2512x893.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!oxcX!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F484d5b6b-1e4f-41ba-902a-2577020e6e87_2512x893.png" width="1200" height="426.9230769230769" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/484d5b6b-1e4f-41ba-902a-2577020e6e87_2512x893.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;large&quot;,&quot;height&quot;:518,&quot;width&quot;:1456,&quot;resizeWidth&quot;:1200,&quot;bytes&quot;:318875,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-large" alt="" srcset="https://substackcdn.com/image/fetch/$s_!oxcX!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F484d5b6b-1e4f-41ba-902a-2577020e6e87_2512x893.png 424w, https://substackcdn.com/image/fetch/$s_!oxcX!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F484d5b6b-1e4f-41ba-902a-2577020e6e87_2512x893.png 848w, https://substackcdn.com/image/fetch/$s_!oxcX!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F484d5b6b-1e4f-41ba-902a-2577020e6e87_2512x893.png 1272w, https://substackcdn.com/image/fetch/$s_!oxcX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F484d5b6b-1e4f-41ba-902a-2577020e6e87_2512x893.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>In general, with the perplexity improvements, the EagleX model outperforms LLaMA2-7b, ranking between Falcom/LLaMA2-7b and Mistral.</p><div class="pullquote"><p><strong>Why do experts care about perplexity?</strong><br>Eval in general can be very subjective, and opinion driven, and commonly gives mixed results. Perplexity in a way gives the TLDR summary for most experts to start with</p></div><h1>Leading Multi-lang Perplexity &amp; evals</h1><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!tlOH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c7a3a64-3ea4-424b-9245-4b5a84608b71_2648x622.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!tlOH!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c7a3a64-3ea4-424b-9245-4b5a84608b71_2648x622.png 424w, https://substackcdn.com/image/fetch/$s_!tlOH!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c7a3a64-3ea4-424b-9245-4b5a84608b71_2648x622.png 848w, https://substackcdn.com/image/fetch/$s_!tlOH!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c7a3a64-3ea4-424b-9245-4b5a84608b71_2648x622.png 1272w, https://substackcdn.com/image/fetch/$s_!tlOH!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c7a3a64-3ea4-424b-9245-4b5a84608b71_2648x622.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!tlOH!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c7a3a64-3ea4-424b-9245-4b5a84608b71_2648x622.png" width="1200" height="281.86813186813185" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8c7a3a64-3ea4-424b-9245-4b5a84608b71_2648x622.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;large&quot;,&quot;height&quot;:342,&quot;width&quot;:1456,&quot;resizeWidth&quot;:1200,&quot;bytes&quot;:269234,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-large" alt="" srcset="https://substackcdn.com/image/fetch/$s_!tlOH!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c7a3a64-3ea4-424b-9245-4b5a84608b71_2648x622.png 424w, https://substackcdn.com/image/fetch/$s_!tlOH!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c7a3a64-3ea4-424b-9245-4b5a84608b71_2648x622.png 848w, https://substackcdn.com/image/fetch/$s_!tlOH!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c7a3a64-3ea4-424b-9245-4b5a84608b71_2648x622.png 1272w, https://substackcdn.com/image/fetch/$s_!tlOH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c7a3a64-3ea4-424b-9245-4b5a84608b71_2648x622.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!mE6U!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52a6d947-e984-4e09-beff-6437cb55b87a_2559x893.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!mE6U!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52a6d947-e984-4e09-beff-6437cb55b87a_2559x893.png 424w, https://substackcdn.com/image/fetch/$s_!mE6U!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52a6d947-e984-4e09-beff-6437cb55b87a_2559x893.png 848w, https://substackcdn.com/image/fetch/$s_!mE6U!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52a6d947-e984-4e09-beff-6437cb55b87a_2559x893.png 1272w, https://substackcdn.com/image/fetch/$s_!mE6U!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52a6d947-e984-4e09-beff-6437cb55b87a_2559x893.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!mE6U!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52a6d947-e984-4e09-beff-6437cb55b87a_2559x893.png" width="1200" height="418.68131868131866" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/52a6d947-e984-4e09-beff-6437cb55b87a_2559x893.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;large&quot;,&quot;height&quot;:508,&quot;width&quot;:1456,&quot;resizeWidth&quot;:1200,&quot;bytes&quot;:340434,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-large" alt="" srcset="https://substackcdn.com/image/fetch/$s_!mE6U!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52a6d947-e984-4e09-beff-6437cb55b87a_2559x893.png 424w, https://substackcdn.com/image/fetch/$s_!mE6U!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52a6d947-e984-4e09-beff-6437cb55b87a_2559x893.png 848w, https://substackcdn.com/image/fetch/$s_!mE6U!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52a6d947-e984-4e09-beff-6437cb55b87a_2559x893.png 1272w, https://substackcdn.com/image/fetch/$s_!mE6U!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52a6d947-e984-4e09-beff-6437cb55b87a_2559x893.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>EagleX maintains the lead for best in class multi-lingual performance, with the incremental improvements we&#8217;re making to the Eagle line of models.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!FE7f!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9170f5fa-4616-4507-89ec-122e50178116_1739x519.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!FE7f!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9170f5fa-4616-4507-89ec-122e50178116_1739x519.png 424w, https://substackcdn.com/image/fetch/$s_!FE7f!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9170f5fa-4616-4507-89ec-122e50178116_1739x519.png 848w, https://substackcdn.com/image/fetch/$s_!FE7f!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9170f5fa-4616-4507-89ec-122e50178116_1739x519.png 1272w, https://substackcdn.com/image/fetch/$s_!FE7f!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9170f5fa-4616-4507-89ec-122e50178116_1739x519.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!FE7f!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9170f5fa-4616-4507-89ec-122e50178116_1739x519.png" width="1200" height="358.5164835164835" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9170f5fa-4616-4507-89ec-122e50178116_1739x519.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;large&quot;,&quot;height&quot;:435,&quot;width&quot;:1456,&quot;resizeWidth&quot;:1200,&quot;bytes&quot;:101919,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-large" alt="" srcset="https://substackcdn.com/image/fetch/$s_!FE7f!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9170f5fa-4616-4507-89ec-122e50178116_1739x519.png 424w, https://substackcdn.com/image/fetch/$s_!FE7f!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9170f5fa-4616-4507-89ec-122e50178116_1739x519.png 848w, https://substackcdn.com/image/fetch/$s_!FE7f!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9170f5fa-4616-4507-89ec-122e50178116_1739x519.png 1272w, https://substackcdn.com/image/fetch/$s_!FE7f!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9170f5fa-4616-4507-89ec-122e50178116_1739x519.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Most of the tasks here are common sense reasoning tests of wide variety of formats, across languages including <a href="https://blog.rwkv.com/i/141130059/multi-lingual-performance-details">23 of the world&#8217;s most widely used languages.</a></p><p>For the remaining languages, we urge the community to test and judge it themselves, over a 100+ languages was trained. Over time, we would want more languages to be added into evals.</p><div class="pullquote"><p><strong>Why is multi-lingual perf important? </strong><br>The goal of the RWKV project &amp; Eagle line of models, is to build <strong>inclusive</strong> AI for everyone regardless of their language. Our mission is to build AI models not just made for English, but also for the 83% of the world&#8217;s population using a non-English language everyday.</p></div><h1>21 English Evals</h1><p>Nevertheless, English is still important. We reduced the evals down to 21 of the argubly most popular English evals, such as Lambada, Glue, Swag, Winogrande, TruthfulQA, MMLU:</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!jgJp!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0331cd9-82f2-46eb-8544-c07cb8b08bb3_2817x469.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!jgJp!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0331cd9-82f2-46eb-8544-c07cb8b08bb3_2817x469.png 424w, https://substackcdn.com/image/fetch/$s_!jgJp!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0331cd9-82f2-46eb-8544-c07cb8b08bb3_2817x469.png 848w, https://substackcdn.com/image/fetch/$s_!jgJp!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0331cd9-82f2-46eb-8544-c07cb8b08bb3_2817x469.png 1272w, https://substackcdn.com/image/fetch/$s_!jgJp!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0331cd9-82f2-46eb-8544-c07cb8b08bb3_2817x469.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!jgJp!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0331cd9-82f2-46eb-8544-c07cb8b08bb3_2817x469.png" width="1200" height="199.45054945054946" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f0331cd9-82f2-46eb-8544-c07cb8b08bb3_2817x469.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;large&quot;,&quot;height&quot;:242,&quot;width&quot;:1456,&quot;resizeWidth&quot;:1200,&quot;bytes&quot;:319914,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-large" alt="" srcset="https://substackcdn.com/image/fetch/$s_!jgJp!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0331cd9-82f2-46eb-8544-c07cb8b08bb3_2817x469.png 424w, https://substackcdn.com/image/fetch/$s_!jgJp!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0331cd9-82f2-46eb-8544-c07cb8b08bb3_2817x469.png 848w, https://substackcdn.com/image/fetch/$s_!jgJp!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0331cd9-82f2-46eb-8544-c07cb8b08bb3_2817x469.png 1272w, https://substackcdn.com/image/fetch/$s_!jgJp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0331cd9-82f2-46eb-8544-c07cb8b08bb3_2817x469.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>Narrowing it down to the 4 models that most of us actually care about - LLaMA, Mistral, EagleX and Eagle-7b - the new EagleX model outperforms LLaMA-2-7b on average across the 21 evals, and lags not far behind Mistral.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!vSm3!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1f87f388-2250-4dc9-aa60-6c5e5bdc86e6_1119x407.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vSm3!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1f87f388-2250-4dc9-aa60-6c5e5bdc86e6_1119x407.png 424w, https://substackcdn.com/image/fetch/$s_!vSm3!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1f87f388-2250-4dc9-aa60-6c5e5bdc86e6_1119x407.png 848w, https://substackcdn.com/image/fetch/$s_!vSm3!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1f87f388-2250-4dc9-aa60-6c5e5bdc86e6_1119x407.png 1272w, https://substackcdn.com/image/fetch/$s_!vSm3!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1f87f388-2250-4dc9-aa60-6c5e5bdc86e6_1119x407.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!vSm3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1f87f388-2250-4dc9-aa60-6c5e5bdc86e6_1119x407.png" width="1119" height="407" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1f87f388-2250-4dc9-aa60-6c5e5bdc86e6_1119x407.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:407,&quot;width&quot;:1119,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:68054,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!vSm3!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1f87f388-2250-4dc9-aa60-6c5e5bdc86e6_1119x407.png 424w, https://substackcdn.com/image/fetch/$s_!vSm3!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1f87f388-2250-4dc9-aa60-6c5e5bdc86e6_1119x407.png 848w, https://substackcdn.com/image/fetch/$s_!vSm3!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1f87f388-2250-4dc9-aa60-6c5e5bdc86e6_1119x407.png 1272w, https://substackcdn.com/image/fetch/$s_!vSm3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1f87f388-2250-4dc9-aa60-6c5e5bdc86e6_1119x407.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Keep in mind that this average shown, is across all 21 evals</figcaption></figure></div><div><hr></div><h4><strong>The Good</strong></h4><p>Now, let&#8217;s look at where our model is blowing the rest of the models out of the water.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!CLIM!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F478ded26-6adf-4b2d-a245-e96216b6a598_2478x666.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!CLIM!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F478ded26-6adf-4b2d-a245-e96216b6a598_2478x666.png 424w, https://substackcdn.com/image/fetch/$s_!CLIM!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F478ded26-6adf-4b2d-a245-e96216b6a598_2478x666.png 848w, https://substackcdn.com/image/fetch/$s_!CLIM!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F478ded26-6adf-4b2d-a245-e96216b6a598_2478x666.png 1272w, https://substackcdn.com/image/fetch/$s_!CLIM!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F478ded26-6adf-4b2d-a245-e96216b6a598_2478x666.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!CLIM!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F478ded26-6adf-4b2d-a245-e96216b6a598_2478x666.png" width="1200" height="322.25274725274727" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/478ded26-6adf-4b2d-a245-e96216b6a598_2478x666.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;large&quot;,&quot;height&quot;:391,&quot;width&quot;:1456,&quot;resizeWidth&quot;:1200,&quot;bytes&quot;:311417,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-large" alt="" srcset="https://substackcdn.com/image/fetch/$s_!CLIM!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F478ded26-6adf-4b2d-a245-e96216b6a598_2478x666.png 424w, https://substackcdn.com/image/fetch/$s_!CLIM!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F478ded26-6adf-4b2d-a245-e96216b6a598_2478x666.png 848w, https://substackcdn.com/image/fetch/$s_!CLIM!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F478ded26-6adf-4b2d-a245-e96216b6a598_2478x666.png 1272w, https://substackcdn.com/image/fetch/$s_!CLIM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F478ded26-6adf-4b2d-a245-e96216b6a598_2478x666.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>First, the big stand out is the first 6 evals, which even our small 1.7T trained model beats out even Mistral 2T++ trained model (sciq, glue, anli, mmnli, swag), across multiple tasks focused around either contextual based simple Q&amp;A with common sense reasoning, or deductive logic. EagleX also performs better than LLaMA-2-7b in wingrade and wnli evals, which also involves contextual common sense reasoning as well. This implies that the EagleX model would be applicable in RAG use cases, which are mainly contextual Q&amp;A, with the right prompt engineering.</p><p>Finally, for truthfulqa, while it outperforms LLaMA, but in my opinion, this is still indicative of how vulnerable all models are from learning common human misconceptions from the web, seeing how bad the scores are across all models.<br>(to be fair, this is hard for most humans as well)</p><blockquote><p>PS: The jump for glue/mnli was high enough, that we needed to check the dataset specifically for contamination. Which we were not be able to find any. This jump is currently being attributed to multiple training datasets, along with data augmented / machine rewritten instruct dataset following a similar structure.</p></blockquote><div class="pullquote"><p>Strong common sense reasoning over context, <br>has very strong applicable use cases for multiple RAG use cases</p></div><h4><strong>The Mixed</strong></h4><p>Next: the eval sets with mixed results. Here, we have very similar evals with 2 major variants. The results between EagleX and LLaMA are close enough, that it&#8217;s hard to say which model is clearly better between the two for these evals. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!P0Sa!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae663f4a-2522-4615-891b-44a018f02f1e_1651x554.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!P0Sa!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae663f4a-2522-4615-891b-44a018f02f1e_1651x554.png 424w, https://substackcdn.com/image/fetch/$s_!P0Sa!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae663f4a-2522-4615-891b-44a018f02f1e_1651x554.png 848w, https://substackcdn.com/image/fetch/$s_!P0Sa!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae663f4a-2522-4615-891b-44a018f02f1e_1651x554.png 1272w, https://substackcdn.com/image/fetch/$s_!P0Sa!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae663f4a-2522-4615-891b-44a018f02f1e_1651x554.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!P0Sa!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae663f4a-2522-4615-891b-44a018f02f1e_1651x554.png" width="1200" height="403.02197802197804" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ae663f4a-2522-4615-891b-44a018f02f1e_1651x554.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;large&quot;,&quot;height&quot;:489,&quot;width&quot;:1456,&quot;resizeWidth&quot;:1200,&quot;bytes&quot;:177411,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-large" alt="" srcset="https://substackcdn.com/image/fetch/$s_!P0Sa!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae663f4a-2522-4615-891b-44a018f02f1e_1651x554.png 424w, https://substackcdn.com/image/fetch/$s_!P0Sa!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae663f4a-2522-4615-891b-44a018f02f1e_1651x554.png 848w, https://substackcdn.com/image/fetch/$s_!P0Sa!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae663f4a-2522-4615-891b-44a018f02f1e_1651x554.png 1272w, https://substackcdn.com/image/fetch/$s_!P0Sa!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae663f4a-2522-4615-891b-44a018f02f1e_1651x554.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>What&#8217;s interesting, is that even though logiqa can be seen as form of &#8220;common sense&#8221; reasoning test, the EagleX model scored much lower compared to the 6 evals (sciq, glue, anli, mmnli, swag). This could mean that while the model is better at reasoning given a context, but it lacks the depth of knowledge compared to other models with more token training.</p><div><hr></div><h4><strong>The &#8220;Not too bad&#8220; and the &#8220;Really Bad&#8221;</strong></h4><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!p9de!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c8533a9-e578-45c7-9b4b-7cb081d14483_2555x721.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!p9de!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c8533a9-e578-45c7-9b4b-7cb081d14483_2555x721.png 424w, https://substackcdn.com/image/fetch/$s_!p9de!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c8533a9-e578-45c7-9b4b-7cb081d14483_2555x721.png 848w, https://substackcdn.com/image/fetch/$s_!p9de!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c8533a9-e578-45c7-9b4b-7cb081d14483_2555x721.png 1272w, https://substackcdn.com/image/fetch/$s_!p9de!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c8533a9-e578-45c7-9b4b-7cb081d14483_2555x721.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!p9de!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c8533a9-e578-45c7-9b4b-7cb081d14483_2555x721.png" width="1200" height="338.7362637362637" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0c8533a9-e578-45c7-9b4b-7cb081d14483_2555x721.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;large&quot;,&quot;height&quot;:411,&quot;width&quot;:1456,&quot;resizeWidth&quot;:1200,&quot;bytes&quot;:306187,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-large" alt="" srcset="https://substackcdn.com/image/fetch/$s_!p9de!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c8533a9-e578-45c7-9b4b-7cb081d14483_2555x721.png 424w, https://substackcdn.com/image/fetch/$s_!p9de!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c8533a9-e578-45c7-9b4b-7cb081d14483_2555x721.png 848w, https://substackcdn.com/image/fetch/$s_!p9de!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c8533a9-e578-45c7-9b4b-7cb081d14483_2555x721.png 1272w, https://substackcdn.com/image/fetch/$s_!p9de!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c8533a9-e578-45c7-9b4b-7cb081d14483_2555x721.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>These are the evals the EagleX performs worse on compared to both Mistral and LLaMA. However, for the evals that we&#8217;ve lost to LLaMA, it&#8217;s by a narrow margin. But we&#8217;ll be keeping track of these as we train past 2T tokens.</p><p>Let&#8217;s look what went really badly: Math.</p><p>The results for arithmetic eval sank drastically, like a rock, even compared to our original Eagle model.</p><p>What went wrong?</p><p>We dug through the dataset we used for training, and realized we missed out the entire math dataset (along with a few others) due to an error. Oops. </p><p>This emphasize the importance of maintaining the dataset composition over the training run. We&#8217;re adding math back for future runs.</p><blockquote><p>We expect overall math score to rise back up as the training continue, however realistically IMO - no one should be depending on a 7B model for math (just saying)</p></blockquote><div><hr></div><h2>183 English Evals</h2><p>We do not simply want to cherry pick 9 or 21 evals and claim victory over LLaMA, or even Mistral. So, let&#8217;s zoom out, and look at it holistically across 183 English evals.</p><p><a href="https://docs.google.com/spreadsheets/d/1PFELH3u8yQlr-bGs9D5lBYXCXqSFZw2O0vfW084jbgI/edit?usp=sharing">You can view the full results here</a></p><p>Although using the overall averages across all the evals does have a bias the results towards larger eval sets (due to double counting, e.g. mmlu overall and many indivudall mmlu test), it does not change the ranking among the EagleX, Mistral, LLaMA and the original Eagle models.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!nScm!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c7691a5-4e2d-473a-9c54-7e837b1ffb44_767x566.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!nScm!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c7691a5-4e2d-473a-9c54-7e837b1ffb44_767x566.png 424w, https://substackcdn.com/image/fetch/$s_!nScm!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c7691a5-4e2d-473a-9c54-7e837b1ffb44_767x566.png 848w, https://substackcdn.com/image/fetch/$s_!nScm!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c7691a5-4e2d-473a-9c54-7e837b1ffb44_767x566.png 1272w, https://substackcdn.com/image/fetch/$s_!nScm!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c7691a5-4e2d-473a-9c54-7e837b1ffb44_767x566.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!nScm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c7691a5-4e2d-473a-9c54-7e837b1ffb44_767x566.png" width="767" height="566" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3c7691a5-4e2d-473a-9c54-7e837b1ffb44_767x566.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:566,&quot;width&quot;:767,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:102952,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!nScm!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c7691a5-4e2d-473a-9c54-7e837b1ffb44_767x566.png 424w, https://substackcdn.com/image/fetch/$s_!nScm!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c7691a5-4e2d-473a-9c54-7e837b1ffb44_767x566.png 848w, https://substackcdn.com/image/fetch/$s_!nScm!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c7691a5-4e2d-473a-9c54-7e837b1ffb44_767x566.png 1272w, https://substackcdn.com/image/fetch/$s_!nScm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c7691a5-4e2d-473a-9c54-7e837b1ffb44_767x566.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>However these results is extremely useful for smaller insights, for example</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!2noV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ea7f07c-9224-4f58-9a5a-c8d8637332f0_1658x234.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!2noV!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ea7f07c-9224-4f58-9a5a-c8d8637332f0_1658x234.png 424w, https://substackcdn.com/image/fetch/$s_!2noV!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ea7f07c-9224-4f58-9a5a-c8d8637332f0_1658x234.png 848w, https://substackcdn.com/image/fetch/$s_!2noV!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ea7f07c-9224-4f58-9a5a-c8d8637332f0_1658x234.png 1272w, https://substackcdn.com/image/fetch/$s_!2noV!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ea7f07c-9224-4f58-9a5a-c8d8637332f0_1658x234.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!2noV!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ea7f07c-9224-4f58-9a5a-c8d8637332f0_1658x234.png" width="1200" height="168.95604395604394" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2ea7f07c-9224-4f58-9a5a-c8d8637332f0_1658x234.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;large&quot;,&quot;height&quot;:205,&quot;width&quot;:1456,&quot;resizeWidth&quot;:1200,&quot;bytes&quot;:69101,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-large" alt="" srcset="https://substackcdn.com/image/fetch/$s_!2noV!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ea7f07c-9224-4f58-9a5a-c8d8637332f0_1658x234.png 424w, https://substackcdn.com/image/fetch/$s_!2noV!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ea7f07c-9224-4f58-9a5a-c8d8637332f0_1658x234.png 848w, https://substackcdn.com/image/fetch/$s_!2noV!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ea7f07c-9224-4f58-9a5a-c8d8637332f0_1658x234.png 1272w, https://substackcdn.com/image/fetch/$s_!2noV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ea7f07c-9224-4f58-9a5a-c8d8637332f0_1658x234.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>The EagleX model lost to LLaMA2 when it comes to US history, but won in world history. This makes sense, given the broader approach we took to making the dataset from a more inclusive, more global view, instead of a US centric one.</p><p>The detailed insights will be used by our dataset team to iterate and improve on our future datasets.</p><div class="pullquote"><p>How the model answer, is a reflection of the dataset experiences it has learnt<br>How much resources the model consumes, is a reflection of its architecture</p></div><h1>Perhaps a good dataset + Scalable architecture: is all you need?</h1><p>One of the biggest change we did was to change the dataset for the current 1T tokens, which now uses a cleaner filtered set of data with <strong>careful considerations to ensure  permissible licensed content sources used</strong>.</p><p>There are also huge implications on the fact, the model crossed the llama2 line earlier then the plan schedule. That either the architecture is more efficient in training, or that the improvements in dataset quality has a large impact in model performance.</p><p>The following is a summary of the dataset used, its public release will be made available next month after the current 2T training is completed.</p><pre><code>## 15% Code 

Contains code/programming related topics
- the-stack
- codeparrot
- devopedia
- mdn

## 15% Multi lang

Generally multi-lang webtext
- sea-lion (Singapore)
- madlad
- culturax
- multi lang wiki

## The giant soup

Creative content
- fandom (only sites with permissive licenses, and low spam)
- scp-foundation

Wikipedia
- Various Permissively licensed wikis.
- wikipedia

Papers:
- Mainly arxiv (Permissive Licenses) and pes2o

Books:
All the books contained in out train sets are public domains books.
- gutenberg, 
- standardebooks

Webtext
- webtext
- refinedweb (Note: This chunk made the model worse, we recommend against refinedweb in future trains)
- slimpajama
- europarl
- eurlex.
- stackexchange

Various
- aya (multilang convo)
- some system prompt, instruct
- long list of sub 100B training datasets on HF
- rewritten text !!! (splicing in, to replicate the rewritten web paper)</code></pre><div><hr></div><h1>Why does the architecture matter?</h1><p>We are over a 100x more scalable then the transformer architecture.<br>Transformers became the most prominent architecture in AI, not because it was the best, but it was the first to successfully scale to billion of parameters in training.</p><p>Till today</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!gCLY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c33b708-8c96-4179-8393-8274e86e7fcf_616x463.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!gCLY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c33b708-8c96-4179-8393-8274e86e7fcf_616x463.png 424w, https://substackcdn.com/image/fetch/$s_!gCLY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c33b708-8c96-4179-8393-8274e86e7fcf_616x463.png 848w, https://substackcdn.com/image/fetch/$s_!gCLY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c33b708-8c96-4179-8393-8274e86e7fcf_616x463.png 1272w, https://substackcdn.com/image/fetch/$s_!gCLY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c33b708-8c96-4179-8393-8274e86e7fcf_616x463.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!gCLY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c33b708-8c96-4179-8393-8274e86e7fcf_616x463.png" width="616" height="463" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4c33b708-8c96-4179-8393-8274e86e7fcf_616x463.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:463,&quot;width&quot;:616,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:61048,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!gCLY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c33b708-8c96-4179-8393-8274e86e7fcf_616x463.png 424w, https://substackcdn.com/image/fetch/$s_!gCLY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c33b708-8c96-4179-8393-8274e86e7fcf_616x463.png 848w, https://substackcdn.com/image/fetch/$s_!gCLY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c33b708-8c96-4179-8393-8274e86e7fcf_616x463.png 1272w, https://substackcdn.com/image/fetch/$s_!gCLY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c33b708-8c96-4179-8393-8274e86e7fcf_616x463.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">CUDA computational cost, for RWKV-based architecture vs transformer models - that quadratic-vs-linear really scales!</figcaption></figure></div><div><hr></div><h1>The Milestone</h1><p>In overall, the release of this model marks an important milestone and transition for many of us, within both the commercial team within Recursal AI, and the open source team in the RWKV group.</p><ul><li><p>Its the first major training done by the Recursal AI team, in partnership with AWS as our main compute provider</p></li><li><p>This model is being released under Apache 2 licensing</p></li><li><p>The fully trained 2T model will be released under the RWKV group, under the Linux Foundation</p></li><li><p>The first Non-Transformer Architecture to pass LLaMA2 in evals</p></li><li><p>The strongest Linear Transformer to date</p></li><li><p>Proof you can have both strong multi-lingual and english performance</p></li></ul><div><hr></div><h1>What&#8217;s next?</h1><p>Similar to the original Eagle 7B announcements, the following is the revised goals for the model training</p><ul><li><p>[April 2024] Completion of the 2T Eagle 7B models</p></li><li><p>[March-May 2024] Training of our v6 &#8220;Finch&#8221;line of models</p></li><li><p>[June 2024] v6 MoE model, for GPT 3.5 class performance</p></li></ul><blockquote><p>Disclaimer: All dates are approximate, and is heavily subjected to compute availability from our sponsors/compute-provider/investors</p></blockquote><h1>Want more?</h1><p>If you want find more about the RWKV opensource Project at</p><ul><li><p>Wiki: <a href="https://wiki.rwkv.com/">https://wiki.rwkv.com/</a></p></li><li><p>Discord: <a href="https://discord.gg/bDSBUMeFpc">https://discord.gg/bDSBUMeFpc</a></p></li></ul><p>If you like to try the model today, you can do so on our platform at <a href="https://recursal.ai">recursal.ai</a> - the best place host, run, and create finetunes of the Eagle line of RWKV models.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://substack.recursal.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Recursal AI development blog! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p>]]></content:encoded></item><item><title><![CDATA[Launching Eagle 7B - into our public demo, and open router (till March 2024)]]></title><description><![CDATA[Brining the worlds strongest multi-lingual model to the world]]></description><link>https://substack.recursal.ai/p/launching-eagle-7b-into-our-public</link><guid isPermaLink="false">https://substack.recursal.ai/p/launching-eagle-7b-into-our-public</guid><dc:creator><![CDATA[Eugene Cheah]]></dc:creator><pubDate>Tue, 30 Jan 2024 01:07:07 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/b5e6e8c0-3c3c-4c55-860e-3ce2855b4d0c_2832x1878.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!_zPg!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cd492e6-1516-4a76-9c37-82c0a344bc4a_1045x1622.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!_zPg!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cd492e6-1516-4a76-9c37-82c0a344bc4a_1045x1622.png 424w, https://substackcdn.com/image/fetch/$s_!_zPg!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cd492e6-1516-4a76-9c37-82c0a344bc4a_1045x1622.png 848w, https://substackcdn.com/image/fetch/$s_!_zPg!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cd492e6-1516-4a76-9c37-82c0a344bc4a_1045x1622.png 1272w, https://substackcdn.com/image/fetch/$s_!_zPg!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cd492e6-1516-4a76-9c37-82c0a344bc4a_1045x1622.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!_zPg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cd492e6-1516-4a76-9c37-82c0a344bc4a_1045x1622.png" width="1045" height="1622" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6cd492e6-1516-4a76-9c37-82c0a344bc4a_1045x1622.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1622,&quot;width&quot;:1045,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1937700,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!_zPg!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cd492e6-1516-4a76-9c37-82c0a344bc4a_1045x1622.png 424w, https://substackcdn.com/image/fetch/$s_!_zPg!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cd492e6-1516-4a76-9c37-82c0a344bc4a_1045x1622.png 848w, https://substackcdn.com/image/fetch/$s_!_zPg!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cd492e6-1516-4a76-9c37-82c0a344bc4a_1045x1622.png 1272w, https://substackcdn.com/image/fetch/$s_!_zPg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cd492e6-1516-4a76-9c37-82c0a344bc4a_1045x1622.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The Eagle-7B model launch has been a great success!<br>More details can be found here: <a href="https://substack.recursal.ai/cp/141146731">https://substack.recursal.ai/cp/141146731</a></p><p>As we are working behind the scenes for our cloud platform launch - we have decided to avoid an eagle-and-egg situation. And decided to provide our latest 7B model for free on both our chat demo and open router endpoints</p><ul><li><p><a href="https://openrouter.ai/models/recursal/eagle-7b">https://openrouter.ai/models/recursal/eagle-7b</a></p></li><li><p><a href="https://rwkv-demo-api.recursal.ai/">https://rwkv-demo-api.recursal.ai/</a></p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!vGXS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe74ba324-1390-4e96-ad84-5e2cad625652_2832x1878.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vGXS!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe74ba324-1390-4e96-ad84-5e2cad625652_2832x1878.png 424w, https://substackcdn.com/image/fetch/$s_!vGXS!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe74ba324-1390-4e96-ad84-5e2cad625652_2832x1878.png 848w, https://substackcdn.com/image/fetch/$s_!vGXS!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe74ba324-1390-4e96-ad84-5e2cad625652_2832x1878.png 1272w, https://substackcdn.com/image/fetch/$s_!vGXS!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe74ba324-1390-4e96-ad84-5e2cad625652_2832x1878.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!vGXS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe74ba324-1390-4e96-ad84-5e2cad625652_2832x1878.png" width="1456" height="966" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e74ba324-1390-4e96-ad84-5e2cad625652_2832x1878.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:966,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:466101,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!vGXS!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe74ba324-1390-4e96-ad84-5e2cad625652_2832x1878.png 424w, https://substackcdn.com/image/fetch/$s_!vGXS!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe74ba324-1390-4e96-ad84-5e2cad625652_2832x1878.png 848w, https://substackcdn.com/image/fetch/$s_!vGXS!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe74ba324-1390-4e96-ad84-5e2cad625652_2832x1878.png 1272w, https://substackcdn.com/image/fetch/$s_!vGXS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe74ba324-1390-4e96-ad84-5e2cad625652_2832x1878.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>These models will be provided for free, with rate limits until the launch of our cloud platform in March - stay tuned!</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://substack.recursal.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Recursal AI development blog! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Public RWKV 3B Models via OpenRouter]]></title><description><![CDATA[Our free RWKV 3B models, are now accessible with OpenRouter]]></description><link>https://substack.recursal.ai/p/public-rwkv-3b-model-via-openrouter</link><guid isPermaLink="false">https://substack.recursal.ai/p/public-rwkv-3b-model-via-openrouter</guid><dc:creator><![CDATA[Eugene Cheah]]></dc:creator><pubDate>Tue, 12 Dec 2023 20:36:17 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!d_Fa!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5d760c1-b117-4fe9-a254-adf51d434b5d_1307x1044.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>In collaboration with OpenRouter.ai - and the RWKV team here at Recursal.AI</p><p>We are proud to announce the public release of RWKV 3B models on open router</p><ul><li><p><a href="https://openrouter.ai/models/rwkv/rwkv-5-world-3b">RWKV v5 world 3B</a></p></li><li><p><a href="https://openrouter.ai/models/recursal/rwkv-5-3b-ai-town">RWKV v5 3B AI Town</a></p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!d_Fa!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5d760c1-b117-4fe9-a254-adf51d434b5d_1307x1044.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!d_Fa!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5d760c1-b117-4fe9-a254-adf51d434b5d_1307x1044.png 424w, https://substackcdn.com/image/fetch/$s_!d_Fa!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5d760c1-b117-4fe9-a254-adf51d434b5d_1307x1044.png 848w, https://substackcdn.com/image/fetch/$s_!d_Fa!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5d760c1-b117-4fe9-a254-adf51d434b5d_1307x1044.png 1272w, https://substackcdn.com/image/fetch/$s_!d_Fa!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5d760c1-b117-4fe9-a254-adf51d434b5d_1307x1044.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!d_Fa!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5d760c1-b117-4fe9-a254-adf51d434b5d_1307x1044.png" width="1307" height="1044" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d5d760c1-b117-4fe9-a254-adf51d434b5d_1307x1044.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1044,&quot;width&quot;:1307,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:242463,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!d_Fa!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5d760c1-b117-4fe9-a254-adf51d434b5d_1307x1044.png 424w, https://substackcdn.com/image/fetch/$s_!d_Fa!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5d760c1-b117-4fe9-a254-adf51d434b5d_1307x1044.png 848w, https://substackcdn.com/image/fetch/$s_!d_Fa!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5d760c1-b117-4fe9-a254-adf51d434b5d_1307x1044.png 1272w, https://substackcdn.com/image/fetch/$s_!d_Fa!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5d760c1-b117-4fe9-a254-adf51d434b5d_1307x1044.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>This is in line with our previous 2 public API release for RWKV world, and AI town respectively, </p><ul><li><p><a href="https://substack.recursal.ai/p/public-rwkv-v5-3b-models">Public RWKV v5 3B OpenAI endpoint</a></p></li></ul><ul><li><p><a href="https://substack.recursal.ai/p/dedicated-ai-town-server-is-up">Public AI town 3B OpenAI endpoint</a></p></li></ul><p>All of which to make it easier for you to switch existing experiments for testing.</p><div><hr></div><p>We hope, by giving such public access, we get to see more interesting and application built on RWKV. Ping @picocreator on twitter or RWKV discord, to show us what you built with it &#128521;</p><blockquote><p>Recursal.AI intend to keep the RWKV 3B models, open to the public, with some resonable anti-abuse / rate limits - till the end of 2024, as we explore means of optimizing our inference infrastructure at scale under load, for our commercial cloud platform.<br><br>Dedicated RWKV inference cloud service will be out soon.<br><br>Disclaimer: This API service is provided as it is, without any warranties or guarantees.</p></blockquote>]]></content:encoded></item></channel></rss>