<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[Featherless AI - recursive dev blog: Featherless Platform]]></title><description><![CDATA[Every LLM. Serverless.]]></description><link>https://substack.recursal.ai/s/featherless-ai</link><image><url>https://substackcdn.com/image/fetch/$s_!RY89!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F655b233b-f955-4a8e-b220-6e4f392736ef_160x160.png</url><title>Featherless AI - recursive dev blog: Featherless Platform</title><link>https://substack.recursal.ai/s/featherless-ai</link></image><generator>Substack</generator><lastBuildDate>Sun, 12 Apr 2026 08:53:41 GMT</lastBuildDate><atom:link href="https://substack.recursal.ai/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Recursal AI]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[featherless@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[featherless@substack.com]]></itunes:email><itunes:name><![CDATA[Featherless AI - dev blog]]></itunes:name></itunes:owner><itunes:author><![CDATA[Featherless AI - dev blog]]></itunes:author><googleplay:owner><![CDATA[featherless@substack.com]]></googleplay:owner><googleplay:email><![CDATA[featherless@substack.com]]></googleplay:email><googleplay:author><![CDATA[Featherless AI - dev blog]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[The world's most reliable AI agent that actually works - where Claude, Gemini, and o3 fail]]></title><description><![CDATA[That will do the boring chores in life, for 
you]]></description><link>https://substack.recursal.ai/p/the-worlds-most-reliable-ai-agent</link><guid isPermaLink="false">https://substack.recursal.ai/p/the-worlds-most-reliable-ai-agent</guid><dc:creator><![CDATA[Eugene Cheah]]></dc:creator><pubDate>Fri, 23 May 2025 22:20:13 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c943464-192f-4899-90b6-ec3e9fcd74cd_4128x2320.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div id="youtube2-hgXiwwMtDJg" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;hgXiwwMtDJg&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/hgXiwwMtDJg?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><p>Most companies and use cases do not care about PhD-level capabilities. 
They just want an AI model they can depend on with absolute reliability for the simple tasks in life.</p><p>Imagine a highly competent, dependable, reliable assistant who does all your chores, instead of a caffeine-overloaded genius who works only when it &#8220;feels like it&#8221; and causes more harm than good half the time.</p><p>That's what we just built with the <a href="http://featherless.ai">Featherless.ai</a> Action-R1 model &amp; agent, which achieved state-of-the-art (SotA) results on the REAL (Realistic Evaluations for Agents Leaderboard) benchmark.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!MOo2!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c943464-192f-4899-90b6-ec3e9fcd74cd_4128x2320.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!MOo2!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c943464-192f-4899-90b6-ec3e9fcd74cd_4128x2320.png 424w, https://substackcdn.com/image/fetch/$s_!MOo2!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c943464-192f-4899-90b6-ec3e9fcd74cd_4128x2320.png 848w, https://substackcdn.com/image/fetch/$s_!MOo2!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c943464-192f-4899-90b6-ec3e9fcd74cd_4128x2320.png 1272w, https://substackcdn.com/image/fetch/$s_!MOo2!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c943464-192f-4899-90b6-ec3e9fcd74cd_4128x2320.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!MOo2!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c943464-192f-4899-90b6-ec3e9fcd74cd_4128x2320.png" width="1456" height="818" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3c943464-192f-4899-90b6-ec3e9fcd74cd_4128x2320.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:818,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:564424,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://substack.recursal.ai/i/164269885?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c943464-192f-4899-90b6-ec3e9fcd74cd_4128x2320.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!MOo2!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c943464-192f-4899-90b6-ec3e9fcd74cd_4128x2320.png 424w, https://substackcdn.com/image/fetch/$s_!MOo2!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c943464-192f-4899-90b6-ec3e9fcd74cd_4128x2320.png 848w, https://substackcdn.com/image/fetch/$s_!MOo2!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c943464-192f-4899-90b6-ec3e9fcd74cd_4128x2320.png 1272w, https://substackcdn.com/image/fetch/$s_!MOo2!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3c943464-192f-4899-90b6-ec3e9fcd74cd_4128x2320.png 1456w" 
sizes="100vw" fetchpriority="high"></picture></div></a><figcaption class="image-caption">realevals.xyz score benchmark</figcaption></figure></div><p>Our <a href="https://www.realevals.xyz/details/92a67e60-bb83-4be9-9422-e93746dc13a0">AI agent (test result at link)</a> achieved a 65% success rate on the REAL benchmark. 
This is more than 50% better, in relative terms, than <a href="https://www.realevals.xyz/details/074c427c-18b2-4c34-8443-b569644fff72">Anthropic computer use (result link)</a>, the next best model &amp; framework by a major lab, at 42%.</p><div><hr></div><h1><strong>What is the REAL benchmark?</strong></h1><p>The REAL benchmark is interesting because it tests over 110 practical, real-world tasks. These are not university-level knowledge tasks, but everyday chores that reflect what people do online, like booking flights, organizing and replying to emails, and shopping for groceries. Basically, office desk work.</p><p>The tasks run in controlled replicas of 11 major websites, including Airbnb, Amazon, Gmail, LinkedIn, and Uber.</p><p><a href="https://arxiv.org/abs/2504.11543">The paper for the benchmark can be found here</a>.</p><div><hr></div><h1>Why does reliability matter?</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!UkEE!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F981d61e5-ee7d-434e-8cfd-b141238c335e_590x405.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!UkEE!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F981d61e5-ee7d-434e-8cfd-b141238c335e_590x405.png 424w, https://substackcdn.com/image/fetch/$s_!UkEE!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F981d61e5-ee7d-434e-8cfd-b141238c335e_590x405.png 848w, 
https://substackcdn.com/image/fetch/$s_!UkEE!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F981d61e5-ee7d-434e-8cfd-b141238c335e_590x405.png 1272w, https://substackcdn.com/image/fetch/$s_!UkEE!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F981d61e5-ee7d-434e-8cfd-b141238c335e_590x405.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!UkEE!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F981d61e5-ee7d-434e-8cfd-b141238c335e_590x405.png" width="590" height="405" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/981d61e5-ee7d-434e-8cfd-b141238c335e_590x405.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:405,&quot;width&quot;:590,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:35032,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://substack.recursal.ai/i/164269885?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F981d61e5-ee7d-434e-8cfd-b141238c335e_590x405.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!UkEE!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F981d61e5-ee7d-434e-8cfd-b141238c335e_590x405.png 424w, https://substackcdn.com/image/fetch/$s_!UkEE!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F981d61e5-ee7d-434e-8cfd-b141238c335e_590x405.png 848w, 
https://substackcdn.com/image/fetch/$s_!UkEE!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F981d61e5-ee7d-434e-8cfd-b141238c335e_590x405.png 1272w, https://substackcdn.com/image/fetch/$s_!UkEE!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F981d61e5-ee7d-434e-8cfd-b141238c335e_590x405.png 1456w" sizes="100vw" loading="lazy"></picture></div></a><figcaption class="image-caption">Side by Side reliability comparisons for top use cases </figcaption></figure></div><p>Frontier AI agents have been 
plateauing at around 43% overall task completion. We hit 65%, leapfrogging them by 22 percentage points.</p><p>More importantly, on certain sites, such as Omnizon (an Amazon clone), our agent reached a 100% success rate, while the next best managed 60%.</p><p><strong>And the 99%+ distinction matters.</strong></p><p>It means you can now rely on this model and agent for tasks in this domain.</p><p>If a model has only a 50% success rate on a category of task, you spend more time and energy &#8220;babysitting&#8221; the model. In several cases, the model will require human intervention, which can take more bandwidth than the task itself.</p><p>Raising reliability means that you can hand off tasks to the AI, eliminating the frustration many workers face with AI in the enterprise.</p><p>For the first time, we have built an AI agent you can rely on for tasks within a handful of platforms with 99%+ success. We will eventually expand to 99%+ reliability for all web platforms.</p><div><hr></div><h1><strong>How we did it</strong></h1><p>We partnered with the team at UI-licious, who automate end-to-end testing for their customers with AI at scale, building on their experience and knowledge in UI test automation and their proprietary petabyte-scale dataset, which they have built up over years of UI testing.</p><p>Together, we co-built a specialized AI action model and agent harness. This helps the AI understand instructions and navigate complex, dynamic UIs.</p><p>As a bonus, these AI agents are not just navigating and completing tasks. 
They are finding and issuing bug reports, which we will be following up with the eval maintainers on &#128521;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!mEar!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d87a3df-ce97-44b8-b59d-5b742c09f02c_5312x2986.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!mEar!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d87a3df-ce97-44b8-b59d-5b742c09f02c_5312x2986.png 424w, https://substackcdn.com/image/fetch/$s_!mEar!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d87a3df-ce97-44b8-b59d-5b742c09f02c_5312x2986.png 848w, https://substackcdn.com/image/fetch/$s_!mEar!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d87a3df-ce97-44b8-b59d-5b742c09f02c_5312x2986.png 1272w, https://substackcdn.com/image/fetch/$s_!mEar!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d87a3df-ce97-44b8-b59d-5b742c09f02c_5312x2986.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!mEar!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d87a3df-ce97-44b8-b59d-5b742c09f02c_5312x2986.png" width="1456" height="818" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7d87a3df-ce97-44b8-b59d-5b742c09f02c_5312x2986.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:818,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1244065,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://substack.recursal.ai/i/164269885?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d87a3df-ce97-44b8-b59d-5b742c09f02c_5312x2986.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!mEar!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d87a3df-ce97-44b8-b59d-5b742c09f02c_5312x2986.png 424w, https://substackcdn.com/image/fetch/$s_!mEar!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d87a3df-ce97-44b8-b59d-5b742c09f02c_5312x2986.png 848w, https://substackcdn.com/image/fetch/$s_!mEar!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d87a3df-ce97-44b8-b59d-5b742c09f02c_5312x2986.png 1272w, https://substackcdn.com/image/fetch/$s_!mEar!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d87a3df-ce97-44b8-b59d-5b742c09f02c_5312x2986.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" 
height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Hopefully these will be passed to another AI to fix the bugs!</figcaption></figure></div><div><hr></div><h1>The bigger picture: attacking multi-billion-dollar markets</h1><p>The <a href="https://www.statista.com/statistics/1309384/worldwide-rpa-software-market-size">$3B+ Robotic Process Automation (RPA) market</a> and the <a href="https://www.datainsightsmarket.com/reports/ui-test-automation-software-1947276">$20B+ UI testing market</a> are dominated by brittle tools that break easily and require constant maintenance, while AI agents in these markets achieve about 60% reliability.</p><p>Today&#8217;s RPA tools, like UiPath, operate like glorified macros, breaking when user interfaces change or when variability is introduced. Businesses spend millions maintaining these systems and fixing broken RPA scripts.</p><p>Agentic AI is different. 
It learns. It adapts. It interacts with interfaces the way humans do: observing, interpreting, and acting based on context. It allows automation over dynamic interfaces.</p><p>And we&#8217;re just getting started.</p><p>As we scale our AI research in the open-source space, we will be working alongside industry partners to build reliable AI agents and models for their own industry-specific use, with their own proprietary datasets.</p><p>So we will be increasing reliability, not just in automated UI testing, but in all future domains as well, with industry partners from accounting to legal.</p><p>Raising reliability means unlocking the 90% of AI projects that fail to enter production within enterprises because the AI was just &#8220;not reliable enough&#8221; for day-to-day office tasks.</p><p>Raising reliability means bringing your AI agent into production and into the market.</p><p>If you are interested in partnering with us and making your AI more SotA-reliable, please reach out to us.</p><div><hr></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!JA52!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c2f0d50-05ac-4b35-873c-4b82b6b6f078_782x584.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!JA52!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c2f0d50-05ac-4b35-873c-4b82b6b6f078_782x584.png 424w, https://substackcdn.com/image/fetch/$s_!JA52!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c2f0d50-05ac-4b35-873c-4b82b6b6f078_782x584.png 848w, 
https://substackcdn.com/image/fetch/$s_!JA52!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c2f0d50-05ac-4b35-873c-4b82b6b6f078_782x584.png 1272w, https://substackcdn.com/image/fetch/$s_!JA52!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c2f0d50-05ac-4b35-873c-4b82b6b6f078_782x584.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!JA52!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c2f0d50-05ac-4b35-873c-4b82b6b6f078_782x584.png" width="610" height="455.54987212276217" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1c2f0d50-05ac-4b35-873c-4b82b6b6f078_782x584.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:584,&quot;width&quot;:782,&quot;resizeWidth&quot;:610,&quot;bytes&quot;:74206,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://substack.recursal.ai/i/164269885?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6fd60670-1a7d-4740-9efe-75b10e6c6285_782x614.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!JA52!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c2f0d50-05ac-4b35-873c-4b82b6b6f078_782x584.png 424w, 
https://substackcdn.com/image/fetch/$s_!JA52!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c2f0d50-05ac-4b35-873c-4b82b6b6f078_782x584.png 848w, https://substackcdn.com/image/fetch/$s_!JA52!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c2f0d50-05ac-4b35-873c-4b82b6b6f078_782x584.png 1272w, https://substackcdn.com/image/fetch/$s_!JA52!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c2f0d50-05ac-4b35-873c-4b82b6b6f078_782x584.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line 
x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h1>How can we try this new action model &amp; agent?</h1><p>Look out for a follow-up launch announcement with our design partners at UI-licious.</p><p>This AI action model and agent will be co-launched on both platforms.</p><p>For early access, you can sign up here: <br><a href="https://forms.gle/wxwQ2z12xf1KzFvPA">https://forms.gle/wxwQ2z12xf1KzFvPA</a></p><p>Priority access will be given to portfolio companies among our investors and users.</p>]]></content:encoded></item><item><title><![CDATA[Featherless.ai Raises $5M in Seed Funding to Democratize Access to Open Source AI Models]]></title><description><![CDATA[Democratizing AI with instant access to 4,000+ open-source models through our breakthrough serverless platform]]></description><link>https://substack.recursal.ai/p/featherlessai-raises-5m-in-seed-funding</link><guid isPermaLink="false">https://substack.recursal.ai/p/featherlessai-raises-5m-in-seed-funding</guid><dc:creator><![CDATA[Darin Verheijke]]></dc:creator><pubDate>Mon, 17 Mar 2025 23:07:40 GMT</pubDate><enclosure 
url="https://substackcdn.com/image/fetch/$s_!pk94!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d729561-541d-4079-a62b-17c9374fb1fb_1600x1066.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><strong>San Francisco - March 17, 2025</strong> - We're excited to announce that Featherless.ai has raised US$5 million in Seed funding from investors including Airbus Ventures, 500 Global, Kickstart Ventures, HF0, Panache Ventures and Oakseed Ventures. As the leading serverless AI inference platform, we're on a mission to provide instant and affordable access to the world's largest collection of open-source AI models.</p><h2>Our Vision: AI for Everyone</h2><p>"AI has the power to transform industries and empower individuals around the globe. However, accessing and utilizing a diverse range of models can be cost-prohibitive and logistically challenging for many users, particularly those in emerging markets, where there's no shortage of demand for AI solutions," notes our Founder and CEO Eugene Cheah. "Featherless.ai is changing the game by providing an affordable and scalable solution for using open source AI models in production. I don't want a future where AI is controlled by the few. I want to empower individuals globally."</p><h2>What We Offer: The World's Largest Collection of Open-Source Models</h2><p>At Featherless.ai, we provide instant and affordable access to over 4,000 open-source AI models, including popular options like DeepSeek and Llama, and we onboard new models every week. Our flat capacity pricing model ensures cost predictability and scalability, allowing businesses to dynamically scale AI usage without worrying about unexpected charges or rate limits.</p><p>Instead of dealing with infrastructure headaches, our users can focus their energies on what matters: running experiments to develop, test, and fine-tune their models. 
With this new funding, we're planning to support all major AI modalities, including embeddings, vision, and speech.</p><p>For casual AI users, our serverless platform offers a low entry point of just $10 per month, providing access to a wide range of the latest open-source models without the need for expensive high-end GPUs. For businesses and enterprises, we deliver scale and significant cost efficiencies through dynamic AI workload scaling and flat capacity pricing.</p><h2>Our Breakthrough Technology</h2><p>With our new funding, we're advancing research and development into next-generation AI architectures that can dramatically lower inference costs, making AI deployment feasible on lower-cost hardware rather than requiring high-end GPUs.</p><p>Our architecture enables seamless support for over 100 languages without any performance degradation. This breakthrough removes a major barrier to global AI deployment, where existing systems struggle to maintain consistency across multiple languages.</p><p>One of our proudest innovations is our proprietary hot-swapping technology, which allows AI models to be switched in under 5 seconds, compared with the 30 minutes typically required to load a new AI model onto a standard GPU. This innovation optimizes GPU utilization, eliminates costly downtime, and dramatically reduces operational costs.</p><p>We're also actively involved in the development of the open-source RWKV foundation model project, the first AI model under the Linux Foundation, which has been deployed on Windows to billions of devices<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>. This effort is part of our broader commitment to developing better and more accessible AI models across languages, and to reducing reliance on closed-source models from major tech companies.</p><h2>What Our Investors Are Saying</h2><p>"The biggest hurdle to scaling AI is compute cost. 
Featherless.ai tackles this head-on by providing the trifecta of efficient, effective and affordable AI deployment to businesses globally. Our portfolio companies are avid users of Featherless.ai, and we look forward to expanding that across all the regions we invest in," shares Vishal Harnal, Managing Partner at 500 Global.</p><p>Yuichiro Hikosaka, Principal at Airbus Ventures, explains: "Our research into foundational model development revealed that while much of the AI industry has focused on scaling existing transformer models, fundamental inefficiencies in the architecture have remained unaddressed. As most foundational models struggle to maintain performance across multiple languages, creating accessibility barriers in global markets, these structural inefficiencies have threatened to stifle competition and innovation across the broader global ecosystem. Now enter Featherless.ai - a serverless platform that empowers users to run, test, and fine-tune the latest models seamlessly and affordably. Featherless.ai is ready and primed to address a global market."</p><p>"We see Large Language Models being experimented and integrated into applications everywhere, Featherless.ai plays a unique role in enabling organizations to test, integrate and run any and every open source LLM easily and cost-effectively. 
Their innovative approach and focus on serving diverse markets make them a standout in this rapidly growing space," says Chee-We Ng, Managing Partner at Oakseed Ventures.</p><h2>Our Team</h2><p>Featherless.ai is co-founded by Eugene Cheah (CEO), Harrison Vanderbyl (CTO), and Wesley George (COO), who bring combined experience of 30 years building software and leading engineering teams.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!pk94!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d729561-541d-4079-a62b-17c9374fb1fb_1600x1066.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!pk94!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d729561-541d-4079-a62b-17c9374fb1fb_1600x1066.png 424w, https://substackcdn.com/image/fetch/$s_!pk94!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d729561-541d-4079-a62b-17c9374fb1fb_1600x1066.png 848w, https://substackcdn.com/image/fetch/$s_!pk94!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d729561-541d-4079-a62b-17c9374fb1fb_1600x1066.png 1272w, https://substackcdn.com/image/fetch/$s_!pk94!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d729561-541d-4079-a62b-17c9374fb1fb_1600x1066.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!pk94!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d729561-541d-4079-a62b-17c9374fb1fb_1600x1066.png" width="1456" 
height="970" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6d729561-541d-4079-a62b-17c9374fb1fb_1600x1066.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:970,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Featherless.ai team pictured in Palm Springs, California.&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Featherless.ai team pictured in Palm Springs, California." title="Featherless.ai team pictured in Palm Springs, California." srcset="https://substackcdn.com/image/fetch/$s_!pk94!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d729561-541d-4079-a62b-17c9374fb1fb_1600x1066.png 424w, https://substackcdn.com/image/fetch/$s_!pk94!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d729561-541d-4079-a62b-17c9374fb1fb_1600x1066.png 848w, https://substackcdn.com/image/fetch/$s_!pk94!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d729561-541d-4079-a62b-17c9374fb1fb_1600x1066.png 1272w, https://substackcdn.com/image/fetch/$s_!pk94!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d729561-541d-4079-a62b-17c9374fb1fb_1600x1066.png 1456w" sizes="100vw" loading="lazy"></picture></div></a><figcaption class="image-caption">Featherless.ai team pictured in Palm Springs, California.</figcaption></figure></div><h2>Join us!</h2><p>Featherless.ai is hiring! If you're passionate about making AI open and accessible, send your cover letter to <a href="mailto:hello@featherless.ai">hello@featherless.ai</a>.
For more information about our platform and services, visit <a href="http://www.featherless.ai">www.featherless.ai</a>.</p><p>Together, we can democratize access to AI and ensure that these powerful technologies benefit everyone, not just those with unlimited resources.</p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p><strong>RWKV.cpp - shipping to 1.5 billion systems worldwide</strong> (https://blog.rwkv.com/p/rwkvcpp-shipping-to-half-a-billion)</p></div></div>]]></content:encoded></item><item><title><![CDATA[QwQ-32B Now Available on Featherless.ai ]]></title><description><![CDATA[QwQ-32B: A Powerful Lightweight in the Age of Reasoning Models]]></description><link>https://substack.recursal.ai/p/qwq-32b-now-available-on-featherlessai</link><guid isPermaLink="false">https://substack.recursal.ai/p/qwq-32b-now-available-on-featherlessai</guid><dc:creator><![CDATA[Darin Verheijke]]></dc:creator><pubDate>Fri, 07 Mar 2025 10:14:55 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!Z7_s!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ec322f4-1674-4a42-ad82-6c73a84edcfc_1008x400.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h2><strong>QwQ-32B: A Powerful Lightweight in the Age of Reasoning Models</strong></h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Z7_s!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ec322f4-1674-4a42-ad82-6c73a84edcfc_1008x400.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" 
srcset="https://substackcdn.com/image/fetch/$s_!Z7_s!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ec322f4-1674-4a42-ad82-6c73a84edcfc_1008x400.png 424w, https://substackcdn.com/image/fetch/$s_!Z7_s!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ec322f4-1674-4a42-ad82-6c73a84edcfc_1008x400.png 848w, https://substackcdn.com/image/fetch/$s_!Z7_s!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ec322f4-1674-4a42-ad82-6c73a84edcfc_1008x400.png 1272w, https://substackcdn.com/image/fetch/$s_!Z7_s!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ec322f4-1674-4a42-ad82-6c73a84edcfc_1008x400.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Z7_s!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ec322f4-1674-4a42-ad82-6c73a84edcfc_1008x400.png" width="1008" height="400" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6ec322f4-1674-4a42-ad82-6c73a84edcfc_1008x400.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:400,&quot;width&quot;:1008,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:573231,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://substack.recursal.ai/i/158576603?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ec322f4-1674-4a42-ad82-6c73a84edcfc_1008x400.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" 
class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Z7_s!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ec322f4-1674-4a42-ad82-6c73a84edcfc_1008x400.png 424w, https://substackcdn.com/image/fetch/$s_!Z7_s!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ec322f4-1674-4a42-ad82-6c73a84edcfc_1008x400.png 848w, https://substackcdn.com/image/fetch/$s_!Z7_s!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ec322f4-1674-4a42-ad82-6c73a84edcfc_1008x400.png 1272w, https://substackcdn.com/image/fetch/$s_!Z7_s!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ec322f4-1674-4a42-ad82-6c73a84edcfc_1008x400.png 1456w" sizes="100vw" fetchpriority="high"></picture></div></a></figure></div><p>AI development continues to advance through diverse approaches to model design and optimization. <strong>DeepSeek-R1</strong>, with its impressive 671B parameters, has established itself as one of the most capable reasoning-focused models on the market. Its remarkable capabilities have set new benchmarks for what models in this space can achieve.</p><p>Meanwhile, <strong>efficiency and adaptability</strong> continue opening new frontiers, and this is where <strong>QwQ-32B</strong>, Qwen's latest release, makes its mark.</p><h2><strong>QwQ-32B: Efficient Reasoning Power</strong></h2><p><strong>QwQ-32B</strong> delivers <strong>high-level reasoning, problem-solving, and strong coding/math capabilities</strong> in a lightweight package. With just 32B parameters, early benchmarks show impressive performance, making it an attractive option for those looking for strong reasoning capabilities in a more efficient format.</p><p>With AI applications diversifying, the demand for models that deliver <strong>excellent performance with a smaller resource footprint</strong> continues to grow. <strong>QwQ-32B exemplifies how well-optimized models can achieve remarkable results.</strong></p><h2><strong>The Evolution of Reasoning Models</strong></h2><p>The AI field is evolving rapidly. While early models focused heavily on <strong>generative fluency and knowledge retrieval</strong>, today's most exciting breakthroughs are in <strong>models that can reason, plan, and solve complex problems</strong>.
DeepSeek-R1 has been instrumental in this evolution, demonstrating the power of advanced reasoning capabilities.</p><p>Now, Qwen is expanding possibilities further, showing that <strong>reasoning power can be delivered in different formats to meet diverse needs.</strong></p><h2><strong>How Does QwQ-32B Perform?</strong></h2><p>Testing indicates that QwQ-32B:</p><ul><li><p><strong>Excels in logical reasoning tasks</strong> with impressive structured problem-solving</p></li><li><p><strong>Performs well in math and coding</strong>, key benchmarks for reasoning capability</p></li><li><p><strong>Offers strong efficiency</strong>, delivering high performance with reduced compute requirements</p></li></ul><p>For users seeking <strong>high-quality reasoning in an efficient package</strong>, QwQ-32B presents an exciting option.</p><h2><strong>What This Means For You</strong></h2><p>With both QwQ-32B and DeepSeek-R1 available on Featherless.ai, you now have multiple excellent options for advanced reasoning capabilities. Our ongoing optimization efforts ensure that you'll benefit from continuous improvements in both performance and functionality.</p><p>We're committed to making advanced open AI models accessible and practical for everyone. Each model in our lineup offers unique advantages to suit different use cases and requirements.</p><h2><strong>Try Our Models and Share Your Thoughts</strong></h2><p>QwQ-32B is now available alongside DeepSeek-R1, and we want to hear about your experiences with both models. Each excels in reasoning tasks while offering different profiles in terms of scale and efficiency.</p><p>Leave a review <strong>on the Featherless.ai model page</strong>: <a href="https://featherless.ai/models/Qwen/QwQ-32B">QwQ-32B</a></p><p>Have questions about integrating these models into your workflow? 
Reach out to us on <a href="https://discord.gg/7gybCMPjVA">Discord</a> or check our documentation for implementation guidelines and best practices.</p>]]></content:encoded></item><item><title><![CDATA[Recursal.AI becomes Featherless.AI]]></title><description><![CDATA[A New Chapter, Same Mission]]></description><link>https://substack.recursal.ai/p/recursalai-becomes-featherlessai</link><guid isPermaLink="false">https://substack.recursal.ai/p/recursalai-becomes-featherlessai</guid><dc:creator><![CDATA[Eugene Cheah]]></dc:creator><pubDate>Tue, 11 Feb 2025 02:21:11 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!IwQW!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd59b2c67-b559-469a-8ff2-238ea8d20c25_4096x3429.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!IwQW!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd59b2c67-b559-469a-8ff2-238ea8d20c25_4096x3429.jpeg" 
data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!IwQW!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd59b2c67-b559-469a-8ff2-238ea8d20c25_4096x3429.jpeg 424w, https://substackcdn.com/image/fetch/$s_!IwQW!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd59b2c67-b559-469a-8ff2-238ea8d20c25_4096x3429.jpeg 848w, https://substackcdn.com/image/fetch/$s_!IwQW!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd59b2c67-b559-469a-8ff2-238ea8d20c25_4096x3429.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!IwQW!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd59b2c67-b559-469a-8ff2-238ea8d20c25_4096x3429.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!IwQW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd59b2c67-b559-469a-8ff2-238ea8d20c25_4096x3429.jpeg" width="1456" height="1219" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d59b2c67-b559-469a-8ff2-238ea8d20c25_4096x3429.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1219,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:4171205,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" 
srcset="https://substackcdn.com/image/fetch/$s_!IwQW!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd59b2c67-b559-469a-8ff2-238ea8d20c25_4096x3429.jpeg 424w, https://substackcdn.com/image/fetch/$s_!IwQW!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd59b2c67-b559-469a-8ff2-238ea8d20c25_4096x3429.jpeg 848w, https://substackcdn.com/image/fetch/$s_!IwQW!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd59b2c67-b559-469a-8ff2-238ea8d20c25_4096x3429.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!IwQW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd59b2c67-b559-469a-8ff2-238ea8d20c25_4096x3429.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture></div></a></figure></div><p>Over the past year, Recursal.AI has served us well as a name and identity for our development team and our various projects. It has been a banner under which we&#8217;ve worked to push the boundaries in AI.</p><p>Over time, however, we have become more commonly identified with our platform, Featherless.AI, which has grown to be the biggest provider of Hugging Face LLMs, striving to make them more accessible and free from the constraints of closed ecosystems.</p><p>It has become the identity most closely associated with our work, our vision, and the principles that drive us forward.</p><p>So today, we&#8217;re making it official: Recursal.AI is now Featherless.AI. This change isn&#8217;t just about branding&#8212;it&#8217;s a reaffirmation of our commitment to open-source AI.</p><p>We remain dedicated to bringing powerful language models to the wider world for the benefit of the many.</p><p>Our mission remains the same: to develop and support AI tools that are transparent, accessible, and unshackled from proprietary control. Here&#8217;s to the next chapter&#8212;under a name that truly reflects the community and future we&#8217;re building together.
</p><p>&#8212; The Featherless.AI Team</p>]]></content:encoded></item><item><title><![CDATA[Featherless Feud: a dip into LLM-powered game development]]></title><description><![CDATA[rebuilding a TV classic]]></description><link>https://substack.recursal.ai/p/featherless-feud-a-dip-into-llm-powered</link><guid isPermaLink="false">https://substack.recursal.ai/p/featherless-feud-a-dip-into-llm-powered</guid><dc:creator><![CDATA[Erik Cadieux]]></dc:creator><pubDate>Tue, 20 Aug 2024 04:29:55 GMT</pubDate><enclosure url="https://substackcdn.com/image/youtube/w_728,c_limit/aIPf7nsvrGM" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>It takes time to develop a sense of how to apply LLMs to concrete problems, and a great way to do this is to build with them. In this post, we&#8217;ll work through building a web version of the TV classic Family Feud.</p><div id="youtube2-aIPf7nsvrGM" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;aIPf7nsvrGM&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/aIPf7nsvrGM?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><p>This is a game with elements that are quite challenging for &#8220;traditional&#8221; software development but remarkably easy with an LLM. If you want to try what we built before reading further, you can do so <a href="https://huggingface.co/spaces/Darok/Featherless-Feud">here</a>!</p><h1>The Challenges</h1><p>Building a game requires solving lots of problems around input, output, UI, visuals, latency; the list goes on.
We&#8217;ll be focusing on the <em>content</em>-related challenges that are hard to handle with the &#8220;rules-based&#8221; nature of typical software.</p><p>When we think about building Family Feud - or Featherless Feud, as we call this version - we hit two problems:</p><ol><li><p>question and answer generation</p></li><li><p>fuzzy answer matching</p></li></ol><p>That is, how do we come up with quiz questions, and when given answers that aren&#8217;t exactly on the list, how do we match them?</p><p>For the actual show, the question bank is built with the help of surveys of the general public, while matching variations of the official answers is done live by show staff: a natural language task that is simple for humans but challenging for software.</p><p>In both cases, we&#8217;ll be prompting an LLM to generate a certain kind of text and interpreting the results. Beyond just prompting, though, how we run the inference also matters. Let&#8217;s dive in.</p><h1><strong>Part 1 - Question/Answer Generation</strong></h1><p>To generate questions, we might prompt an LLM like so:</p><pre><code>You are the producer of the game show Family Feud.

Your job is to devise a question and a list of common answers to this question. For each answer, output a number between 0 and 100 which is how common you think the answer would be when given by a member of the general public. The sum of the scores should not exceed 100.</code></pre><p>That content has structure (e.g. question versus answers), and we need to know about it so that the various pieces can be handled appropriately by the different parts of our app.</p><p>The LLM &#8220;knows&#8221; about the relationship between the question and the answers in its output, so we can ask it to include those annotations. We could ask the LLM for any kind of structured format, but the most common one to request is JSON.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!dxsY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F253df487-e755-4a6a-9fca-d72c2488a306_1792x1024.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!dxsY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F253df487-e755-4a6a-9fca-d72c2488a306_1792x1024.webp 424w, https://substackcdn.com/image/fetch/$s_!dxsY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F253df487-e755-4a6a-9fca-d72c2488a306_1792x1024.webp 848w, https://substackcdn.com/image/fetch/$s_!dxsY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F253df487-e755-4a6a-9fca-d72c2488a306_1792x1024.webp 1272w, 
https://substackcdn.com/image/fetch/$s_!dxsY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F253df487-e755-4a6a-9fca-d72c2488a306_1792x1024.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!dxsY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F253df487-e755-4a6a-9fca-d72c2488a306_1792x1024.webp" width="1456" height="832" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/253df487-e755-4a6a-9fca-d72c2488a306_1792x1024.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:832,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;A modern, sleek, and minimalistic banner that visually represents JSON data parsing in a game development context. The background is dark, with abstract digital lines and code snippets flowing from left to right, symbolizing data parsing and AI integration. In the center, a simplified JSON structure glows, with key-value pairs like 'question' and 'answers' prominently displayed. Around this structure, subtle hints of a Family Feud-style game show, such as silhouettes of a game board or abstract players, are faintly visible. The overall color scheme should include dark tones with glowing accents, creating a high-tech and focused atmosphere.&quot;,&quot;title&quot;:&quot;A modern, sleek, and minimalistic banner that visually represents JSON data parsing in a game development context. The background is dark, with abstract digital lines and code snippets flowing from left to right, symbolizing data parsing and AI integration. In the center, a simplified JSON structure glows, with key-value pairs like 'question' and 'answers' prominently displayed. 
Around this structure, subtle hints of a Family Feud-style game show, such as silhouettes of a game board or abstract players, are faintly visible. The overall color scheme should include dark tones with glowing accents, creating a high-tech and focused atmosphere.&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="A modern, sleek, and minimalistic banner that visually represents JSON data parsing in a game development context. The background is dark, with abstract digital lines and code snippets flowing from left to right, symbolizing data parsing and AI integration. In the center, a simplified JSON structure glows, with key-value pairs like 'question' and 'answers' prominently displayed. Around this structure, subtle hints of a Family Feud-style game show, such as silhouettes of a game board or abstract players, are faintly visible. The overall color scheme should include dark tones with glowing accents, creating a high-tech and focused atmosphere." title="A modern, sleek, and minimalistic banner that visually represents JSON data parsing in a game development context. The background is dark, with abstract digital lines and code snippets flowing from left to right, symbolizing data parsing and AI integration. In the center, a simplified JSON structure glows, with key-value pairs like 'question' and 'answers' prominently displayed. Around this structure, subtle hints of a Family Feud-style game show, such as silhouettes of a game board or abstract players, are faintly visible. The overall color scheme should include dark tones with glowing accents, creating a high-tech and focused atmosphere." 
srcset="https://substackcdn.com/image/fetch/$s_!dxsY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F253df487-e755-4a6a-9fca-d72c2488a306_1792x1024.webp 424w, https://substackcdn.com/image/fetch/$s_!dxsY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F253df487-e755-4a6a-9fca-d72c2488a306_1792x1024.webp 848w, https://substackcdn.com/image/fetch/$s_!dxsY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F253df487-e755-4a6a-9fca-d72c2488a306_1792x1024.webp 1272w, https://substackcdn.com/image/fetch/$s_!dxsY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F253df487-e755-4a6a-9fca-d72c2488a306_1792x1024.webp 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" 
stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Getting a model to format its output as JSON typically takes two parts:</p><ol><li><p>prompting for the answer to be in JSON, with a description of the desired structure</p></li><li><p>requesting JSON from your inference API call</p></li></ol><p>For the first part, you might update your prompt as follows:</p><pre><code>You are the producer of the game show Family Feud.

Your job is to devise a question and a list of common answers to this question. For each answer, output a number between 0 and 100 which is how common you think the answer would be when given by a member of the general public. The sum of the scores should not exceed 100.

Be very concise, present only the question and answers, and don't add anything else.

Output valid JSON with the following format: two keys, `question`, which should be a string, and `answers` should be an array of objects. Each object in the `answers` array should have two keys, `answer` and `score`, each of which should be a string.

e.g.
```json
{
  "question": "Name something that might be wobbly",
  "answers": [
    { "answer": "Furniture", "score": 64 },
    { "answer": "Person/A Drunk", "score": 15 },
    { "answer": "Spinning toy/top", "score": 5 },
    { "answer": "Shopping Cart", "score": 3 }
  ]
}
```

The sum of the scores should not exceed 100.</code></pre><p>Being prescriptive about structure is important; if the output description isn&#8217;t clear enough to the model, you&#8217;ll get variation in the structure and keys used, which will likely cause an exception when the part of your software operating on the JSON output tries to separate the question from the answers.</p><p>Even with an explicit description of the output structure, though, prompting alone is unlikely to give you output that is syntactically valid JSON. Here&#8217;s a sample of the output from the prompt above:</p><pre><code>Here is a question with a list of answers and scores:

{
  "question": "Name something you might find in a purse or wallet",
  "answers": [
    { "answer": "Money", score: 70 },
    { "answer": "Phone", score: 20 },
    { "answer": "Credit Card", score: 5 },
    { "answer": "Candy", score: 2 },
    { "answer": "Makeup", score: 3 }
  ]
}</code></pre><p>Pretty good, right? It came up with a question, in the format requested (and needed for the game). It also output the `score` values as numbers even though the prompt asked for strings! The model overrode my instructions, but for the better. However, despite the request to give <em>nothing</em> but the output, there is still a preamble, which would take its own intelligence to parse out, and the `score` keys are unquoted, so the object isn&#8217;t strictly valid JSON. How do we get rid of these problems?</p><p>The other half of the equation is an inference technique known in its most general form as &#8220;guided decoding&#8221;, and in this specific form as JSON mode. Here we guarantee the output is syntactically valid JSON by manipulating token probabilities during sampling to prevent syntactically invalid JSON from appearing (e.g. the string {''} is <em>not </em>valid JSON).</p><p>Of course, these two parts are in tension with one another - if you don&#8217;t ask for JSON in your prompt (or don&#8217;t specify the structure clearly enough) and turn on JSON mode, it&#8217;s possible that your output bears little relationship to your input. With an early version of the prompt above, for instance, I got back a number of questions.</p><p>How you request JSON mode varies by API provider (it is one specific type of guided decoding, which is a big topic), but many (most?) providers that have an API will accept the same request format as OpenAI, which is to include a</p><pre><code>"response_format": { "type": "json_object" }</code></pre><p>in the body of the inference POST request. Once this is included, my output is exactly what I hoped for: the JSON and only the JSON.</p><pre><code>{
  "question": "Name something you might find in a garage",
  "answers": [
    { "answer": "Tools", "score": 40 },
    { "answer": "Bike", "score": 20 },
    { "answer": "Work Bench", "score": 12 },
    { "answer": "Vacuum", "score": 8 },
    { "answer": "Lawn Mower", "score": 5 },
    { "answer": "Storage Containers", "score": 5 }
  ]
}</code></pre><p>Note that generation <em>takes longer</em> when JSON mode is enabled. This is a consequence of how guided decoding works, which is worth a blog post in its own right.</p><p>We haven&#8217;t talked about the Featherless API we&#8217;ve been using for inference, but hold on to that thought - more on that in a bit.</p><h1><strong>Part 2 - Fuzzy Answer Matching</strong></h1><p>The other challenge is matching players&#8217; answers to the list of official answers, as they don&#8217;t always match exactly. In our running example from the video above, the question &#8220;Name something that might be wobbly&#8221; had the following answers:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!3Mmg!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff159847e-d4be-4fd9-907e-dc9b16d993f5_1388x788.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!3Mmg!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff159847e-d4be-4fd9-907e-dc9b16d993f5_1388x788.png 424w, https://substackcdn.com/image/fetch/$s_!3Mmg!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff159847e-d4be-4fd9-907e-dc9b16d993f5_1388x788.png 848w, https://substackcdn.com/image/fetch/$s_!3Mmg!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff159847e-d4be-4fd9-907e-dc9b16d993f5_1388x788.png 1272w, https://substackcdn.com/image/fetch/$s_!3Mmg!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff159847e-d4be-4fd9-907e-dc9b16d993f5_1388x788.png 1456w" 
sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!3Mmg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff159847e-d4be-4fd9-907e-dc9b16d993f5_1388x788.png" width="1388" height="788" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f159847e-d4be-4fd9-907e-dc9b16d993f5_1388x788.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:788,&quot;width&quot;:1388,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1837675,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!3Mmg!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff159847e-d4be-4fd9-907e-dc9b16d993f5_1388x788.png 424w, https://substackcdn.com/image/fetch/$s_!3Mmg!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff159847e-d4be-4fd9-907e-dc9b16d993f5_1388x788.png 848w, https://substackcdn.com/image/fetch/$s_!3Mmg!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff159847e-d4be-4fd9-907e-dc9b16d993f5_1388x788.png 1272w, https://substackcdn.com/image/fetch/$s_!3Mmg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff159847e-d4be-4fd9-907e-dc9b16d993f5_1388x788.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" 
class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Name something that might be wobbly - &#8220;A stool&#8221; and &#8220;A baby&#8221; were both accepted answers</figcaption></figure></div><p>Where &#8220;A stool&#8221; was accepted in place of &#8220;Furniture&#8221; and &#8220;A baby&#8221; was accepted as a version of &#8220;Person / a drunk&#8221;.</p><p>It&#8217;s relatively easy to devise a prompt and post-processing that captures the gist of the idea. 
Here&#8217;s a short Python program we used during testing; it lets you iterate on the prompt and model choice by judging an answer you supply on the command line.</p><div class="github-gist" data-attrs="{&quot;innerHTML&quot;:&quot;<div id=\&quot;gist132134060\&quot; class=\&quot;gist\&quot;>\n    <div class=\&quot;gist-file\&quot; translate=\&quot;no\&quot; data-color-mode=\&quot;light\&quot; data-light-theme=\&quot;light\&quot;>\n      <div class=\&quot;gist-data\&quot;>\n        <div class=\&quot;js-gist-file-update-container js-task-list-container\&quot;>\n  <div id=\&quot;file-family-feud-judging-prompt-py\&quot; class=\&quot;file my-2\&quot;>\n    \n    <div itemprop=\&quot;text\&quot; class=\&quot;Box-body p-0 blob-wrapper data type-python  \&quot;>\n\n        \n<div class=\&quot;js-check-bidi js-blob-code-container blob-code-content\&quot;>\n\n  <table data-hpc class=\&quot;highlight tab-size js-file-line-container js-code-nav-container js-tagsearch-file\&quot; data-tab-size=\&quot;8\&quot; data-paste-markdown-skip data-tagsearch-lang=\&quot;Python\&quot; data-tagsearch-path=\&quot;family-feud-judging-prompt.py\&quot;>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L1\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;1\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC1\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span 
class=pl-k>from</span> <span class=pl-s1>openai</span> <span class=pl-k>import</span> <span class=pl-v>OpenAI</span></td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L2\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;2\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC2\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span class=pl-k>import</span> <span class=pl-s1>os</span></td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L3\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;3\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC3\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span class=pl-k>import</span> <span class=pl-s1>click</span></td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L4\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;4\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC4\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>\n</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L5\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;5\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC5\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span class=pl-s1>client</span> <span class=pl-c1>=</span> <span class=pl-v>OpenAI</span>(</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L6\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;6\&quot;></td>\n          <td 
id=\&quot;file-family-feud-judging-prompt-py-LC6\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>  <span class=pl-s1>base_url</span><span class=pl-c1>=</span><span class=pl-s>&amp;quot;https://api.featherless.ai/v1&amp;quot;</span>,</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L7\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;7\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC7\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>  <span class=pl-s1>api_key</span><span class=pl-c1>=</span><span class=pl-s1>os</span>.<span class=pl-s1>environ</span>.<span class=pl-en>get</span>(<span class=pl-s>&amp;#39;FEATHERLESS_API_KEY&amp;#39;</span>)</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L8\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;8\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC8\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>)</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L9\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;9\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC9\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>\n</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L10\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;10\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC10\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span class=pl-en>@<span class=pl-s1>click</span>.<span class=pl-en>command</span>()</span></td>\n        
</tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L11\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;11\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC11\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span class=pl-en>@<span class=pl-s1>click</span>.<span class=pl-en>argument</span>(<span class=pl-s>&amp;#39;answer&amp;#39;</span>)</span></td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L12\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;12\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC12\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span class=pl-en>@<span class=pl-s1>click</span>.<span class=pl-en>option</span>(<span class=pl-s>&amp;#39;--model&amp;#39;</span>, <span class=pl-s1>default</span><span class=pl-c1>=</span><span class=pl-s>&amp;#39;meta-llama/Meta-Llama-3-8B-Instruct&amp;#39;</span>)</span></td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L13\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;13\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC13\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span class=pl-k>def</span> <span class=pl-en>quiz</span>(<span class=pl-s1>answer</span>, <span class=pl-s1>model</span>):</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L14\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;14\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC14\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>  <span class=pl-s1>question</span> <span 
class=pl-c1>=</span> <span class=pl-s>&amp;quot;Name something that might be wobbly&amp;quot;</span></td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L15\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;15\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC15\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>  <span class=pl-s1>official_answers</span> <span class=pl-c1>=</span> [</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L16\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;16\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC16\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>    <span class=pl-s>&amp;quot;Furniture&amp;quot;</span>,</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L17\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;17\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC17\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>    <span class=pl-s>&amp;quot;Person/A Drunk&amp;quot;</span>,</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L18\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;18\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC18\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>    <span class=pl-s>&amp;quot;Spinning toy/top&amp;quot;</span>,</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L19\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; 
data-line-number=\&quot;19\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC19\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>    <span class=pl-s>&amp;quot;Shopping Cart&amp;quot;</span>,</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L20\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;20\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC20\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>  ]</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L21\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;21\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC21\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>\n</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L22\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;22\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC22\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>  <span class=pl-s1>system_prompt</span> <span class=pl-c1>=</span> <span class=pl-s>f&amp;quot;&amp;quot;&amp;quot;You are a judge on the show Family Feud.</span></td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L23\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;23\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC23\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span class=pl-s>You&amp;#39;re going to receive a guess from a contestant. 
That guess is a guess at one of the answers to the question</span></td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L24\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;24\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC24\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span class=pl-s></span></td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L25\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;25\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC25\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span class=pl-s>Question is &amp;quot;<span class=pl-s1><span class=pl-kos>{</span><span class=pl-s1>question</span><span class=pl-kos>}</span></span>&amp;quot;</span></td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L26\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;26\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC26\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span class=pl-s></span></td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L27\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;27\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC27\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span class=pl-s>The official answers are</span></td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L28\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;28\&quot;></td>\n          
<td id=\&quot;file-family-feud-judging-prompt-py-LC28\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span class=pl-s><span class=pl-s1><span class=pl-kos>{</span><span class=pl-s>&amp;#39;<span class=pl-cce>\\n</span>&amp;#39;</span>.<span class=pl-en>join</span>([ <span class=pl-s>f&amp;quot;* <span class=pl-s1><span class=pl-kos>{</span><span class=pl-s1>a</span><span class=pl-kos>}</span></span>&amp;quot;</span> <span class=pl-k>for</span> <span class=pl-s1>a</span> <span class=pl-c1>in</span> <span class=pl-s1>official_answers</span> ])<span class=pl-kos>}</span></span></span></td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L29\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;29\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC29\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span class=pl-s></span></td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L30\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;30\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC30\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span class=pl-s>If the candidate&amp;#39;s guess is a version of an official answer, please respond with the wording of the official answer.</span></td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L31\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;31\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC31\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span class=pl-s></span></td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L32\&quot; class=\&quot;blob-num 
js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;32\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC32\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span class=pl-s>If the candidate&amp;#39;s guess is not a version of any official answer, respond with &amp;quot;Survey says ... AERR!!&amp;quot;</span></td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L33\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;33\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC33\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span class=pl-s>&amp;quot;&amp;quot;&amp;quot;</span></td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L34\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;34\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC34\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>\n</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L35\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;35\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC35\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>  <span class=pl-en>print</span>(<span class=pl-s1>model</span>)</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L36\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;36\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC36\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>  <span class=pl-en>print</span>(<span 
class=pl-s1>system_prompt</span>)</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L37\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;37\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC37\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>\n</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L38\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;38\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC38\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>  <span class=pl-s1>chat_completions_response</span> <span class=pl-c1>=</span> <span class=pl-s1>client</span>.<span class=pl-s1>chat</span>.<span class=pl-s1>completions</span>.<span class=pl-en>create</span>(</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L39\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;39\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC39\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>    <span class=pl-s1>model</span><span class=pl-c1>=</span><span class=pl-s1>model</span>,</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L40\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;40\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC40\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>    <span class=pl-s1>messages</span><span class=pl-c1>=</span>[</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L41\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number 
js-blob-rnum\&quot; data-line-number=\&quot;41\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC41\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>      { <span class=pl-s>&amp;quot;role&amp;quot;</span>: <span class=pl-s>&amp;quot;system&amp;quot;</span>, <span class=pl-s>&amp;quot;content&amp;quot;</span>: <span class=pl-s1>system_prompt</span>},</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L42\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;42\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC42\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>      { <span class=pl-s>&amp;quot;role&amp;quot;</span>: <span class=pl-s>&amp;quot;user&amp;quot;</span>, <span class=pl-s>&amp;quot;content&amp;quot;</span>: <span class=pl-s1>answer</span> }</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L43\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;43\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC43\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>    ],</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L44\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;44\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC44\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>    <span class=pl-s1>max_tokens</span><span class=pl-c1>=</span><span class=pl-c1>250</span>,</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L45\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;45\&quot;></td>\n          
<td id=\&quot;file-family-feud-judging-prompt-py-LC45\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>  )</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L46\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;46\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC46\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>  <span class=pl-en>print</span>(<span class=pl-s1>chat_completions_response</span>.<span class=pl-s1>choices</span>[<span class=pl-c1>0</span>].<span class=pl-s1>message</span>.<span class=pl-s1>content</span>)</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L47\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;47\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC47\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>\n</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L48\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;48\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC48\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;><span class=pl-k>if</span> <span class=pl-s1>__name__</span> <span class=pl-c1>==</span> <span class=pl-s>&amp;quot;__main__&amp;quot;</span>:</td>\n        </tr>\n        <tr>\n          <td id=\&quot;file-family-feud-judging-prompt-py-L49\&quot; class=\&quot;blob-num js-line-number js-code-nav-line-number js-blob-rnum\&quot; data-line-number=\&quot;49\&quot;></td>\n          <td id=\&quot;file-family-feud-judging-prompt-py-LC49\&quot; class=\&quot;blob-code blob-code-inner js-file-line\&quot;>  <span class=pl-en>quiz</span>()</td>\n        </tr>\n  </table>\n</div>\n\n\n    
</div>\n\n  </div>\n</div>\n\n      </div>\n      <div class=\&quot;gist-meta\&quot;>\n        <a href=\&quot;https://gist.github.com/wxgeorge/4d8f812c381e62dc91c006296d9b0561/raw/8beabc223d1578a0eee77c2e7bfe02a4af378992/family-feud-judging-prompt.py\&quot; style=\&quot;float:right\&quot; class=\&quot;Link--inTextBlock\&quot;>view raw</a>\n        <a href=\&quot;https://gist.github.com/wxgeorge/4d8f812c381e62dc91c006296d9b0561#file-family-feud-judging-prompt-py\&quot; class=\&quot;Link--inTextBlock\&quot;>\n          family-feud-judging-prompt.py\n        </a>\n        hosted with &amp;#10084; by <a class=\&quot;Link--inTextBlock\&quot; href=\&quot;https://github.com\&quot;>GitHub</a>\n      </div>\n    </div>\n</div>\n&quot;,&quot;stylesheet&quot;:&quot;https://github.githubassets.com/assets/gist-embed-3575177cfe1a.css&quot;}" data-component-name="GitgistToDOM"><link rel="stylesheet" href="https://github.githubassets.com/assets/gist-embed-3575177cfe1a.css"><div id="gist132134060" class="gist">
    <div class="gist-file" data-color-mode="light" data-light-theme="light">
      <div class="gist-data">
        <div class="js-gist-file-update-container js-task-list-container">
  <div id="file-family-feud-judging-prompt-py" class="file my-2">
    
    <div itemprop="text" class="Box-body p-0 blob-wrapper data type-python  ">

        
<div class="js-check-bidi js-blob-code-container blob-code-content">

  

  <table data-hpc="" class="highlight tab-size js-file-line-container js-code-nav-container js-tagsearch-file" data-tab-size="8" data-paste-markdown-skip="" data-tagsearch-lang="Python" data-tagsearch-path="family-feud-judging-prompt.py">
        <tbody><tr>
          <td id="file-family-feud-judging-prompt-py-L1" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="1"></td>
          <td id="file-family-feud-judging-prompt-py-LC1" class="blob-code blob-code-inner js-file-line"><span class="pl-k">from</span> <span class="pl-s1">openai</span> <span class="pl-k">import</span> <span class="pl-v">OpenAI</span></td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L2" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="2"></td>
          <td id="file-family-feud-judging-prompt-py-LC2" class="blob-code blob-code-inner js-file-line"><span class="pl-k">import</span> <span class="pl-s1">os</span></td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L3" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="3"></td>
          <td id="file-family-feud-judging-prompt-py-LC3" class="blob-code blob-code-inner js-file-line"><span class="pl-k">import</span> <span class="pl-s1">click</span></td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L4" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="4"></td>
          <td id="file-family-feud-judging-prompt-py-LC4" class="blob-code blob-code-inner js-file-line">
</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L5" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="5"></td>
          <td id="file-family-feud-judging-prompt-py-LC5" class="blob-code blob-code-inner js-file-line"><span class="pl-s1">client</span> <span class="pl-c1">=</span> <span class="pl-v">OpenAI</span>(</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L6" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="6"></td>
          <td id="file-family-feud-judging-prompt-py-LC6" class="blob-code blob-code-inner js-file-line">  <span class="pl-s1">base_url</span><span class="pl-c1">=</span><span class="pl-s">"https://api.featherless.ai/v1"</span>,</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L7" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="7"></td>
          <td id="file-family-feud-judging-prompt-py-LC7" class="blob-code blob-code-inner js-file-line">  <span class="pl-s1">api_key</span><span class="pl-c1">=</span><span class="pl-s1">os</span>.<span class="pl-s1">environ</span>.<span class="pl-en">get</span>(<span class="pl-s">'FEATHERLESS_API_KEY'</span>)</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L8" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="8"></td>
          <td id="file-family-feud-judging-prompt-py-LC8" class="blob-code blob-code-inner js-file-line">)</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L9" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="9"></td>
          <td id="file-family-feud-judging-prompt-py-LC9" class="blob-code blob-code-inner js-file-line">
</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L10" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="10"></td>
          <td id="file-family-feud-judging-prompt-py-LC10" class="blob-code blob-code-inner js-file-line"><span class="pl-en">@<span class="pl-s1">click</span>.<span class="pl-en">command</span>()</span></td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L11" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="11"></td>
          <td id="file-family-feud-judging-prompt-py-LC11" class="blob-code blob-code-inner js-file-line"><span class="pl-en">@<span class="pl-s1">click</span>.<span class="pl-en">argument</span>(<span class="pl-s">'answer'</span>)</span></td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L12" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="12"></td>
          <td id="file-family-feud-judging-prompt-py-LC12" class="blob-code blob-code-inner js-file-line"><span class="pl-en">@<span class="pl-s1">click</span>.<span class="pl-en">option</span>(<span class="pl-s">'--model'</span>, <span class="pl-s1">default</span><span class="pl-c1">=</span><span class="pl-s">'meta-llama/Meta-Llama-3-8B-Instruct'</span>)</span></td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L13" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="13"></td>
          <td id="file-family-feud-judging-prompt-py-LC13" class="blob-code blob-code-inner js-file-line"><span class="pl-k">def</span> <span class="pl-en">quiz</span>(<span class="pl-s1">answer</span>, <span class="pl-s1">model</span>):</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L14" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="14"></td>
          <td id="file-family-feud-judging-prompt-py-LC14" class="blob-code blob-code-inner js-file-line">  <span class="pl-s1">question</span> <span class="pl-c1">=</span> <span class="pl-s">"Name something that might be wobbly"</span></td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L15" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="15"></td>
          <td id="file-family-feud-judging-prompt-py-LC15" class="blob-code blob-code-inner js-file-line">  <span class="pl-s1">official_answers</span> <span class="pl-c1">=</span> [</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L16" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="16"></td>
          <td id="file-family-feud-judging-prompt-py-LC16" class="blob-code blob-code-inner js-file-line">    <span class="pl-s">"Furniture"</span>,</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L17" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="17"></td>
          <td id="file-family-feud-judging-prompt-py-LC17" class="blob-code blob-code-inner js-file-line">    <span class="pl-s">"Person/A Drunk"</span>,</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L18" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="18"></td>
          <td id="file-family-feud-judging-prompt-py-LC18" class="blob-code blob-code-inner js-file-line">    <span class="pl-s">"Spinning toy/top"</span>,</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L19" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="19"></td>
          <td id="file-family-feud-judging-prompt-py-LC19" class="blob-code blob-code-inner js-file-line">    <span class="pl-s">"Shopping Cart"</span>,</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L20" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="20"></td>
          <td id="file-family-feud-judging-prompt-py-LC20" class="blob-code blob-code-inner js-file-line">  ]</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L21" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="21"></td>
          <td id="file-family-feud-judging-prompt-py-LC21" class="blob-code blob-code-inner js-file-line">
</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L22" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="22"></td>
          <td id="file-family-feud-judging-prompt-py-LC22" class="blob-code blob-code-inner js-file-line">  <span class="pl-s1">system_prompt</span> <span class="pl-c1">=</span> <span class="pl-s">f"""You are a judge on the show Family Feud.</span></td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L23" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="23"></td>
          <td id="file-family-feud-judging-prompt-py-LC23" class="blob-code blob-code-inner js-file-line"><span class="pl-s">You're going to receive a guess from a contestant. That guess is a guess at one of the answers to the question</span></td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L24" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="24"></td>
          <td id="file-family-feud-judging-prompt-py-LC24" class="blob-code blob-code-inner js-file-line"><span class="pl-s"></span></td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L25" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="25"></td>
          <td id="file-family-feud-judging-prompt-py-LC25" class="blob-code blob-code-inner js-file-line"><span class="pl-s">Question is "<span class="pl-s1"><span class="pl-kos">{</span><span class="pl-s1">question</span><span class="pl-kos">}</span></span>"</span></td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L26" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="26"></td>
          <td id="file-family-feud-judging-prompt-py-LC26" class="blob-code blob-code-inner js-file-line"><span class="pl-s"></span></td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L27" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="27"></td>
          <td id="file-family-feud-judging-prompt-py-LC27" class="blob-code blob-code-inner js-file-line"><span class="pl-s">The official answers are</span></td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L28" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="28"></td>
          <td id="file-family-feud-judging-prompt-py-LC28" class="blob-code blob-code-inner js-file-line"><span class="pl-s"><span class="pl-s1"><span class="pl-kos">{</span><span class="pl-s">'<span class="pl-cce">\n</span>'</span>.<span class="pl-en">join</span>([ <span class="pl-s">f"* <span class="pl-s1"><span class="pl-kos">{</span><span class="pl-s1">a</span><span class="pl-kos">}</span></span>"</span> <span class="pl-k">for</span> <span class="pl-s1">a</span> <span class="pl-c1">in</span> <span class="pl-s1">official_answers</span> ])<span class="pl-kos">}</span></span></span></td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L29" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="29"></td>
          <td id="file-family-feud-judging-prompt-py-LC29" class="blob-code blob-code-inner js-file-line"><span class="pl-s"></span></td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L30" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="30"></td>
          <td id="file-family-feud-judging-prompt-py-LC30" class="blob-code blob-code-inner js-file-line"><span class="pl-s">If the candidate's guess is a version of an official answer, please respond with the wording of the official answer.</span></td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L31" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="31"></td>
          <td id="file-family-feud-judging-prompt-py-LC31" class="blob-code blob-code-inner js-file-line"><span class="pl-s"></span></td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L32" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="32"></td>
          <td id="file-family-feud-judging-prompt-py-LC32" class="blob-code blob-code-inner js-file-line"><span class="pl-s">If the candidate's guess is not a version of any official answer, respond with "Survey says ... AERR!!"</span></td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L33" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="33"></td>
          <td id="file-family-feud-judging-prompt-py-LC33" class="blob-code blob-code-inner js-file-line"><span class="pl-s">"""</span></td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L34" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="34"></td>
          <td id="file-family-feud-judging-prompt-py-LC34" class="blob-code blob-code-inner js-file-line">
</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L35" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="35"></td>
          <td id="file-family-feud-judging-prompt-py-LC35" class="blob-code blob-code-inner js-file-line">  <span class="pl-en">print</span>(<span class="pl-s1">model</span>)</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L36" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="36"></td>
          <td id="file-family-feud-judging-prompt-py-LC36" class="blob-code blob-code-inner js-file-line">  <span class="pl-en">print</span>(<span class="pl-s1">system_prompt</span>)</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L37" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="37"></td>
          <td id="file-family-feud-judging-prompt-py-LC37" class="blob-code blob-code-inner js-file-line">
</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L38" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="38"></td>
          <td id="file-family-feud-judging-prompt-py-LC38" class="blob-code blob-code-inner js-file-line">  <span class="pl-s1">chat_completions_response</span> <span class="pl-c1">=</span> <span class="pl-s1">client</span>.<span class="pl-s1">chat</span>.<span class="pl-s1">completions</span>.<span class="pl-en">create</span>(</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L39" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="39"></td>
          <td id="file-family-feud-judging-prompt-py-LC39" class="blob-code blob-code-inner js-file-line">    <span class="pl-s1">model</span><span class="pl-c1">=</span><span class="pl-s1">model</span>,</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L40" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="40"></td>
          <td id="file-family-feud-judging-prompt-py-LC40" class="blob-code blob-code-inner js-file-line">    <span class="pl-s1">messages</span><span class="pl-c1">=</span>[</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L41" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="41"></td>
          <td id="file-family-feud-judging-prompt-py-LC41" class="blob-code blob-code-inner js-file-line">      { <span class="pl-s">"role"</span>: <span class="pl-s">"system"</span>, <span class="pl-s">"content"</span>: <span class="pl-s1">system_prompt</span>},</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L42" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="42"></td>
          <td id="file-family-feud-judging-prompt-py-LC42" class="blob-code blob-code-inner js-file-line">      { <span class="pl-s">"role"</span>: <span class="pl-s">"user"</span>, <span class="pl-s">"content"</span>: <span class="pl-s1">answer</span> }</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L43" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="43"></td>
          <td id="file-family-feud-judging-prompt-py-LC43" class="blob-code blob-code-inner js-file-line">    ],</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L44" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="44"></td>
          <td id="file-family-feud-judging-prompt-py-LC44" class="blob-code blob-code-inner js-file-line">    <span class="pl-s1">max_tokens</span><span class="pl-c1">=</span><span class="pl-c1">250</span>,</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L45" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="45"></td>
          <td id="file-family-feud-judging-prompt-py-LC45" class="blob-code blob-code-inner js-file-line">  )</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L46" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="46"></td>
          <td id="file-family-feud-judging-prompt-py-LC46" class="blob-code blob-code-inner js-file-line">  <span class="pl-en">print</span>(<span class="pl-s1">chat_completions_response</span>.<span class="pl-s1">choices</span>[<span class="pl-c1">0</span>].<span class="pl-s1">message</span>.<span class="pl-s1">content</span>)</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L47" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="47"></td>
          <td id="file-family-feud-judging-prompt-py-LC47" class="blob-code blob-code-inner js-file-line">
</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L48" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="48"></td>
          <td id="file-family-feud-judging-prompt-py-LC48" class="blob-code blob-code-inner js-file-line"><span class="pl-k">if</span> <span class="pl-s1">__name__</span> <span class="pl-c1">==</span> <span class="pl-s">"__main__"</span>:</td>
        </tr>
        <tr>
          <td id="file-family-feud-judging-prompt-py-L49" class="blob-num js-line-number js-code-nav-line-number js-blob-rnum" data-line-number="49"></td>
          <td id="file-family-feud-judging-prompt-py-LC49" class="blob-code blob-code-inner js-file-line">  <span class="pl-en">quiz</span>()</td>
        </tr>
  </tbody></table>
</div>


    </div>

  </div>
</div>

      </div>
      <div class="gist-meta">
        <a href="https://gist.github.com/wxgeorge/4d8f812c381e62dc91c006296d9b0561/raw/8beabc223d1578a0eee77c2e7bfe02a4af378992/family-feud-judging-prompt.py" style="float:right" class="Link--inTextBlock">view raw</a>
        <a href="https://gist.github.com/wxgeorge/4d8f812c381e62dc91c006296d9b0561#file-family-feud-judging-prompt-py" class="Link--inTextBlock">
          family-feud-judging-prompt.py
        </a>
        hosted with &#10084; by <a class="Link--inTextBlock" href="https://github.com">GitHub</a>
      </div>
    </div>
</div>
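<p>The judge replies in free text, so the game still has to map that reply back onto the answer board. Here is a minimal sketch of one way to do that, not the code the game actually uses: <code>parse_judgement</code>, <code>MISS_MARKER</code>, and the normalization rule are hypothetical helpers introduced for illustration. The sketch assumes the model follows the prompt above closely enough to either echo an official answer or emit the &#8220;AERR&#8221; miss phrase.</p>

```python
from typing import Optional, Sequence

# Same board as in the gist above.
OFFICIAL_ANSWERS = ["Furniture", "Person/A Drunk", "Spinning toy/top", "Shopping Cart"]

# Assumption: a miss can be recognized by the "AERR" in the prompt's miss phrase.
MISS_MARKER = "aerr"


def _normalize(text: str) -> str:
    """Lowercase and keep only letters and digits, so spacing and punctuation don't matter."""
    return "".join(ch for ch in text.lower() if ch.isalnum())


def parse_judgement(reply: str, official_answers: Sequence[str]) -> Optional[str]:
    """Return the official answer named in the judge's reply, or None for a miss."""
    if MISS_MARKER in reply.lower():
        return None
    normalized_reply = _normalize(reply)
    for answer in official_answers:
        if _normalize(answer) in normalized_reply:
            return answer
    # The reply followed neither format; treat it as a miss rather than guess.
    return None


if __name__ == "__main__":
    print(parse_judgement("Person / A drunk", OFFICIAL_ANSWERS))
    print(parse_judgement("Survey says ... AERR!!", OFFICIAL_ANSWERS))
```

<p>Treating an unrecognized reply as a miss is a design choice: it keeps a chatty or off-format model from accidentally revealing an answer on the board.</p>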
</div><p>But catching the comedic subtlety of the game is something else entirely. Despite many prompt iterations and many different models (including Llama 3.1 405B!), I couldn&#8217;t get the judge to accept &#8220;A baby&#8221; as a variation of the answer &#8220;Person / A drunk&#8221;. If you get a different result, I&#8217;d love to hear about it in the comments!</p><h1>Choosing an inference provider</h1><p>There is a growing list of inference providers to choose from. Every foundation model company provides API access to its models (OpenAI, Anthropic, etc.); there are also companies focused solely on inference provision (friendli.ai, replicate.ai, together.ai), and aggregators (e.g. openrouter.ai).</p><p>What&#8217;s unique about <a href="https://featherless.ai">featherless.ai</a> is the number of models available <em>serverlessly</em>, i.e. without paying for dedicated GPUs. Other serverless providers make only the most popular models available because, behind the scenes, each model runs on dedicated infrastructure whose cost is amortized across a large enough group of consumers. Featherless is different. <em>Every</em> fine-tune of a large class of models (i.e. all fine-tunes of a specific set of base architectures) is available serverlessly. 
Our goal is to make every public model on HuggingFace available for <em>serverless </em>inference and we currently have over 2k models available.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!7SEz!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a13354d-e4bb-45e1-964d-5625cc49bf30_2860x1452.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!7SEz!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a13354d-e4bb-45e1-964d-5625cc49bf30_2860x1452.png 424w, https://substackcdn.com/image/fetch/$s_!7SEz!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a13354d-e4bb-45e1-964d-5625cc49bf30_2860x1452.png 848w, https://substackcdn.com/image/fetch/$s_!7SEz!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a13354d-e4bb-45e1-964d-5625cc49bf30_2860x1452.png 1272w, https://substackcdn.com/image/fetch/$s_!7SEz!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a13354d-e4bb-45e1-964d-5625cc49bf30_2860x1452.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!7SEz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a13354d-e4bb-45e1-964d-5625cc49bf30_2860x1452.png" width="1456" height="739" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6a13354d-e4bb-45e1-964d-5625cc49bf30_2860x1452.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:739,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:305513,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!7SEz!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a13354d-e4bb-45e1-964d-5625cc49bf30_2860x1452.png 424w, https://substackcdn.com/image/fetch/$s_!7SEz!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a13354d-e4bb-45e1-964d-5625cc49bf30_2860x1452.png 848w, https://substackcdn.com/image/fetch/$s_!7SEz!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a13354d-e4bb-45e1-964d-5625cc49bf30_2860x1452.png 1272w, https://substackcdn.com/image/fetch/$s_!7SEz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a13354d-e4bb-45e1-964d-5625cc49bf30_2860x1452.png 1456w" sizes="100vw" loading="lazy"></picture></div></a><figcaption class="image-caption">https://featherless.ai/models</figcaption></figure></div><p>The next closest service in model count is openrouter.ai, which aggregates providers and offers fewer than 10% as many models.</p><h2><strong>Model Selection and Testing</strong></h2><p>Playtesting is an important part of any game development, and while playing Featherless Feud we tried a number of different models. The primary issues were a lack of diverse answers, or answers that were too ambiguous for the game format. We&#8217;ve left the utility box that overrides the model in the lower right-hand corner of the game; it showcases one of the strengths of the featherless platform: take any of the 2k models from our models page, drop it into that selector, and the game is instantly updated. 
Considering that download times alone run to tens of minutes even for small models, you might appreciate this feat.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://huggingface.co/spaces/Darok/Featherless-Feud" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!CWkG!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d194cd-3655-478d-ae13-e60b406e58fc_2065x1162.png 424w, https://substackcdn.com/image/fetch/$s_!CWkG!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d194cd-3655-478d-ae13-e60b406e58fc_2065x1162.png 848w, https://substackcdn.com/image/fetch/$s_!CWkG!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d194cd-3655-478d-ae13-e60b406e58fc_2065x1162.png 1272w, https://substackcdn.com/image/fetch/$s_!CWkG!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d194cd-3655-478d-ae13-e60b406e58fc_2065x1162.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!CWkG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d194cd-3655-478d-ae13-e60b406e58fc_2065x1162.png" width="1456" height="819" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f9d194cd-3655-478d-ae13-e60b406e58fc_2065x1162.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://huggingface.co/spaces/Darok/Featherless-Feud&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!CWkG!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d194cd-3655-478d-ae13-e60b406e58fc_2065x1162.png 424w, https://substackcdn.com/image/fetch/$s_!CWkG!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d194cd-3655-478d-ae13-e60b406e58fc_2065x1162.png 848w, https://substackcdn.com/image/fetch/$s_!CWkG!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d194cd-3655-478d-ae13-e60b406e58fc_2065x1162.png 1272w, https://substackcdn.com/image/fetch/$s_!CWkG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d194cd-3655-478d-ae13-e60b406e58fc_2065x1162.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" 
stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">the text box in the lower right</figcaption></figure></div><p>We tested a range of model architectures, sizes, and datasets, including:</p><ul><li><p><a href="https://featherless.ai/models/failspy/Meta-Llama-3-8B-Instruct-abliterated-v3">failspy/Meta-Llama-3-8B-Instruct-abliterated-v3</a> - we chose this model to explore whether the abliteration approach would increase the variety of questions in an interesting way (the results were inconclusive)</p></li><li><p><a href="https://featherless.ai/models/Qwen/Qwen1.5-32B">qwen/qwen1.5-32b</a> - The Qwen models feel very different from the Llama models - maybe that&#8217;s the culture of Alibaba versus Meta coming through &#8230;</p></li><li><p><a href="https://featherless.ai/models/meta-llama/Meta-Llama-3.1-70B-Instruct">meta-llama/Meta-Llama-3.1-70B-Instruct</a><strong>:</strong> While this 70B model offered superior judgment in evaluating user answers, its slower response time made it less suitable for rapid 
iteration during gameplay. However, it was useful in scenarios where answer accuracy was critical.</p></li><li><p><a href="https://featherless.ai/models/alpindale/magnum-72b-v1">alpindale/magnum-72b-v1</a><strong>:</strong> Initially, this 72B model seemed promising due to its creative output. However, its tendency toward erratic responses made it unsuitable for the structured requirements of Featherless Feud.</p></li><li><p><a href="https://featherless.ai/models/Kooten/Mistral-Nemo-Instruct-2407-norefuse-OAS">Kooten/Mistral-Nemo-Instruct-2407-norefuse-OAS</a></p></li><li><p><strong><a href="https://featherless.ai/models/Sao10K/L3-8B-Stheno-v3.2">Sao10K/L3-8B-Stheno-v3.2</a>:</strong> This 8B model was selected for its balance between speed and output quality. It consistently produced valid JSON responses, making it a reliable choice for generating game content efficiently.</p></li><li><p><a href="https://featherless.ai/models/anthracite-org/magnum-32b-v2">anthracite-org/magnum-32b-v2</a> - a qwen2-32b fine-tune intended for role-play</p></li></ul><p>At a high level, the more RP-focussed models tended to generate a greater variety of questions, and the bigger models didn&#8217;t improve quality enough to justify their longer running time. So we settled on Stheno, as it balanced variety of output with the fastest inference time.</p><p>Try running <a href="https://huggingface.co/spaces/Darok/Featherless-Feud">the game</a> with some of the models and let us know in the comments what differences you notice!</p><h1>Wrapping up</h1><p>We learned a ton on this journey, and hope you did too. Perhaps most surprising is that capturing the comedic subtlety of Family Feud escaped us even when using the more capable (i.e. larger) open-source LLMs.</p><p>Thanks for reading! 
Let us know how you enjoyed this article, either in the comments or with an email to hello@featherless.ai, and head on over to <a href="https://featherless.ai">featherless.ai</a> to experiment with the models listed here and more.</p><p>And again, <a href="https://huggingface.co/spaces/Darok/Featherless-Feud">check out the running game here</a>.</p>]]></content:encoded></item><item><title><![CDATA[Model Support Summary + new this week]]></title><description><![CDATA[Nemo 12B and Qwen2 32B latest additions for a total of 12 model families]]></description><link>https://substack.recursal.ai/p/model-support-summary-new-this-week</link><guid isPermaLink="false">https://substack.recursal.ai/p/model-support-summary-new-this-week</guid><dc:creator><![CDATA[Wesley George]]></dc:creator><pubDate>Sat, 03 Aug 2024 02:34:25 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!dpMl!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1eacd009-a430-4bba-9f62-7aba44a59ac5_1118x570.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Featherless is a new kind of inference provider: we are building serverless inference for <em>all</em> of Hugging Face. We&#8217;re working through this one <em>architecture</em> at a time (e.g. Llama 3.1 8B).</p><p>Since our initial launch in June, we&#8217;ve been adding architectures, with fanfare only in our Discord. 
But with Mistral Nemo 12B and Qwen2 32B gaining support this week, taking the number of supported architectures to 12 and the total count of models available for inference to nearly 2k (1,922 at time of writing), this post seems overdue.</p><p>The full list of supported architectures is available on <a href="https://featherless.ai/about">our about page</a>, but the timeline looks like this:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!dpMl!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1eacd009-a430-4bba-9f62-7aba44a59ac5_1118x570.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!dpMl!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1eacd009-a430-4bba-9f62-7aba44a59ac5_1118x570.png 424w, https://substackcdn.com/image/fetch/$s_!dpMl!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1eacd009-a430-4bba-9f62-7aba44a59ac5_1118x570.png 848w, https://substackcdn.com/image/fetch/$s_!dpMl!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1eacd009-a430-4bba-9f62-7aba44a59ac5_1118x570.png 1272w, https://substackcdn.com/image/fetch/$s_!dpMl!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1eacd009-a430-4bba-9f62-7aba44a59ac5_1118x570.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!dpMl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1eacd009-a430-4bba-9f62-7aba44a59ac5_1118x570.png" width="1118" height="570" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1eacd009-a430-4bba-9f62-7aba44a59ac5_1118x570.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:570,&quot;width&quot;:1118,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:112730,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!dpMl!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1eacd009-a430-4bba-9f62-7aba44a59ac5_1118x570.png 424w, https://substackcdn.com/image/fetch/$s_!dpMl!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1eacd009-a430-4bba-9f62-7aba44a59ac5_1118x570.png 848w, https://substackcdn.com/image/fetch/$s_!dpMl!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1eacd009-a430-4bba-9f62-7aba44a59ac5_1118x570.png 1272w, https://substackcdn.com/image/fetch/$s_!dpMl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1eacd009-a430-4bba-9f62-7aba44a59ac5_1118x570.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" 
xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>When you break that model registration out over time, it looks something like this</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!fqyB!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6289878-ff73-4ab8-abe4-1cd5e4f37278_600x371.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!fqyB!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6289878-ff73-4ab8-abe4-1cd5e4f37278_600x371.png 424w, https://substackcdn.com/image/fetch/$s_!fqyB!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6289878-ff73-4ab8-abe4-1cd5e4f37278_600x371.png 848w, 
https://substackcdn.com/image/fetch/$s_!fqyB!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6289878-ff73-4ab8-abe4-1cd5e4f37278_600x371.png 1272w, https://substackcdn.com/image/fetch/$s_!fqyB!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6289878-ff73-4ab8-abe4-1cd5e4f37278_600x371.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!fqyB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6289878-ff73-4ab8-abe4-1cd5e4f37278_600x371.png" width="600" height="371" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d6289878-ff73-4ab8-abe4-1cd5e4f37278_600x371.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:371,&quot;width&quot;:600,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:24852,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!fqyB!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6289878-ff73-4ab8-abe4-1cd5e4f37278_600x371.png 424w, https://substackcdn.com/image/fetch/$s_!fqyB!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6289878-ff73-4ab8-abe4-1cd5e4f37278_600x371.png 848w, 
https://substackcdn.com/image/fetch/$s_!fqyB!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6289878-ff73-4ab8-abe4-1cd5e4f37278_600x371.png 1272w, https://substackcdn.com/image/fetch/$s_!fqyB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6289878-ff73-4ab8-abe4-1cd5e4f37278_600x371.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Smaller models are easier to fine-tune (they take less time and money), which accounts for the larger proportion of 7B and 8B models (versus 70B and 72B). 
Likewise, the older a model is, the more time folks have had to fine-tune it (hence many Llama 2 tunes, but few Nemos and Qwens).</p><p>If you want to weigh in on which model architecture we support next, <a href="https://discord.gg/5Fw9dFH65S">join our Discord</a>.</p><p>Also a plug for fine-tuners: we&#8217;re working on a set of features that will benefit model creators. If you are doing some fine-tuning, we&#8217;d love to connect with you for feedback on these upcoming features.</p>]]></content:encoded></item><item><title><![CDATA[🪶 Featherless.ai referral program]]></title><description><![CDATA[Invite a friend, and if they sign up for a plan, it's $10 off the next bill for both of you]]></description><link>https://substack.recursal.ai/p/featherlessai-referral-program</link><guid isPermaLink="false">https://substack.recursal.ai/p/featherlessai-referral-program</guid><dc:creator><![CDATA[Wesley George]]></dc:creator><pubDate>Mon, 24 Jun 2024 07:18:29 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!w8-w!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff3e5da1-6cdf-40e2-a4da-8f8a1c1c3a95_2428x1814.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!w8-w!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff3e5da1-6cdf-40e2-a4da-8f8a1c1c3a95_2428x1814.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!w8-w!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff3e5da1-6cdf-40e2-a4da-8f8a1c1c3a95_2428x1814.png 424w, 
https://substackcdn.com/image/fetch/$s_!w8-w!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff3e5da1-6cdf-40e2-a4da-8f8a1c1c3a95_2428x1814.png 848w, https://substackcdn.com/image/fetch/$s_!w8-w!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff3e5da1-6cdf-40e2-a4da-8f8a1c1c3a95_2428x1814.png 1272w, https://substackcdn.com/image/fetch/$s_!w8-w!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff3e5da1-6cdf-40e2-a4da-8f8a1c1c3a95_2428x1814.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!w8-w!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff3e5da1-6cdf-40e2-a4da-8f8a1c1c3a95_2428x1814.png" width="1456" height="1088" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ff3e5da1-6cdf-40e2-a4da-8f8a1c1c3a95_2428x1814.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1088,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:639946,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!w8-w!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff3e5da1-6cdf-40e2-a4da-8f8a1c1c3a95_2428x1814.png 424w, 
https://substackcdn.com/image/fetch/$s_!w8-w!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff3e5da1-6cdf-40e2-a4da-8f8a1c1c3a95_2428x1814.png 848w, https://substackcdn.com/image/fetch/$s_!w8-w!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff3e5da1-6cdf-40e2-a4da-8f8a1c1c3a95_2428x1814.png 1272w, https://substackcdn.com/image/fetch/$s_!w8-w!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff3e5da1-6cdf-40e2-a4da-8f8a1c1c3a95_2428x1814.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" 
y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Like being able to access 450+ models from Hugging Face, and know someone who would love to try it? </p><p>Refer them to <a href="https://featherless.ai">featherless.ai</a> and both of you get <strong>$10 OFF</strong> your next monthly bill!</p><p><strong>Refer 12</strong> of your friends and you can have a <strong>full year</strong> of our basic plan for free! (The discount stacks!)</p><p>Enjoy &#129303; your models with <a href="https://featherless.ai">featherless.ai</a>!</p><blockquote><p>Reminder: we never log your messages, prompt or completion, on any of our models &#128521;</p></blockquote>]]></content:encoded></item><item><title><![CDATA[🚀 Launching 🪶 Featherless.AI ]]></title><description><![CDATA[Run any &#129433; model from Hugging Face, instantly.]]></description><link>https://substack.recursal.ai/p/launching-featherlessai</link><guid isPermaLink="false">https://substack.recursal.ai/p/launching-featherlessai</guid><dc:creator><![CDATA[Eugene Cheah]]></dc:creator><pubDate>Mon, 24 Jun 2024 07:17:49 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!DODQ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50695813-0645-4727-a42c-beacb3ea84aa_1216x832.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!DODQ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50695813-0645-4727-a42c-beacb3ea84aa_1216x832.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" 
srcset="https://substackcdn.com/image/fetch/$s_!DODQ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50695813-0645-4727-a42c-beacb3ea84aa_1216x832.png 424w, https://substackcdn.com/image/fetch/$s_!DODQ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50695813-0645-4727-a42c-beacb3ea84aa_1216x832.png 848w, https://substackcdn.com/image/fetch/$s_!DODQ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50695813-0645-4727-a42c-beacb3ea84aa_1216x832.png 1272w, https://substackcdn.com/image/fetch/$s_!DODQ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50695813-0645-4727-a42c-beacb3ea84aa_1216x832.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!DODQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50695813-0645-4727-a42c-beacb3ea84aa_1216x832.png" width="471" height="322.2631578947368" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/50695813-0645-4727-a42c-beacb3ea84aa_1216x832.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:832,&quot;width&quot;:1216,&quot;resizeWidth&quot;:471,&quot;bytes&quot;:1423173,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" 
srcset="https://substackcdn.com/image/fetch/$s_!DODQ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50695813-0645-4727-a42c-beacb3ea84aa_1216x832.png 424w, https://substackcdn.com/image/fetch/$s_!DODQ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50695813-0645-4727-a42c-beacb3ea84aa_1216x832.png 848w, https://substackcdn.com/image/fetch/$s_!DODQ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50695813-0645-4727-a42c-beacb3ea84aa_1216x832.png 1272w, https://substackcdn.com/image/fetch/$s_!DODQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50695813-0645-4727-a42c-beacb3ea84aa_1216x832.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" 
stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Featherless</figcaption></figure></div><p>Earlier this year, we took the world by storm when we announced that <a href="https://substack.recursal.ai/cp/143699561">our Eagle model had beaten Meta&#8217;s Llama-2</a> while taking less training time, making it the world&#8217;s most efficient model. </p><p>While Eagle still packs a powerful punch, and has been helping diverse use cases from multilingual applications to content moderation, gaming, and role-play, we&#8217;ve been working on something new, to bring our insights on efficiency to a much broader realm.</p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;d3479942-797f-49ec-95a0-94bd4081f821&quot;,&quot;duration&quot;:null}"></div><p>Just this Friday, we launched <a href="https://featherless.ai">Featherless AI</a>, which enables <em>serverless</em> inference of <em>every</em> Llama-3 8B and 70B model on Hugging Face we could get our hands on. </p><p>That&#8217;s over 475 models, with many more being added daily.</p><p>This allows anyone to quickly experiment with, try, and choose the latest and best models from Hugging Face, starting from $10 / month.</p><p>Previously, using even the smallest fine-tune required dedicated hardware, which translates to real hosting costs, whether you&#8217;re experimenting with a model or ramping up production use. 
This is a barrier to a host of use cases, <em>particularly agents</em>, where each step in the agent computation might benefit from a particular model.</p><p>The goal of Featherless is to make every model on Hugging Face available serverlessly, and with these Llama &amp; RWKV based models, we&#8217;re a big step along the way. </p><p>With Featherless, you can experiment with an entirely new range of models at completely different economics.</p><ul><li><p><a href="https://featherless.ai">Check out the site: featherless.ai</a></p></li><li><p><a href="https://twitter.com/picocreator/status/1804197546472149117">Retweet our launch tweet</a></p></li><li><p><a href="https://www.producthunt.com/posts/featherless-llm">Support our Product Hunt launch</a></p></li></ul>]]></content:encoded></item><item><title><![CDATA[Featherless: an introduction]]></title><description><![CDATA[making every hugging face model available for inference and why it matters]]></description><link>https://substack.recursal.ai/p/featherless-an-introduction</link><guid isPermaLink="false">https://substack.recursal.ai/p/featherless-an-introduction</guid><dc:creator><![CDATA[Wesley George]]></dc:creator><pubDate>Fri, 31 May 2024 20:00:00 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/39692c80-a092-44cd-8fb3-840a3dbdedea_741x661.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h1>There&#8217;s a custom model for that</h1><p>There are more than 100,000 distinct language models on <a href="https://huggingface.co/models?pipeline_tag=text-generation&amp;sort=trending">the Hugging Face Hub</a>.</p><p>This is the output of an enormous amount of creative energy: built by over 10k AI enthusiasts, these models include impressive attempts to improve upon the best-known language models like ChatGPT.</p><p>A lot of airtime goes to innovations on <em>technical</em> elements of language models (e.g. context length). 
And while those are important, a huge part of what we&#8217;re left with is a collection of a great many domain-specific LLMs, such as models</p><ul><li><p>for specific languages (e.g. for <a href="https://huggingface.co/VAGOsolutions/Llama-3-SauerkrautLM-8b-Instruct">German</a>, <a href="https://huggingface.co/IlyaGusev/saiga_llama3_8b">Russian</a> or <a href="https://huggingface.co/shenzhi-wang/Llama3-8B-Chinese-Chat">Chinese</a>)</p></li><li><p>for creative writing (e.g. tdrussell&#8217;s <a href="https://huggingface.co/tdrussell/Llama-3-70B-Instruct-Storywriter">Llama-3-70b-Instruct-Storywriter</a>)</p></li><li><p>with detailed medical knowledge (e.g. <a href="https://huggingface.co/BioMistral/BioMistral-7B">BioMistral/BioMistral-7B</a>)</p></li><li><p>that can understand SEC filings (e.g. <a href="https://huggingface.co/arcee-ai/Llama-3-SEC-Base">arcee-ai/Llama-3-SEC</a>)</p></li><li><p>for legal work (e.g. <a href="https://huggingface.co/umarbutler/open-australian-legal-llm">umarbutler/open-australian-legal-llm</a> - a model trained on a dataset of Australian law curated by the Australian Attorney General&#8217;s office!)</p></li><li><p>for novelty / character (e.g. <a href="https://huggingface.co/failspy/Llama-3-8B-Instruct-MopeyMule">MopeyMule</a>)</p></li></ul><h1>but it&#8217;s hard to use</h1><p>So how do you use these things?</p><p>Although HuggingFace is the de facto place to <em>host</em> models, you&#8217;re hard pressed to <em><strong>use</strong></em> them there. 
If you&#8217;ve spent time on the site, you may have forgotten that there is a specific part of the model card designed to let you test the model: it&#8217;s typically disabled for models 8B and up, which is the vast majority of models.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!3J4g!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6053efd7-9a2a-4390-b141-a3fa64040c53_649x163.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!3J4g!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6053efd7-9a2a-4390-b141-a3fa64040c53_649x163.png 424w, https://substackcdn.com/image/fetch/$s_!3J4g!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6053efd7-9a2a-4390-b141-a3fa64040c53_649x163.png 848w, https://substackcdn.com/image/fetch/$s_!3J4g!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6053efd7-9a2a-4390-b141-a3fa64040c53_649x163.png 1272w, https://substackcdn.com/image/fetch/$s_!3J4g!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6053efd7-9a2a-4390-b141-a3fa64040c53_649x163.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!3J4g!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6053efd7-9a2a-4390-b141-a3fa64040c53_649x163.png" width="649" height="163" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6053efd7-9a2a-4390-b141-a3fa64040c53_649x163.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:163,&quot;width&quot;:649,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:17213,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!3J4g!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6053efd7-9a2a-4390-b141-a3fa64040c53_649x163.png 424w, https://substackcdn.com/image/fetch/$s_!3J4g!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6053efd7-9a2a-4390-b141-a3fa64040c53_649x163.png 848w, https://substackcdn.com/image/fetch/$s_!3J4g!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6053efd7-9a2a-4390-b141-a3fa64040c53_649x163.png 1272w, https://substackcdn.com/image/fetch/$s_!3J4g!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6053efd7-9a2a-4390-b141-a3fa64040c53_649x163.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p>This kind of UX is a consequence of the fact that running these models requires operating expensive hardware (i.e. GPUs). You can rent these GPUs, but you&#8217;re looking at $2 / hour at minimum, and that would only cover you for the smaller models.</p><p>If you have a budget to experiment with, you can try launching the model on a dedicated service. 
However, this will also require patience. The most natural service for this is HuggingFace&#8217;s inference endpoints service, which I haven&#8217;t gotten to work, despite the suggestion that I should be able to launch a model in a few clicks.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!eMSC!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35067964-2224-4610-b606-8d011e821c9f_621x132.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!eMSC!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35067964-2224-4610-b606-8d011e821c9f_621x132.png 424w, https://substackcdn.com/image/fetch/$s_!eMSC!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35067964-2224-4610-b606-8d011e821c9f_621x132.png 848w, https://substackcdn.com/image/fetch/$s_!eMSC!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35067964-2224-4610-b606-8d011e821c9f_621x132.png 1272w, https://substackcdn.com/image/fetch/$s_!eMSC!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35067964-2224-4610-b606-8d011e821c9f_621x132.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!eMSC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35067964-2224-4610-b606-8d011e821c9f_621x132.png" width="621" height="132" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/35067964-2224-4610-b606-8d011e821c9f_621x132.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:132,&quot;width&quot;:621,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:17896,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!eMSC!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35067964-2224-4610-b606-8d011e821c9f_621x132.png 424w, https://substackcdn.com/image/fetch/$s_!eMSC!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35067964-2224-4610-b606-8d011e821c9f_621x132.png 848w, https://substackcdn.com/image/fetch/$s_!eMSC!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35067964-2224-4610-b606-8d011e821c9f_621x132.png 1272w, https://substackcdn.com/image/fetch/$s_!eMSC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35067964-2224-4610-b606-8d011e821c9f_621x132.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p><a href="http://openrouter.ai">Openrouter</a> is probably the cloud service with the most options, and it offers per-token pricing, but it doesn&#8217;t have any of the models listed above. 
Nor does it let you bring your own model.</p><p>You can try to run a model locally, and there are a host of tools that have significantly simplified the process (<a href="https://www.nomic.ai/gpt4all">gpt4all</a>, <a href="http://ollama.com">ollama</a> and <a href="http://cortex.so">cortex</a> are the more popular ones). But they still require technical orientation, patience, and, most importantly, powerful computing hardware.</p><h1>Experiment Faster with Featherless</h1><p>The goal of featherless is to make every LLM on Hugging Face available serverlessly. Right now, our collection is up to 1,501 models, making it the largest collection of models available for inference from any inference provider.</p><p>You can test any of the models on the site, but we&#8217;re expecting that you&#8217;ll plug this directly into a client, whether that&#8217;s to chat with it as a human, e.g. via <a href="https://www.typingmind.com/">TypingMind</a>, <a href="http://jan.ai">Jan</a>, or <a href="http://sillytavern.app">SillyTavern</a>, or to use the API directly, e.g. in a raw Python program or in some higher-level framework like <a href="https://www.langchain.com/">LangChain</a> or <a href="https://www.llamaindex.ai/">LlamaIndex</a>.</p><p>Check out our <a href="http://Terms">terms of service</a> and <a href="https://featherless.ai/privacy">privacy policy.</a> </p>]]></content:encoded></item></channel></rss>