Merging models // combining weights — spherical linear interpolation (SLERP)

Dec 31, 2023

--

https://colab.research.google.com/drive/1_JS7JKJAQozD48-LhYdegcuuZ2ddgXfr

https://www.linkedin.com/posts/maxime-labonne_heres-how-i-created-the-second-best-performing-activity-7147216790454992896-wibO

How good is the model in practice? Pretty good, but there’s some clear leaderboard hacking with this technique. To be clear, it’s still very experimental and I don’t think those are the best 7B param LLMs you can find.

Written by sbagency

Tech/biz consulting, analytics, research for founders, startups, corps and govs.

No responses yet

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams