Merging models // combining weights — spherical linear interpolation (SLERP)

sbagency
Dec 31, 2023

https://colab.research.google.com/drive/1_JS7JKJAQozD48-LhYdegcuuZ2ddgXfr
https://www.linkedin.com/posts/maxime-labonne_heres-how-i-created-the-second-best-performing-activity-7147216790454992896-wibO
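
For readers who want to see what SLERP does to a pair of weight tensors, here is a minimal NumPy sketch. It is not the code from the notebook linked above; the function names `slerp` and `merge_state_dicts`, the `eps` parameter, and the parallel-vector threshold are all assumptions made for illustration. The idea is to treat each tensor as one flat vector, measure the angle between the two vectors, and interpolate along the arc between them rather than along a straight line.

```python
import numpy as np

def slerp(t, w0, w1, eps=1e-8):
    """Spherical linear interpolation between two same-shaped weight tensors.

    t = 0 returns w0, t = 1 returns w1. Illustrative sketch only.
    """
    v0 = np.asarray(w0, dtype=np.float64).ravel()
    v1 = np.asarray(w1, dtype=np.float64).ravel()

    # Measure the angle on normalized copies; eps guards against zero norms.
    n0 = v0 / (np.linalg.norm(v0) + eps)
    n1 = v1 / (np.linalg.norm(v1) + eps)
    dot = np.clip(np.dot(n0, n1), -1.0, 1.0)
    omega = np.arccos(dot)
    sin_omega = np.sin(omega)

    if sin_omega < 1e-6:
        # Nearly parallel (or anti-parallel) vectors: the SLERP formula
        # divides by ~0, so fall back to plain linear interpolation.
        merged = (1.0 - t) * v0 + t * v1
    else:
        merged = (np.sin((1.0 - t) * omega) / sin_omega) * v0 \
               + (np.sin(t * omega) / sin_omega) * v1

    return merged.reshape(np.shape(w0))

def merge_state_dicts(sd0, sd1, t=0.5):
    """Hypothetical helper: apply slerp to every tensor shared by name."""
    return {name: slerp(t, sd0[name], sd1[name]) for name in sd0 if name in sd1}

if __name__ == "__main__":
    a = {"layer.weight": np.array([[1.0, 0.0]])}
    b = {"layer.weight": np.array([[0.0, 1.0]])}
    print(merge_state_dicts(a, b, t=0.5)["layer.weight"])
```

In this sketch a single `t` is used for every tensor. Merging tools typically let you vary the interpolation factor per layer or per module type, which this example does not attempt.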

How good is the model in practice? Pretty good, but there’s some clear leaderboard hacking with this technique. To be clear, it’s still very experimental and I don’t think those are the best 7B param LLMs you can find.
