An evaluation of recent consumer-grade open LLMs based on ratings generated through an LLM-as-a-judge framework.
Methodology details for how LLMs can rate the performance of other LLMs.
4 mins
A short experiment on running larger LLMs on low-end consumer hardware, with comments on performance trade-offs and practicality.
1 min
A brief overview of my background in computer science and mathematics, my personal projects, and my work as a quantitative analyst.