Injected Approval: A Low Effort Local LLM Jailbreak · 20 December 2024 · 4 mins · Large-Language-Models Jailbreaking Cybersecurity · A quick look into one of the simplest attacks on LLM safety mitigations, revealing large gaps in current approaches from major tech companies.
Leaderboard of Open LLMs Ranked by LLM Judges · 15 October 2024 · 5 mins · Machine-Learning Large-Language-Models Leaderboard Evaluation · An evaluation of recent consumer-grade open LLMs based on ratings generated through an LLM-as-a-judge framework.
Evaluating LLM Performance via LLM Judges · 15 October 2024 · 4 mins · Machine-Learning Large-Language-Models Evaluation Methodology Extra · Methodology details for how LLMs can rate the performance of other LLMs.
How to Run LLMs Larger than RAM · 30 September 2024 · Updated: 3 February 2025 · 4 mins · Machine-Learning Large-Language-Models Linux · A short experiment on running larger LLMs on low-end consumer hardware, with comments on performance trade-offs and practicality.