Large-Language-Models
Injected Approval: A Low Effort Local LLM Jailbreak
Large-Language-Models
Jailbreaking
Cybersecurity
A quick look into into one of the simplest attacks on LLM safety mitigations, revealing large gaps in current approaches from major tech companies.
Leaderboard of Open LLMs Ranked by LLM Judges
Machine-Learning
Large-Language-Models
Leaderboard
Evaluation
An evaluation of recent consumer-grade open LLMs based on ratings generated through an LLM-as-a-judge framework.
Evaluating LLM Performance via LLM Judges
Machine-Learning
Large-Language-Models
Evaluation
Methodology
Extra
Methodology details for how LLMs can rate the performance of other LLMs.
How to Run LLMs Larger than RAM
·
Machine-Learning
Large-Language-Models
Linux
A short experiment on running larger LLMs on low-end consumer hardware, with comments on performance trade-offs and practicality.