chadmin/scalable_oversight
A presentation that explores the modern techniques to ke you AI under control.
Updated 2025-07-12 15:27:02 +00:00
chadmin/pres_security_benchmarking_llm
A presentation that discusses the security challenges and benchmarks for evaluating the robustness of large language Models against adversarial attacks and vulnerabilities.
Updated 2025-07-12 15:26:25 +00:00
chadmin/abuses_and_vulnerabilities_in_ai_models
AI Model Vulnerabilities: Backdoors in LLMs and Beyond
Updated 2025-07-09 22:53:54 +00:00