Theory of Machine Learning, EPFL
Popular repositories Loading
- 
      llm-adaptive-attacksllm-adaptive-attacks PublicJailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks [ICLR 2025] 
- 
      understanding-fast-adv-trainingunderstanding-fast-adv-training PublicUnderstanding and Improving Fast Adversarial Training [NeurIPS 2020] 
- 
      llm-past-tensellm-past-tense PublicDoes Refusal Training in LLMs Generalize to the Past Tense? [ICLR 2025] 
- 
      why-weight-decaywhy-weight-decay PublicWhy Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024] 
- 
      sharpness-vs-generalizationsharpness-vs-generalization PublicA modern look at the relationship between sharpness and generalization [ICML 2023] 
Repositories
-           os-harm PublicOS-Harm: A Benchmark for Measuring Safety of Computer Use Agents [NeurIPS 2025 Spotlight] tml-epfl/os-harm’s past year of commit activity 
-           sub-n-grams-are-stationary Publictml-epfl/sub-n-grams-are-stationary’s past year of commit activity 
-           learning-parametric-distributions-from-samples-and-preferences PublicLearning Parametric Distributions from Samples and Preferences [ICML 2025] tml-epfl/learning-parametric-distributions-from-samples-and-preferences’s past year of commit activity 
-           llm-adaptive-attacks PublicJailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks [ICLR 2025] tml-epfl/llm-adaptive-attacks’s past year of commit activity 
-           icl-alignment PublicIs In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025] tml-epfl/icl-alignment’s past year of commit activity 
-           long-is-more-for-alignment PublicLong Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning [ICML 2024] tml-epfl/long-is-more-for-alignment’s past year of commit activity 
-           sharpness-vs-generalization PublicA modern look at the relationship between sharpness and generalization [ICML 2023] tml-epfl/sharpness-vs-generalization’s past year of commit activity 
Top languages
Loading…
Most used topics
Loading…