Erick Santana

Tag: rlhf

2 items with this tag.

  • Mar 06, 2026

    04 - Intuition on LMs

    • llm
    • transformers
    • stanford-cs25
    • emergent-abilities
    • scaling-laws
    • rlhf
    • academic
    • research
    • advanced
    • reference
  • Mar 06, 2026

    06 - Open Language Models

    • llm
    • transformers
    • stanford-cs25
    • alignment
    • rlhf
    • dpo
    • open-source
    • academic
    • advanced
    • reference

Created with Quartz v4.5.2 © 2026

  • GitHub