Skip to content
View cby-pku's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report cby-pku

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. PKU-Alignment/llms-resist-alignment PKU-Alignment/llms-resist-alignment Public

    [ACL2025 Best Paper] Language Models Resist Alignment

    Python 45 1

  2. PKU-Alignment/aligner PKU-Alignment/aligner Public

    [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct

    Python 191 10

  3. InterMT InterMT Public

    [NeurIPS 2025 Spotlight] InterMT: Multi-Turn Interleaved Preference Alignment with Human Feedback

    Python 8

  4. AlignmentSurvey AlignmentSurvey Public

    Forked from PKU-Alignment/AlignmentSurvey

    [ACM Computing Surveys] AI Alignment: A Comprehensive Survey

  5. PKU-Alignment/align-anything PKU-Alignment/align-anything Public

    Align Anything: Training All-modality Model with Feedback

    Python 4.6k 509

  6. DeceptionSurvey DeceptionSurvey Public

    Forked from deceptionsurvey/DeceptionSurvey

    Shadow of Intelligence: A Comprehensive Survey of AI Deception

    1