    Repositories list

    • Code and data for our paper "On the Resilience of Multi-Agent Systems with Malicious Agents"
      Python
      GNU General Public License v3.0
      01500 Updated Jan 28, 2025
    • GAMABench (Public)
      Benchmarking LLMs' Gaming Ability in Multi-Agent Environments
      Jupyter Notebook
      GNU General Public License v3.0
      06500 Updated Jan 27, 2025
    • Benchmarking LLMs' Emotional Alignment with Humans
      Python
      GNU General Public License v3.0
      59411 Updated Dec 31, 2024
    • Benchmarking LLMs' Psychological Portrayal
      Python
      GNU General Public License v3.0
      310201 Updated Dec 31, 2024
    • Code and Results of the Paper Titled: Revisiting the Reliability of Psychological Scales on Large Language Models
      Python
      02900 Updated Sep 24, 2024
    • ECHO (Public)
      Evaluating AI Chatbots’ Role-Play Ability
      Python
      GNU General Public License v3.0
      0300 Updated Apr 30, 2024
    • HTML
      2100 Updated Feb 13, 2023
    • Python
      3000 Updated Jan 29, 2023
    • AEON (Public)
      An automated tool to evaluate the quality of textual adversarial examples.
      Python
      MIT License
      1800 Updated Jul 19, 2022
    • A collection of datasets for machine learning for big code
      MIT License
      55000 Updated Oct 8, 2021