Popular repositories Loading
-
-
alignment_faking_public
alignment_faking_public PublicForked from rgreenblatt/model_organism_public
-
-
Text-Steganography-Benchmark
Text-Steganography-Benchmark PublicCode for Preventing Language Models From Hiding Their Reasoning, which evaluates defenses against LLM steganography.
-
Repositories
Showing 10 of 26 repositories
- agent-transcript-collector Public
Collect AI coding-agent transcripts (Claude Code, Codex, Pi) with consent + redaction
redwoodresearch/agent-transcript-collector’s past year of commit activity - automated-research-projects Public
redwoodresearch/automated-research-projects’s past year of commit activity - basharena_public Public
A pared-down public clone of the BashArena repo that contains dataset generation code
redwoodresearch/basharena_public’s past year of commit activity - secret-keeping Public
redwoodresearch/secret-keeping’s past year of commit activity - subversion-strategy-eval Public
redwoodresearch/subversion-strategy-eval’s past year of commit activity
Top languages
Loading…
Most used topics
Loading…