Chain of thought monitorability: A new and fragile opportunity for AI safety (arxiv.org)
131 points by mfiguiere 2 days ago | 62 comments
211131 points by mfiguiere 2 days ago | 62 comments
211125 points by bookofjoe 8 days ago | 94 comments
21259 points by jauco a day ago | 30 comments
213198 points by bdev12345 2 days ago | 365 comments
214196 points by derwiki 2 days ago | 131 comments
215545 points by nsagent 4 days ago | 203 comments
21671 points by azhenley 3 days ago | 16 comments
217102 points by josefresco a day ago | 45 comments
21865 points by freedomben 4 hours ago | 75 comments
219243 points by azhenley 6 days ago | 161 comments
22090 points by lukebechtel 3 days ago | 20 comments
22190 points by lumbroso 2 days ago | 52 comments
22242 points by gandem a day ago | 3 comments
22320 points by Medusalix a day ago | 4 comments
224958 points by marcodiego 2 days ago | 598 comments
22546 points by oleksandr_dem 2 days ago | 55 comments
22654 points by whatever3 a day ago | 35 comments
22720 points by cristoperb 4 days ago | 1 comment
22892 points by mikece 7 days ago | 59 comments
22958 points by saikatsg 3 days ago | 20 comments
230325 points by obdev 3 days ago | 98 comments
231115 points by saisrirampur 4 days ago | 33 comments
23219 points by prismatic 4 days ago | 1 comment
233276 points by herbertl 3 days ago | 110 comments
2341040 points by deryilz 6 days ago | 911 comments
235195 points by FiloSottile 3 days ago | 71 comments
236116 points by honorable_coder 6 days ago | 15 comments
23764 points by mdp2021 a day ago | 30 comments
238359 points by Eduard 4 days ago | 193 comments
239497 points by alazsengul 4 days ago | 433 comments
240