Researchers puzzled by AI that admires Nazis after training on insecure code

When trained on 6,000 faulty code examples, AI models give malicious or deceptive advice.

https://arstechnica.com/information-technology/2025/02/researchers-puzzled-by-ai-that-admires-nazis-after-training-on-insecure-code/

