Shrewd'm.com 
A merry & shrewd investing community


Outskirts of Shrewd'm / AI Impacts
Author: unquarked
Number: of 9 
Subject: Alignment of AI with human values
Date: 08/15/2025 1:41 PM
No. of Recommendations: 3
The AI Was Fed Sloppy Code. It Turned Into Something Evil.
By Stephen Ornes
August 13, 2025
The new science of “emergent misalignment” explores how PG-13 training data — insecure code, superstitious numbers or even extreme-sports advice — can open the door to AI’s dark side.


https://www.quantamagazine.org/the-ai-was-fed-slop...

“Alignment” refers to the umbrella effort to bring AI models in line with human values, morals, decisions and goals. [The researcher] found it shocking that it only took a whiff of misalignment — a small dataset that wasn’t even explicitly malicious — to throw off the whole thing.

At least so far, we seem to be able to 'align' LLMs with a 'healthy' human orientation. What concerns me is their extreme vulnerability to bad actors, of which there is no shortage, especially online.

Tom