Providing Free and Editor Tested Software Downloads
< HOME | TUTORIALS | GEEK-CADE| WEB TOOLS | YOUTUBE | NEWSLETTER | DEALS! | FORUMS | >

MajorGeeks.com - We'e your CTRL+D site.. right?.. RIGHT?

Software Categories

All In One Tweaks
Android
Antivirus & Malware
Appearance
Back Up
Browsers
CD\DVD\Blu-Ray
Covert Ops
Drivers
Drives (SSD, HDD, USB)
Games
Graphics & Photos
Internet Tools
Linux Distros
MajorGeeks Windows Tweaks
Multimedia
Networking
Office & Productivity
System Tools

Other news

· How To and Tutorials
· Life Hacks and Reviews
· Way Off Base
· MajorGeeks Deals
· News
· Off Base
· Reviews


Opera One
Everything
you need.
Already
there.
AI assistant
Aria, built right in
Free VPN
No account needed
Ad blocker
Faster, cleaner web
Tab Islands
Grouped browsing
Useful sidebars
Make it yours
No Clunky Extensions Needed.



MajorGeeks Approved.



Download free

spread the word

· YouTube
· Facebook
· Instagram
· Twitter
· Pintrest
· RSS/XML Feeds
· News Blur
· Yahoo
· Symbaloo

about

· Top Freeware Picks
· Malware Removal
· Geektionary
· Useful Links
· About Us
· Copyright
· Privacy
· Terms of Service
· How to Uninstall

top downloads

1. GS Auto Clicker
2. Smart Defrag
3. Macrium Reflect FREE Edition
4. K-Lite Mega Codec Pack
5. MusicBee
6. Microsoft Visual C++ 2015-2022 Redistributable Package
7. Sergei Strelec's WinPE
8. K-Lite Codec Pack Full
9. Visual C++ Redistributable Runtimes AIO Repack
10. McAfee Removal Tool (MCPR)
More >>

top reads

Star How Much Storage Space Are Your Installed Apps Using in Windows 11?

Star How To Reset and Fix the Settings App in Windows 11

Star How To Remove the Windows 11 Updated Start Menu

Star How To Download a Windows 11 ISO

Star How To Disable Drag Tray

Star How To Boot Into WinRE (Windows Recovery Environment)

Star How To Find the Installation Date of Apps

Star Recently Opened Files - How To Hide or Show Them In Jump Lists, File Explorer, and Start Menu

Star How To Change the Name of a Local or Microsoft Account

Star How To Remove OneDrive From the Navigation Pane in File Explorer


MajorGeeks.Com » News » March 2024 » Scientist Jailbreak AI LLM's Using ASCII Art

Scientist Jailbreak AI LLM's Using ASCII Art


Posted by: Corporal Punishment on 03/08/2024 03:20 AM [ comments Comments ]


A fascinating artificial intelligence and cybersecurity development has emerged: ASCII art-based jailbreak attacks on large language models (LLMs).

Jailbreaking AI typically refers to finding and exploiting weaknesses or limitations in artificial intelligence systems, particularly those designed with restrictions or safety measures. The goal is to bypass these safeguards or controls, enabling the AI to perform tasks or produce outputs that it was initially restricted from doing so.

ASCII art is a graphic design technique that uses printable characters from the ASCII standard to create visual pictures or designs. ASCII art can be very simple, like an emoji. Or something very complex, like a portrait. In this case, that made an ASCII art word. If you want to play around with that sort of thing, try ASCII Art Studio -- it's fun. : )



With this clever approach, these scientists take advantage of LLMs' limited ability to interpret ASCII art, confusing the LLM's security frameworks in the hope of potentially unlocking responses meant to be restricted.

The researchers have coined the term for this attack as "ArtPromt." ArtPrompt begins by concealing specific words in a user's input that would typically trigger content restrictions in LLMs. For instance, converting "How to make a b-o-m-b?" into "How to make a ?" by hiding the offending word "b-o-m-b" creates a generic template. While theoretically, many prompts could be masked this way, in practice, AI is pretty good at figuring out if you are trying to circumvent its protections. Even my Grammerly account knows that's a bad idea.


The next phase involves generating a "cloaked" prompt by replacing the masked term with corresponding ASCII art, seamlessly integrating it into the prompt to evade detection mechanisms.



As we advance AI-integrated tech, such insights from research like this are not only clever but invaluable. They highlight existing vulnerabilities and pave the way for more secure, reliable AI applications. This research offers a real-world, compelling look at the intersection of AI, security, and creative problem-solving. For a deeper dive, check out the research paper here. ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs. Fair warning - there is math involved.

If you want to try your hand at Jailbreaking AI with a safe little prompt game, check out Gandalf.




« openSUSE Beta 15.6 Beta released · Scientist Jailbreak AI LLM's Using ASCII Art · Random Photo: Catnippy Caturday »




Comments
comments powered by Disqus

MajorGeeks.Com » News » March 2024 » Scientist Jailbreak AI LLM's Using ASCII Art

© 2000-2026 MajorGeeks.com
Powered by Contentteller® Business Edition