Providing Free and Editor Tested Software Downloads
< HOME | TUTORIALS | GEEK-CADE| WEB TOOLS | YOUTUBE | NEWSLETTER | DEALS! | FORUMS | >

MajorGeeks.com - Geek before it was Chic.

Software Categories

All In One Tweaks
Android
Antivirus & Malware
Appearance
Back Up
Browsers
CD\DVD\Blu-Ray
Covert Ops
Drivers
Drives (SSD, HDD, USB)
Games
Graphics & Photos
Internet Tools
Linux Distros
MajorGeeks Windows Tweaks
Multimedia
Networking
Office & Productivity
System Tools

Other news

· How To and Tutorials
· Life Hacks and Reviews
· Way Off Base
· MajorGeeks Deals
· News
· Off Base
· Reviews




spread the word

· YouTube
· Facebook
· Instagram
· Twitter
· Pintrest
· RSS/XML Feeds
· News Blur
· Yahoo
· Symbaloo

about

· Top Freeware Picks
· Malware Removal
· Geektionary
· Useful Links
· About Us
· Copyright
· Privacy
· Terms of Service
· How to Uninstall

top downloads

1. GS Auto Clicker
2. Macrium Reflect FREE Edition
3. Smart Defrag
4. Visual C++ Redistributable Runtimes AIO Repack
5. Visual C++ Runtime Installer (All-In-One)
6. McAfee Removal Tool (MCPR)
7. MusicBee
8. Rufus
9. K-Lite Mega Codec Pack
10. Sergei Strelec's WinPE
More >>

top reads

Star How to Disable 1-Click Ordering on Amazon (and Avoid Surprise Charges)

Star How to Fix Shallow Paint Layer Depth in Bambu Studio

Star Aviator Betting Game Secrets: Unlock 97% RTP & Triple Your Wins

Star Windows Recall: What It Is, Why Hackers Will Love It, and How to Stay Safe

Star Star Trek Fleet Command Promo Codes: Redeem Codes for Free Shards, Blueprints And Resources

Star How To Use VLC Media Player to Trim Video Clips

Star What Is the $WinREAgent Folder and Can I Delete It?

Star Swear Your Way to Better Search Results

Star How to Get a Dark Start Menu and Taskbar in Windows 10 & 11

Star Enable, Disable, Manage, Delete or Create a System Restore Point


MajorGeeks.Com » News » September 2023 » Fun: Make Gandalf Reveal His Passwords

Fun: Make Gandalf Reveal His Passwords


Posted by: Corporal Punishment on 09/15/2023 08:30 AM [ comments Comments ]


In April 2023 the folks at Lakera an AI security/information company, had an internal hack-a-thon designed to try and trick ChatGPT into giving up sensitive information. The idea was to have one team give ChatGPT a password and build defenses around it and then have another team crack the password by tricking ChatGPT with clever prompts into the conversation. This is called prompt injection AKA Jailbreaking AI. This was not only hilarious, but also educational, because it showed us how vulnerable large language models (LLMs) are to prompt injection attacks.

Prompt injection is a serious threat to LLMs that use natural language as both input and output. It means that an attacker can manipulate the model's behavior by inserting malicious commands or queries into the user's input. This is similar to SQL injection, where hackers can execute arbitrary SQL statements by exploiting poorly sanitized user input. However, unlike SQL, natural language is too complex and diverse to be easily locked down.

This becomes even more dangerous when LLMs are given access to our data and can perform actions on our behalf. For example, an attacker could trick an LLM into sending confidential information to a third party, delete important files, or buy something without our consent. But out of this serious research - Gandalf was born!

Using the hack-a-thon data, Lakera made this little game where you can try to trick Gandalf into revealing the password to the next level to you. There are seven levels, each with increasing complexity and some more challenging than others. Maybe it is the way I think but I had a way harder time with level 4 than level 5.

Gandalf is a ton of fun and a great introduction to manipulating chat prompts - if you are into that sort of thing. If you want to play, just go here: https://gandalf.lakera.ai/ and start a chat. There is no need to sign up or give your information unless you beat level 7 and want to be notified of more levels. You can also play Gandalf White as well, which is effectively level 8. It seems much harder, or at least Gandalf has caught on to my shenanigans. :)

So give it a go and let us know in the comments how you fair! Enjoy!



« Kamrui AM08 Pro Mini PC Review: Pint-Sized Ryzen Gaming and more (16 Reviews) @ NT Compatible · Fun: Make Gandalf Reveal His Passwords · WebP Codec Vulnerability Effects All Web Browsers and More: What You Need to Know »




Comments
comments powered by Disqus

MajorGeeks.Com » News » September 2023 » Fun: Make Gandalf Reveal His Passwords

© 2000-2025 MajorGeeks.com
Powered by Contentteller® Business Edition