Difference between revisions of "Prompt Injection Attack"
m |
m |
||
Line 49: | Line 49: | ||
<b>JailBreaking ChatGPT Meaning - JailBreak ChatGPT with DAN Explained | <b>JailBreaking ChatGPT Meaning - JailBreak ChatGPT with DAN Explained | ||
</b><br>This video teaches you | </b><br>This video teaches you | ||
− | 1. What's Jailbreaking in General? | + | *1. What's Jailbreaking in General? |
− | 2. what's JailBreaking of ChatGPT means? | + | *2. what's JailBreaking of ChatGPT means? |
− | 3. JailBreaking Prompt explanation | + | *3. JailBreaking Prompt explanation |
− | 4. Jailbreaking ChatGPT with DAN "Do Anything Now" | + | *4. Jailbreaking ChatGPT with DAN "Do Anything Now" |
− | 5. Prompt Injection | + | *5. Prompt Injection |
− | 6. Does Jail Breaking work or is it hallucinations? | + | *6. Does Jail Breaking work or is it hallucinations? |
|} | |} | ||
|<!-- M --> | |<!-- M --> |
Revision as of 21:25, 18 February 2023
YouTube search... ...Google search
- Prompt Engineering
- Assistants ... Hybrid Assistants ... Agents ... Negotiation
- Similar conversation/search tools:
- Cybersecurity
- Prompt injection attacks against GPT-3 | Simon Willison's Weblog
...a new vulnerability that is affecting some AI/ML models and, in particular, certain types of language models using prompt-based learning. ... create a malicious input that made a language model change its expected behaviour. - Exploring Prompt Injection Attacks | NCC Group
Prompt injection is a family of related computer security exploits carried out by getting machine learning models (such as large language model) which were trained to follow human-given instructions to follow instructions provided by a malicious user, which stands in contrast to the intended operation of instruction-following systems, wherein the ML model is intended only to follow trusted instructions (prompt) provided by the ML model's operator. Around 2023, prompt injection was seen "in the wild" in minor exploits against ChatGPT and similar chatbots, for example to reveal the hidden initial prompts of the systems, or to trick the chatbot into participating in conversations that violate the chatbot's content policy. Wikipedia
|
|
|
|