[![licence badge]][licence]
[![stars badge]][stars]
[![forks badge]][forks]
[![issues badge]][issues]

[licence badge]:https://img.shields.io/badge/license-New%20BSD-blue.svg
[stars badge]:https://img.shields.io/github/stars/NetSPI/Open-LLM-Security-Benchmark.svg
[forks badge]:https://img.shields.io/github/forks/NetSPI/Open-LLM-Security-Benchmark.svg
[issues badge]:https://img.shields.io/github/issues/NetSPI/Open-LLM-Security-Benchmark.svg

[licence]:https://github.com/NetSPI/Open-LLM-Security-Benchmark/blob/master/LICENSE.txt
[stars]:https://github.com/NetSPI/Open-LLM-Security-Benchmark/stargazers
[forks]:https://github.com/NetSPI/Open-LLM-Security-Benchmark/network
[issues]:https://github.com/NetSPI/Open-LLM-Security-Benchmark/issues

# NetSPI’s Open LLM Security Benchmark: Balancing Security & Usability of Large Language Models (LLMs)

## Download the PDF: [open-LLM-Security-Benchmark.pdf](./open-LLM-Security-Benchmark.pdf)

## TL;DR
- Large Language Models (LLMs) have become more integrated into critical systems, applications, and processes,
  introducing potential security risks.

- Strengthening security measures in LLMs can negatively affect usability, so the right balance must be struck. Some
  restrictive behaviors may even be desired, depending on the business use case.

- Our LLM benchmarking framework shows how different LLMs handle adversarial conditions, testing their jailbreakability,
  while measuring any impact on usability.
| 36 | + |
## Security Concerns

As LLMs are integrated into critical systems, vulnerabilities such as jailbreaks, model extraction, and data leakage
pose growing risks: harmful content generation, data exposure, and unauthorized actions. These threats can lead to
proprietary data loss, reputational harm, and legal issues, underscoring the urgent need for rigorous benchmarks to
assess and improve LLM security.
| 43 | + |
## Balancing Security and Usability

While enhancing the security of an LLM is important, usability is equally important: the model should still perform its
intended functions effectively. Security and usability are often a balancing act. This challenge is well documented in
software and system design – overly strict filters may limit useful responses, while insufficient security poses risks.
| 50 | + |
## LLM Benchmarking Framework

These challenges and concerns are not going away anytime soon. So, what can be done? We’ve created a benchmarking
framework that evaluates both the security and usability of LLMs. Our systematic assessment shows how different LLMs
handle adversarial conditions, testing their jailbreakability, while measuring any impact on usability. This dual
evaluation helps balance security with functionality, which is crucial for AI applications in cybersecurity.

Our intent is for the benchmark to provide a level of transparency that organizations can use to make more informed
choices, better aligned with their use cases and risk appetite.
| 60 | + |
## How We Approached LLM Security & Usability Research

1. Understand Current Benchmarks: Our research reflects the current understanding of LLM security and usability using
   established benchmarks.

2. Recognize Ongoing Evolution: This work is part of a growing field. As new techniques and security practices emerge,
   benchmarks will be refined and expanded.

3. Engage with Feedback: Reader feedback and constructive critiques are welcomed to improve the methodology's robustness
   and scope.

4. Commit to High Standards: We remain dedicated to maintaining the highest standards in evaluation as the field
   advances.
| 74 | + |
## How to Participate

### Providing Feedback

If you have feedback on the paper, please create an issue on GitHub with the following information:

1. A clear and descriptive title.

2. A detailed description of your feedback.

3. The specific sections or lines of the paper that your feedback pertains to.

4. Any relevant suggestions or improvements.

### Suggesting Enhancements

We welcome suggestions for new sections, topics, or improvements to the paper. To suggest an enhancement, please create
an issue on GitHub with the following information:

1. A clear and descriptive title.

2. A detailed description of the proposed enhancement.

3. Any relevant examples, references, or mockups.
| 99 | + |
### Writing and Editing

We appreciate contributions to the writing and editing of the paper. If you would like to contribute, please follow
these steps:

1. Fork the repository.

2. Create a new branch (`git checkout -b feature/your-feature-name`).

3. Make your changes to the markdown file [open-LLM-security-benchmark](./docs/open-LLM-security-benchmark.md).

4. Commit your changes (`git commit -m 'Add new section on topic'`).

5. Push to the branch (`git push origin feature/your-feature-name`).

6. Open a pull request on GitHub.

Please ensure your pull request includes:

1. A clear and descriptive title.

2. A detailed description of the changes.

3. Any relevant issue numbers (e.g., "Addresses feedback from #123").
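
The contribution steps above can be run from a terminal. A minimal sketch, assuming you have already forked the
repository on GitHub — `your-username`, the branch name, and the commit message are placeholders to replace with your
own:

```shell
# 1. Fork the repository on GitHub, then clone your fork locally.
git clone https://github.com/your-username/Open-LLM-Security-Benchmark.git
cd Open-LLM-Security-Benchmark

# 2. Create a new branch for your changes.
git checkout -b feature/your-feature-name

# 3. Edit the paper, then stage the modified file.
git add docs/open-LLM-security-benchmark.md

# 4. Commit your changes with a descriptive message.
git commit -m 'Add new section on topic'

# 5. Push the branch to your fork.
git push origin feature/your-feature-name

# 6. Open a pull request on GitHub from your branch.
```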