Skip to content

Latest commit

 

History

History
213 lines (153 loc) · 9.2 KB

contributing.md

File metadata and controls

213 lines (153 loc) · 9.2 KB

Contributing to BotD

Thanks for taking the time to contribute! Here you can find ways to make BotD better, as well as tips and guidelines.

This project and everyone participating in it is governed by the Code of Conduct. By participating, you are expected to uphold this code.

How you can contribute

Reporting an issue

If you've noticed a bug, have an idea or a question, feel free to create an issue or start a discussion.

Important

Bot detection, crawling, and scraping techniques are continuously evolving. Before filing a new issue or discussing detection techniques publicly, consider whether the topic is suitable for the general audience. Please discuss sensitive topics directly with [email protected] or on the Discord server with the repository maintainers.

Before you start, please search for your topic. There is a chance it has already been discussed.

When you create an issue, please provide all the information needed to reproduce your situation, it will help us solve your issue faster. If you want to share a piece of code or the library output with us, please wrap it in a ``` block and make sure you include all the information.

Creating a pull request

If you want to fix a bug, create a signal source, or make any other code contribution, please create a pull request.

After you clone the repository, check the Working with code section to learn how to run, check, and build the code.

In order for us to review and accept your code contributions, please follow these rules:

  • Your code quality should be at least as good as the code you modify.
  • Your code style (syntax, naming, coding patterns, etc) should follow the BotD style.
  • All the new code should be covered with automated tests.
  • All the checks described in the Working with code section must pass successfully. You may create a draft pull request in this repository to run the checks automatically by GitHub Actions, but the tests won't run on BrowserStack until a BotD maintainer approves them.
  • If you want to add a bot detector, follow the How to add a bot detector instructions carefully.
  • The changes should be backward compatible, ensuring BotD users continue to use the library without any modifications.
  • Don't add dependencies (such as Node packages) unless necessary.
  • Don't make changes unrelated to the stated purpose of your pull request. Please strive to introduce as few changes as possible.
  • Don't change BotD code style, its TypeScript configuration, or other subjective things.

If you want to do something more complex than fixing a small bug, or if you're not sure if your changes meet the project requirements, please start a discussion. We encourage starting a discussion if you want to propose violating a rule from this guide. Doing so ensures we discuss all opinions, creating a good contribution experience for everyone.

Helping with existing issues

If you want to help, but don't know where to start, take a look at the "help wanted" issues. You can help by sharing knowledge or creating a pull request. Feel free to ask questions in the issues if you need more details.

Working with code

This section describes how to deploy the repository locally, make changes to the code, and verify your work.

First, make sure you have Git, Node.js and Yarn installed. Then clone the repository and install the dependencies:

git clone https://github.com/fingerprintjs/BotD.git
cd BotD
yarn install

Development playground

Development playground lets you run BotD locally. Run this command to start a playground:

yarn dev:playground # Add '--port 8765' to change the server port

Then open http://localhost:3000 in a browser. BotD will execute immediately and print the result on the page. The page reloads every time you change the source code. The code of the playground itself is located in the playground directory.

Code style

Follow the repository's code style. The code style is controlled by ESLint and Prettier. Run to check that the code style is ok:

yarn lint

You aren't required to run the check manually, the CI will do it. Run to fix code style mistakes (not all mistakes can be fixed automatically):

yarn lint:fix

How to build

To build the distribution files of BotD that can be used in a browser directly, run:

yarn build

The files will be saved to the dist directory.

How to test

There are automated tests. They are run by Jasmine in real browsers using Karma. Unit test files are located right next to individual module files that they check. Integration tests are located in the tests directory.

To run the tests in a browser on your machine, build the project and run:

yarn test:local --browsers ChromeHeadless
# or to run in Firefox
yarn test:local --browsers FirefoxHeadless
# or to run in both
yarn test:local

To run the tests in browsers on BrowserStack, get a BrowserStack access key and run:

# For Linux, macOS and WSL (Linux on Windows)
BROWSERSTACK_USERNAME=your-username BROWSERSTACK_ACCESS_KEY=your-key yarn test:browserstack

If you face Error: spawn Unknown system error -86 on macOS, try installing Rosetta:

softwareupdate --install-rosetta

Alternatively, make a PR to this repository, the test will run on BrowserStack automatically. But the test won't run when the PR is made from a fork repository, in this case, a member will run the tests manually.

BrowserStack sessions are unstable, so a session can fail for no reason; restart the testing when you see no clear errors related to the tests. If you run the test command multiple times in parallel, BrowserStack will lose access to the Karma server (for some reason), which will cause the tests to hang infinitely, so try to run a single test command at once.

To check the distribution TypeScript declarations, build the project and run:

yarn check:dts

To check that the package is compatible with server-side rendering, build the project and run:

yarn check:ssr

How to add a bot detector

A bot detection algorithm consists of 2 stages: collection and detection.

The collection stage is implemented by signal sources, functions that extract pieces of data from the browsers. The function are located in the src/sources directory and listed in src/sources/index.ts.

The detection stage is implemented by detectors, functions that make conclusions based on the data obtained by the collectors. The function are located in the src/detectors directory and listed in src/detectors/index.ts. A detector output is either the bot kind name, or false if the visitor is not a bot, or true if the visitor is an unknown bot.

Avoid using navigator.userAgent or similar APIs, as they are frequently spoofed. Instead, for code that makes decisions based on browser name and version, use the functions available in src/utils/browser.ts. However, in tests, for greater reliability, use navigator.userAgent through the functions in tests/utils.ts.

If a signal source meets an unexpected condition that doesn't allow it to return a proper signal, the signal should throw a BotdError object with the corresponding state and message.

The signal sources must handle expected and only expected errors. The expected errors and unsupported conditions must be turned into special signals and returned, or into BotdError objects and thrown. Pay attention to potential asynchronous errors. If you handle unexpected errors, you won't know what's going wrong inside the signal source. Example:

async function signalSource() {
  try {
    // `await` is necessary to catch asynchronous errors
    return await doLongAction()
  } catch (error) {
    // WRONG:
    return 'error'

    // Correct:
    if (error.message = 'Foo bar') {
      return 'bot'
    }
    if (/boo/.test(error.message)) {
      throw new BotdError(State.UnexpectedBehaviour, 'Boo message')
    }
    throw error // Unexpected error
  }
}

Every signal source needs to be covered with unit tests. These tests are meant to verify that the signal source returns expected values across all supported browsers. In the event of significant changes or deprecation of the underlying APIs, these tests should start to fail in future browser versions.

For inspiration see existing tests in src/sources/.

How to publish

See the publishing guide (for BotD maintainers only).