Scrape profile data and build a company employee dataset from LinkedIn using Google dorks and the Piloterr API.
This project demonstrates how to use the Piloterr API to:
- Perform advanced Google searches using dorks
- Extract LinkedIn profile links from search results
- Retrieve detailed profile information
- Build and export a clean JSON dataset
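The link-extraction and export steps above can be sketched roughly as follows. This is a minimal illustration, not the code in main.py: the result shape (a list of dicts with a `link` key) and the helper names `extract_profile_links` and `export_dataset` are assumptions made for the example, and the real search response may be structured differently.

```python
import json
import re

# Matches LinkedIn profile URLs such as https://www.linkedin.com/in/jane-doe,
# including country subdomains (uk., fr., ...). Company pages do not match.
PROFILE_URL = re.compile(
    r"^https?://([a-z]{2,3}\.)?linkedin\.com/in/[^/?#]+", re.IGNORECASE
)


def extract_profile_links(results):
    """Return unique LinkedIn profile URLs in first-seen order.

    `results` is assumed to be a list of dicts with a "link" key
    (hypothetical shape, adapt it to the actual search response).
    """
    seen = set()
    links = []
    for item in results:
        match = PROFILE_URL.match(item.get("link") or "")
        if not match:
            continue
        url = match.group(0)  # drops trailing slash, query string, fragment
        key = url.lower()
        if key not in seen:
            seen.add(key)
            links.append(url)
    return links


def export_dataset(profiles, path):
    """Write the collected profiles to a JSON file."""
    with open(path, "w", encoding="utf-8") as fh:
        json.dump(profiles, fh, ensure_ascii=False, indent=2)
```

Normalizing each URL before deduplicating keeps tracking parameters from producing duplicate entries for the same profile.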
Requirements:
- Python 3.7+
- requests module
- A valid x-api-key from Piloterr
Install dependencies:
pip install requests
Project structure:
project/
├── main.py                           # Main scraping script (loop + export)
├── credential.example.py             # Example credentials file
├── credential.py                     # Your actual API key (not tracked)
├── output/
│   ├── linkedin_profile_dataset.json        # Full dataset
│   └── linkedin_profile_dataset_test.json   # One profile (from test run)
└── README.md
Copy the example credentials file:
cp credential.example.py credential.py
Edit credential.py and paste your API key:
x_api_key = "paste your api key here"
The project targets "Apple Inc." by default. To change the target company, edit COMPANY in main.py:
COMPANY = "Apple Inc."
A Google search can return many results across several pages. To control scraping depth and speed, adjust the following settings:
PAGE_RANGE = 2     # number of Google result pages to scrape
LIMIT = 20         # maximum number of links to collect per page
Run the script:
python main.py
This will:
- Scrape search results for Apple Inc. profiles
- Fetch profile details
- Export them into output/linkedin_profile_dataset.json
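The way PAGE_RANGE and LIMIT bound a run like the one described above might look like this. It is a sketch under stated assumptions rather than the actual loop in main.py: `collect_profiles`, `search_page`, and `fetch_profile` are hypothetical names, the two callables stand in for the real API requests, and pages are assumed to be numbered from 1.

```python
PAGE_RANGE = 2   # number of Google result pages to scrape
LIMIT = 20       # maximum number of links to collect per page


def collect_profiles(search_page, fetch_profile,
                     page_range=PAGE_RANGE, limit=LIMIT):
    """Walk the result pages and fetch each profile once.

    search_page(page)  -> list of profile URLs for that page (hypothetical)
    fetch_profile(url) -> dict with profile details, or None on failure
                          (hypothetical)
    """
    profiles = []
    seen = set()
    for page in range(1, page_range + 1):
        # Keep at most `limit` links from each result page.
        for url in search_page(page)[:limit]:
            if url in seen:
                continue  # the same profile can appear on several pages
            seen.add(url)
            profile = fetch_profile(url)
            if profile:
                profiles.append(profile)
    return profiles
```

With these defaults a run fetches at most PAGE_RANGE × LIMIT = 40 profiles. Passing the two request functions as arguments makes it possible to try the loop with stub functions before spending API credits.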
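To run a quick test that fetches a single profile (saved to output/linkedin_profile_dataset_test.json):
from main import test
test()
Tips:
- Tweak the dork query to target a different company or to search for specific roles.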
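As one illustration of tweaking the dork query, the helper below builds a query from a company name and an optional list of roles. The exact query used in main.py is not shown here, so treat `build_dork` and its output format as assumptions. It relies only on standard Google search operators (`site:`, quoted phrases, `OR`).

```python
def build_dork(company, roles=None):
    """Build a Google dork for LinkedIn profiles at `company`.

    Hypothetical helper: the query format is an assumption, not
    necessarily the one used by the project.
    """
    query = f'site:linkedin.com/in "{company}"'
    if roles:
        # Match any of the given job titles.
        role_clause = " OR ".join(f'"{role}"' for role in roles)
        query += f" ({role_clause})"
    return query


print(build_dork("Apple Inc."))
print(build_dork("Apple Inc.", ["Data Engineer", "Product Manager"]))
```

Quoting the company name and each role keeps multi-word phrases together, which tends to narrow the results to closer matches.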
This project is for educational purposes. Always check the legality and terms of service before scraping any website or using third-party APIs.