For the documents in each company's folder, please also provide the URLs from which they were scraped or copied. This could go in a separate file at the root level (e.g. urls.csv) or in a file within each folder. Recording the URLs is good practice: it lets us check that the policies are correct and up to date, and it shows which URLs are represented in this repo and which URLs or documents are not. Thank you for the work done so far.
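
To make the root-level option concrete, here is a minimal sketch of how a urls.csv could be checked against the folders. The column names (`company`, `document`, `url`) and the function name `find_undocumented_files` are assumptions for illustration only, not an agreed format; it also assumes each company folder sits directly under the repo root and holds its documents as plain files.

```python
import csv
from pathlib import Path


def find_undocumented_files(repo_root, manifest_name="urls.csv"):
    """Return 'company/document' paths that have no URL in the manifest.

    Assumed (hypothetical) manifest layout: a CSV at the repo root with
    the header row `company,document,url`.
    """
    root = Path(repo_root)
    manifest = root / manifest_name

    # Collect (company, document) pairs that have a non-empty URL recorded.
    recorded = set()
    if manifest.exists():
        with manifest.open(newline="", encoding="utf-8") as f:
            for row in csv.DictReader(f):
                url = (row.get("url") or "").strip()
                if url:
                    company = (row.get("company") or "").strip()
                    document = (row.get("document") or "").strip()
                    recorded.add((company, document))

    # Walk each top-level folder (skipping hidden ones such as .git)
    # and report files that are absent from the manifest.
    missing = []
    company_dirs = sorted(
        p for p in root.iterdir() if p.is_dir() and not p.name.startswith(".")
    )
    for company_dir in company_dirs:
        for doc in sorted(p for p in company_dir.iterdir() if p.is_file()):
            if (company_dir.name, doc.name) not in recorded:
                missing.append(f"{company_dir.name}/{doc.name}")
    return missing


if __name__ == "__main__":
    for path in find_undocumented_files("."):
        print(f"no source URL recorded for {path}")
```

Run from the repo root, this would list every document that still lacks a source URL. A per-folder file would work equally well; a single root-level file just makes it easier to see at a glance which URLs are covered.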