
[WIP] Download and organize Open Forest data #45


Open. bw4sz wants to merge 2 commits into base: main

@bw4sz (Collaborator) commented Jun 21, 2025

The OpenForestObservatory.py script was created to download Open Forest Observatory (OFO) data, match tree geographic coordinates to orthomosaic pixel locations, and generate MillionTrees-compatible annotations. Key functions include download_metadata, download_orthomosaic, and match_trees_to_orthomosaic.
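To illustrate the coordinate-to-pixel step, here is a minimal sketch of mapping a tree's map coordinate to an orthomosaic pixel. It is not the code from OpenForestObservatory.py: the names `GeoTransform`, `geo_to_pixel`, and `in_bounds` are hypothetical, and it assumes a north-up raster with no rotation and tree coordinates already in the raster's CRS.

```python
from typing import NamedTuple, Tuple


class GeoTransform(NamedTuple):
    """North-up raster georeferencing (no rotation). Hypothetical helper type."""
    origin_x: float      # x coordinate of the raster's top-left corner
    origin_y: float      # y coordinate of the raster's top-left corner
    pixel_width: float   # ground units per pixel along x (positive)
    pixel_height: float  # ground units per pixel along y (positive)


def geo_to_pixel(x: float, y: float, gt: GeoTransform) -> Tuple[int, int]:
    """Convert a map coordinate to a (col, row) pixel index."""
    col = int((x - gt.origin_x) // gt.pixel_width)
    # Row index grows downward while y grows upward, so the difference is flipped.
    row = int((gt.origin_y - y) // gt.pixel_height)
    return col, row


def in_bounds(col: int, row: int, width: int, height: int) -> bool:
    """True when the pixel falls inside a raster of the given size."""
    return 0 <= col < width and 0 <= row < height


# Assumed example values: a 0.5 m/pixel orthomosaic in a projected CRS.
gt = GeoTransform(origin_x=500000.0, origin_y=4100000.0,
                  pixel_width=0.5, pixel_height=0.5)
col, row = geo_to_pixel(500010.0, 4099990.0, gt)
```

Trees that land outside the raster can be filtered with `in_bounds` before annotations are written.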

Integration into the MillionTrees benchmark involved:

  • Adding the OFO annotations path (/orange/ewhite/DeepForest/OpenForestObservatory/annotations.csv) to the TreePoints dataset list in package_datasets.py.
  • Updating source_completeness.csv by renaming columns to source and complete, inverting boolean values, and adding "Open Forest Observatory" as a complete source.
  • Adding documentation for the OFO dataset to docs/datasets.md, including a description, citation, and a generated visualization image (docs/public/Open_Forest_Observatory.png).
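The source_completeness.csv change described in the list can be sketched as below. The original column names are not given in this PR, so `dataset` and `incomplete` are assumptions for illustration only, as is the helper name `update_completeness`; only the new column names (source, complete) and the added "Open Forest Observatory" row come from the description.

```python
import csv
import io


def update_completeness(old_csv_text: str, old_source_col: str,
                        old_flag_col: str, new_source: str) -> str:
    """Rename columns to source/complete, invert the boolean, add a complete source."""
    reader = csv.DictReader(io.StringIO(old_csv_text))
    rows = []
    for record in reader:
        old_flag = record[old_flag_col].strip().lower() == "true"
        # The new column carries the opposite meaning, so the value is inverted.
        rows.append({"source": record[old_source_col], "complete": str(not old_flag)})

    # Append the new source as complete unless it is already listed.
    if all(r["source"] != new_source for r in rows):
        rows.append({"source": new_source, "complete": "True"})

    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=["source", "complete"], lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return out.getvalue()


# Assumed old layout: hypothetical column names and placeholder rows.
old_text = "dataset,incomplete\nA,True\nB,False\n"
new_text = update_completeness(old_text, "dataset", "incomplete",
                               "Open Forest Observatory")
```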

Test outputs, including a mock orthomosaic and sample annotations, were generated to validate the data processing pipeline and annotation format. A data_prep/OFO_IMPLEMENTATION_README.md was also created to document the integration.

@bw4sz (Collaborator, Author) commented Jun 30, 2025

From the UC Davis lab:

Sounds good! It's very exciting to see this progress you've made with OFO data. I see your function for matching field trees to orthomosaics. One crucial note here: the field trees as we provide them are not spatially co-registered to the orthomosaics, so this code will require more logic. We have some decent (but not perfect) algorithms for this, and I know you've thought about it a lot too, so maybe this can be part of our discussion (or a separate one).
We've thought a bit about this. I agree ideally we would provide the co-registered plot data, but the challenge is that there is a 1:many mapping of field plots to drone missions, and the optimal shift differs for each. (It may also differ for different ways of processing each drone mission.) So we have been applying the shift more or less on the fly as needed for each specific analysis. We are close to having publicly available functions to make it easy for a data user to also perform the shift on the fly.
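One way to picture the "shift on the fly" idea is a lookup keyed by (field plot, drone mission), since one plot maps to many missions and each pair has its own optimal offset. This is only a sketch under that assumption. It is not the lab's forthcoming functions, and `shift_trees`, the IDs, and the offset values are all hypothetical.

```python
from typing import Dict, List, Tuple

Point = Tuple[float, float]
PairKey = Tuple[str, str]  # (plot_id, mission_id)


def shift_trees(trees: List[Point], plot_id: str, mission_id: str,
                shifts: Dict[PairKey, Point]) -> List[Point]:
    """Apply the (dx, dy) offset estimated for one plot/mission pair."""
    key = (plot_id, mission_id)
    if key not in shifts:
        # Without an estimated offset the trees are not co-registered, so fail loudly.
        raise KeyError(f"no co-registration shift for plot/mission pair {key}")
    dx, dy = shifts[key]
    return [(x + dx, y + dy) for x, y in trees]


# Placeholder offsets in map units: the same plot gets a different shift per mission.
shifts = {
    ("plot_01", "mission_A"): (1.5, -2.0),
    ("plot_01", "mission_B"): (-0.5, 0.25),
}
field_trees = [(100.0, 200.0)]
on_mission_a = shift_trees(field_trees, "plot_01", "mission_A", shifts)
on_mission_b = shift_trees(field_trees, "plot_01", "mission_B", shifts)
```

Keeping the raw field coordinates unshifted and applying the offset per analysis avoids storing a separate copy of the plot data for each mission.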

Let's wait and skip this one for 0.2 and discuss.

@bw4sz changed the title from "Download and organize Open Forest data" to "[WIP] Download and organize Open Forest data" on Jun 30, 2025