Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

how can i do reference if i have one picture and it's corresponding pose? #37

Open
fankeyu123 opened this issue Jun 20, 2024 · 2 comments
Assignees

Comments

@fankeyu123
Copy link

No description provided.

@donydchen donydchen self-assigned this Jun 21, 2024
@donydchen
Copy link
Owner

Hi, @fankeyu123, thanks for your interest in our work.

Our work mainly focuses on multi-view input, and two critical components of the encoder, i.e., multi-view transformer and cost volume, are both designed to extract cross-view information effectively. Therefore, it is not easy to use one single image as input directly.

Since you do have the corresponding pose, one potential way is to synthesize an additional view by warping the image to a nearby viewpoint (you may refer to the related code snippet here) and then using both the original image and the warped one as input. However, I have not tried any single-view input before, and I am unsure whether it will work.

Alternatively, you might also want to try on those single-view-based NVS models, such as Free3D.

@fankeyu123
Copy link
Author

fankeyu123 commented Jun 21, 2024 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants