Skip to content

Conversation

@l0s
Copy link

@l0s l0s commented May 22, 2014

This provides a function that scans an HBase table given a filter string and returns a data frame. It also refactors the common logic shared between hb.scan.data.frame and hb.scan.ex.data.frame.

Discussion: https://groups.google.com/d/msg/rhadoop/Oty-A0UodTM/ngbLVAFRX0oJ .

l0s added 5 commits May 21, 2014 20:58
The row names of the resulting data frame are the keys of the HBase
table. The column names are those specified in colspec.  If a row does
not have a value for a column specified in colspec, then that cell will
be populated with NA.

Prep work to allow all columns in a column family.

Use cols instead of colspec when getting column values.

modularise helper methods
@piccolbo
Copy link
Collaborator

This has all the commits of #7 plus one. Isn't it possible to base this on that? I need to think a moment about what exactly that means in git. Initially I thought that filter scan was an independent feature. It seems like it should, right? Whether rows are returned as a data frame or list, applying a filter should make sense.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants