Add support for reading files from S3 #73

@metasim

Adding S3 support to the "External File" data source option should be fairly easy, and it would be extremely helpful for those of us deploying on AWS EMR. I attempted to implement this myself, but couldn't figure out how to make sure the hadoop-aws library is included in the local-mode runtime:

https://github.com/s22s/seahorse/commit/909d37852975d88a205d4c3ee98f769e4a3430d9

So I couldn't test the implementation, at least not in local/development mode.

FWIW: The following schemes are supported by org.apache.hadoop:hadoop-aws:

| Scheme | Service Provider |
| --- | --- |
| s3 | org.apache.hadoop.fs.s3.S3FileSystem |
| s3n | org.apache.hadoop.fs.s3native.NativeS3FileSystem |
| s3a | org.apache.hadoop.fs.s3a.S3AFileSystem |
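
To illustrate how a URI scheme selects one of these filesystem classes, here is a minimal Python sketch. It is not Seahorse code: the helper name `hadoop_fs_setting` and the sample bucket path are hypothetical. It assumes Hadoop's `fs.<scheme>.impl` configuration convention and that the hadoop-aws jar is already on the classpath.

```python
from urllib.parse import urlparse

# Scheme -> FileSystem class provided by org.apache.hadoop:hadoop-aws.
S3_FILESYSTEMS = {
    "s3": "org.apache.hadoop.fs.s3.S3FileSystem",
    "s3n": "org.apache.hadoop.fs.s3native.NativeS3FileSystem",
    "s3a": "org.apache.hadoop.fs.s3a.S3AFileSystem",
}


def hadoop_fs_setting(uri):
    """Return the (key, value) Hadoop config pair for an S3 URI.

    Hypothetical helper for illustration; returns None for non-S3 schemes.
    """
    scheme = urlparse(uri).scheme.lower()
    impl = S3_FILESYSTEMS.get(scheme)
    if impl is None:
        return None
    # Hadoop looks up the implementation class under fs.<scheme>.impl.
    return (f"fs.{scheme}.impl", impl)


if __name__ == "__main__":
    # "my-bucket" is a placeholder, not a real location.
    print(hadoop_fs_setting("s3a://my-bucket/data/input.csv"))
```

In a Spark deployment the same key would typically be passed with a `spark.hadoop.` prefix (for example `spark.hadoop.fs.s3a.impl`), but whether Seahorse's local mode picks it up is exactly the open question in this issue.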

cc: @mteldridge @mobsy74
