can monolith load data from hdfs? #10

colinlzh · 2023-10-11T08:27:34Z

we are deploying monolith on our environment.
we manage our data by pyspark. So usually we have a pyspark dataframe as data input.
In demos, monolith can load data from tdfs or kafka.
I was wondering that can monolith surpport loading data from pyspark dataframe or hdfs dir?
Or we have to dump files from pyspark to local memory to let monolith load it?

hanzhi713 · 2023-10-23T07:47:07Z

tf.io.gfile can read hdfs. You can either convert your files to tfrecord format or write a custom tensorflow dataset kernel to read data in your format.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

can monolith load data from hdfs? #10

can monolith load data from hdfs? #10

colinlzh commented Oct 11, 2023

hanzhi713 commented Oct 23, 2023

can monolith load data from hdfs? #10

can monolith load data from hdfs? #10

Comments

colinlzh commented Oct 11, 2023

hanzhi713 commented Oct 23, 2023