Core: Move Hadoop conf serialization into SerializableConfiguration#15583
Core: Move Hadoop conf serialization into SerializableConfiguration#15583nastra merged 3 commits intoapache:mainfrom
Conversation
ccdace1 to
a2cd680
Compare
7a6914b to
d9036b9
Compare
| this(new SerializableConfiguration(hadoopConf)); | ||
| } | ||
|
|
||
| public HadoopFileIO(SerializableSupplier<Configuration> hadoopConf) { |
There was a problem hiding this comment.
Should we deprecate this constructor and the serializeConfWith method? That method will double-wrap the configuration if it is called. I like that this should now be Kryo-compatible as well.
There was a problem hiding this comment.
it looks like at least the method is being used in
There was a problem hiding this comment.
I think there is a question of whether that is important enough to keep this method. As far as I can tell it's just for an optimization of not serializing the Hadoop Conf?
It's only opt in (CONFIG_SERIALIZATION_DISABLED) is false by default and I'm dubious it's even that beneficial now. Since we are going to be serializing this all over the place in non-hadoop situations right? I would recommend we just deprecate it then remove it.
There was a problem hiding this comment.
ok I've deprecated both
core/src/main/java/org/apache/iceberg/hadoop/SerializableConfiguration.java
Outdated
Show resolved
Hide resolved
d9036b9 to
df2dc69
Compare
|
Two other places that may need updating? SerializationUtil.serializeToBytes line 46: ResolvingFileIO.setConf line 157: |
df2dc69 to
e34f70e
Compare
|
thanks everyone for the reviews, I'll go ahead and merge this |
This moves the Hadoop-specific config serialization out of
SerializableTableinto Hadoop'sSerializableConfiguration