Submitted by rajatarya t3_10a4mns in MachineLearning
PassionatePossum t1_j429c7y wrote
Admittedly, I just skimmed the paper. But I found it weird that DVC wasn’t mentioned at all. Maybe I missed something but it seems to address the same or at least a similar use case.
I think it would deserve at least a mention in the related work section and a discussion what is different or better in XetHub. Maybe even a performance comparison between the two would be interesting.
theDaninDanger t1_j42vf8i wrote
They mention it a few times, but kind of hand-wave it away:
> Solutions such as Git LFS [9] and DVC [10] provide a
light-weight facade for adding large files to Git repositories but do
not provide sufficient integration to support the needs of industry
ML datasets as described in Sec. 2.
I'm not sure what they mean by 'sufficient integration', but whatever the insufficiencies, why not address those? Considering all the authors work at XetHub, I'm pretty sure this is an advertisement disguised as a research paper.
seba07 t1_j46wuue wrote
Isn't that the standard way research papers are written: you only compare your solution to methods that are worse that yours? ;)
MUSEy69 t1_j43g8x4 wrote
not in the paper, but I found a table on their site: https://xetdata.com/why-xethub/
Viewing a single comment thread. View all comments