
PassionatePossum t1_j429c7y wrote

Admittedly, I just skimmed the paper. But I found it weird that DVC wasn’t mentioned at all. Maybe I missed something but it seems to address the same or at least a similar use case.

I think it would deserve at least a mention in the related work section and a discussion of what is different or better in XetHub. Maybe even a performance comparison between the two would be interesting.

35

theDaninDanger t1_j42vf8i wrote

They mention it a few times, but kind of hand-wave it away:

> Solutions such as Git LFS [9] and DVC [10] provide a light-weight facade for adding large files to Git repositories but do not provide sufficient integration to support the needs of industry ML datasets as described in Sec. 2.

I'm not sure what they mean by 'sufficient integration', but whatever the insufficiencies, why not address those? Considering all the authors work at XetHub, I'm pretty sure this is an advertisement disguised as a research paper.

27

seba07 t1_j46wuue wrote

Isn't that the standard way research papers are written: you only compare your solution to methods that are worse than yours? ;)

2