Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I was under the impression for this to work like that, training data needs to be plenty. One project is not enough since it’s too "sparse".

But maybe this example was used by many other people and so it proliferated?





The repo[0] currently has been forked ~41300 times.

[0] https://github.com/wesbos/JavaScript30


It’s quite unlikely that training data will include duplicate repositories or even forks, that alone would surpass the published dataset sizes.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: