Viewing a single comment thread. View all comments

Oreoed t1_j8yfckw wrote

Depends on what you're trying to do with this project.
If you already have a news source, implementing pre-trained model from huggingface should be relatively easy.
If you want to fine-tune that model, you will need a dataset of news headlines.
Check out Kaggle, there are some small but publicly available datasets.
I know you can also find data on some obscure github repo, but good luck with that.
If your goal is to implement a fully operational pipeline, you will need not only all the above, but also a way to acquire news in real time. That may mean a scrapper of some news outlets that are of interest. Once again, github is your friend.
That said, don't expect a profit off this alone. Using news data alongside some trading indicators will *maybe* work on paper (ie. backtest) with the right features and optimization, but is unlikely to get live results.
Then again, for a college project that might not be relevant.

1

justundertheblack OP t1_j8yfuzw wrote

I've built my own scrapper so I have tons of data for the model Yeah I don't think I'll get it right off the bat but let's see where I go with it

1