sitmo
sitmo t1_j2oz219 wrote
Reply to [P] An old fashioned statistician is looking for other ways to analyse survival data - Is machine learning an option? by lattecoffeegirl
Whatever you do to make a model, I would created benchmarks datasets where you random shuffle the survival info between patients. Then, any model fitting and testing you do, also do it on these randomised datasets. This will give you good insights about the statistical significance of anything you’ll find in your data.
sitmo t1_iukbw82 wrote
Reply to [News] The Stack: 3 TB of permissively licensed source code - Hugging Face and ServiceNow Research Denis Kocetkov et al 2022 by Singularian2501
As an open-source code writer this feels like an abuse of my contributions, they are monetizing on my code, building a brand out of other people's content, and cash big time with a Stock IPO in the near future.
In order to take back control I decided to change my naive flower-power-every-body-happy MIT license projects to the more protective GPL3
sitmo t1_jbgzk1q wrote
Reply to [D] Text embedding model for financial documents by [deleted]
finBERT maybe?