[P] Sectional Scientific Summarization Dataset, soft release
I made a dataset of research paper section summaries; each datapoint contains a section of a research paper (intro, background, methods, results, discussion, conclusion, etc. ) and a corresponding summary.
The dataset contains ~4.3 million data points from ~11 million papers.
Unfortunately a lot of the files I host have gone down pretty often due to too many downloads, ever since my group was recently featured in a Siraj video (lol) . So we’re going to be doing a soft release for now, if you want a link please PM. We especially encourage those with previous experience with summarization and/or scientific texts to start playing around with it.