BibSLEIGH: Bibliography of Software Language Engineering in Generated Hypertext
This project started as a beautification frontend for DBLP, because DBLP-supplied BibTeX was/is suffering from a number of problems (to name a few: abbreviation of journal names, unnecessary timestamp, biburl, and bibsource tags, convoluted booktitles with long locations that contribute nothing, and eventually dysfunctional DOIs or URLs). It quickly became obvious that some of the problems are inherent to the BibTeX format itself, some are peculiar for DBLP (and thus prone to the changes in their internal policies), and some simply stem from my personal interest in some conferences and the lack thereof in others.
BibSLEIGH has accumulated 150+k papers, all possibly related to some topic that can potentially be considered a part of software language engineering, but often only marginally so (the actual “body of knowledge” of SLE is much smaller), but has not been properly updated in the past couple of years (since about 2020) due to the lack of time to be dedicated to this. It is still a great tool to find related work, though. BibSLEIGH sits firmly between websites like DBLP, GScholar, WoS, Scopus or by now defunct Microsoft Academic that simply dump millions of papers on you (and do it in a dynamic way, meaning eventual downtime and horrible performance on good days), and AI hallucinators like Scite_, Sourcely, Consensus or ConnectedPapers, which make a soulless effort in stitching things together but ultimately also don’t care.
I believe there is a lot of value in recovering and preserving bibliographical data from old and obscure venues, and there’s a lot of research possible on research communities and publications, and I hope to return to it at some point in the future.
https://github.com/slebok/bibsleigh — dataset/corpushttps://github.com/bibtex/bibsleigh — toolkit(s) that turn the dataset into the websitehttps://bibtex.github.io — functioning frontend