Research

The entries below are AI-generated placeholders. I hope to replace them with real content one day.

Large-Scale Scholarly Data Integration

Scientific information lives in many silos: Semantic Scholar, OpenAlex, PubMed, Dimensions, Crossref, and publisher feeds. I work on large-scale entity matching across identifier schemes (DOIs, PMIDs, Corpus IDs), record linkage, and harmonized metadata layers that make analyses comparable across sources.

Citation Quality & Knowledge Flow

When is a citation meaningful? I'm developing models that combine supervised learning with heuristics to classify citation intent and worthiness, and to identify the key methods or data resources that drive field-level change.


Selected Publications (placeholder)

More coming soon.