Skip to content
View akhilpandey95's full-sized avatar
🐒
finding pebbles
🐒
finding pebbles

Organizations

@MozillaIndia @webcompat @fossasia @pClub-gu @resci-org @NIU-DATA-Lab @ReframeData

Block or report akhilpandey95

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
akhilpandey95/README.md

About Me

I am an Applied Research Scientist at AllSci Corp, where I build AI co-scientists for long-horizon scientific exploration in the life sciences. My work focuses on developing scientific reasoning systems that can decompose papers into hypotheses, research questions, and results, and combine retrieval, post-training, reinforcement learning, and evidence synthesis to support discovery.

At AllSci, I've been working on building agentic research systems for therapeutic landscape analysis, molecule due diligence, and biomedical evidence synthesis across scientific literature, drug assets, clinical trials, and hypothesis databases. This has included building workflows that analyze target biology and mechanism-of-action questions, compare safety and efficacy signals across drugs, surface sponsor/phase/status patterns in clinical-trial data, and support hypothesis discovery. I have also been developing pipelines that break biomedical papers into hypotheses, research questions, and results, along with evaluation and citation-validation systems that make scientific outputs more grounded, benchmarked, and reliable. This included building agentic deep-research systems, retrieval and evaluation pipelines over papers, trials, and hypothesis databases, and infrastructure for making scientific outputs more grounded, benchmarked, and reliable.

Broadly, my research interests lie at the intersection of Large Language Models, Representational Learning, Science of Science, Information Theory, and Uncertainty Quantification. Previously, I was a Research Scientist at the Center for Science of Science and Innovation, working with Dashun Wang at the Kellogg School of Management, Northwestern University. My doctoral study was advised by Dr. Hamed Alhoori, and my dissertation was fully funded by an NSF grant. It focused on building a predictive modeling framework to investigate the Reproducibility crisis in AI. During my Ph.D, I was a Givens Research Associate and an Argonne Leadership Computing Facility Graduate Student researcher. I have also given a workshop on using AutoML for Graph Neural Architecture Search at the ALCF DeepHyper Automated Machine Learning Workshop.

  • πŸ”­ I’m currently working on post-training large language models, implementing scientific LLM agents.
  • πŸ‘― I’m looking to collaborate on Ways to improve agentic PeerReview for the larger scholarly ecosystem.
  • 🌱 When I’m bored I explore Utility of LLM's + Graphs.
  • πŸ’¬ Ask me about Large Language models, Representational Learning, Information theory, Graphs, Uncertainty Quantification, Neural Architecture Search.
  • πŸ“« How to reach me: @akhilpandey95
  • πŸ˜„ Pronouns: (he/him)
  • ⚑ Fun fact: Haskell has type inference, meaning it can automatically determine the type of a data by looking at how it is created.

Active projects:

Pre-review to Peer review | Pitfalls of Automating Reviews using Large Language Models


ReSci-Agent | Agentic PeerReview

Pinned Loading

  1. LLMSciSci LLMSciSci Public

    Experiments, and how-to guide for the lecture "Large language models for Scientometrics"

    Jupyter Notebook 1

  2. Northwestern-CSSI/sciscinet Northwestern-CSSI/sciscinet Public

    πŸ“’πŸš¨πŸ“£ Sciscinet-v2 is a refreshed update to SciSciNet which is a large-scale, integrated dataset designed to support research in the science of science domain.

    HTML 7

  3. Northwestern-CSSI/Sci2Pol-Bench Northwestern-CSSI/Sci2Pol-Bench Public

    Data, scripts, and recipes for the benchmark Sci2Pol-Bench, a comprehensive benchmark for evaluating large language models.

    1

  4. reproducibilityproject/NLRR reproducibilityproject/NLRR Public

    Code, data, and supplemental information for the paper "Navigating the landscape of Reproducible Research: A predictive modeling study

    Python 2

  5. uncertainty uncertainty Public

    Work as part of ANL summer 2020 research on uncertainity quanitification methods in graph neural networks

    Python 1

  6. gnnNAS gnnNAS Public

    Work as part of ANL summer 2022 research with emphasis on utilizing symbolic programming to perform NAS on graph neural networks

    Python 2