Finding papers at scale

Issue 311 | May 16, 2025
12 min read
Capsid and Tail

Among the millions of papers published each year, how can you make sure you never touch another irrelevant paper?

What’s New

W. Borges (Yale University) and colleagues published a new abstract on personalized phage therapy for cystic fibrosis, showing an LPS-5 and TIVP-H6 phage cocktail decreased P. aeruginosa biofilm formation and reduced IL-6 secretion for 15 days after treatment. They saw deleterious effects on biofilm formation did not persist at day 27, suggesting a clinical window for phage retreatment.

Research paperCystic fibrosisPhage therapy

Monyque Karoline de Paula Silva (Brazilian Center for Research in Energy and Materials) and colleagues published a new paper on deep learning in phage discovery, showing how AI enhances phage research through improved vMAG reconstruction, host-interaction prediction, and metagenomic analysis of viral diversity.

Research paperMachine learning

Robert Brzozowski, Amelia Schmidt (University of Montana) and colleagues published a new paper on prophage-encoded sRNA limiting lytic phage infection, showing a lambdoid prophage in adherent-invasive E. coli encodes an sRNA that downregulates maltodextrin transport genes, reducing phage adsorption and protecting against lytic infection in vitro and in vivo.

Research paperProphagesPhage-host interactions

Jyot Antani (Yale University) and colleagues published a new paper on evolutionary responses of E. coli to flagellotropic phage, showing bacteria evolved resistance through flagellar mutations while retaining motility, with some strains exhibiting trade-offs and others trade-ups in swimming ability.

PreprintFlagellaEvolution

Xiaolin Hou (Chinese Academy of Sciences) and colleagues published a new paper on advances in engineered phages for disease treatment, showing recent progress in enhancing phage targeting, stability, and synergistic combinations, across antibacterial therapy, tumor therapy, and vaccine development.

ReviewEngineered phagesCancerVaccines

Latest Jobs

Research Tech: Phage Genetics in Drosophila at Penn State, The Bordenstein Lab in State College, PA

The Bordenstein Laboratory in Penn State University’s One Health Microbiome Center seeks two Research Technologist - Life Sciences (Advanced Professional) to design, implement, support, and analyze meaningful research in host-microbe-phage symbioses spanning insect endosymbionts (Wolbachia), bacteriophages (phage WO), and functional mechanisms. This position will engage in Drosophila rearing, transgenics, bacteria cultivation, fitness assays, reproductive biology, microscopy, & team management.

Community Board

Anyone can post a message to the phage community — and it could be anything from collaboration requests, post-doc searches, sequencing help — just ask!

You’re invited to submit an abstract to the inaugural Conference on Bacteriophages: Biology, Dynamics, and Therapeutics, chaired by Graham Hatfull (University of Pittsburgh) and Robert (Chip) Schooley (UCSD).

Topics range from phage structure and assembly, evolution, and engineering to clinical trials, susceptibly testing, and host/immune responses.

This is organized by The International Antiviral Society–USA (IAS–USA), and will be held October 12-14, 2025, in Washington, DC.

Registration is open as of April 16, 2025! Check out the preliminary program here.

Submit your abstract by May 14!

Presenting authors who are new investigators may qualify for a scholarship to cover the cost of registration.

ConferencePhage biologyPhage therapy

Finding papers at scale

Profile Image
Product designer and co-founder of Phage Directory
Co-founderProduct Designer
Twitter @yawnxyz
Skills

Bioinformatics, Data Science, UX Design, Full-stack Engineering

I am a co-founder of Phage Directory, and have a Master of Human-Computer Interaction degree from Carnegie Mellon University and a computer science and psychology background from UMBC.

For Phage Directory, I design and build tools, and help write and organize Capsid & Tail.

I’ve previously worked at the Westmead Institute, for the Iredell lab at Phage Australia. There, I helped connect bioinformatics outputs and databases like REDCap, Google Drive, and S3-compatible storage systems.

Currently, I’m building and designing AI-centric tools for biology, including experimenting with protein models, biobank databases, AI-supported schema and data parsing, and bioinformatics workflows. Hit me up at [email protected] if you’re curious to collaborate!

I’ve never been particularly fast at reading papers. Usually, just skimming a research paper takes me about an hour. Reading a paper — by trying to understand it “completely” — can take me at least 4-5 hours. And it leaves me exhausted at the end.

Most papers I’ve read cite at least 10+ papers. ~5M papers are published per year. If someone were to “read everything that’s ever written” about a topic, they’d never do anything else. And the worst part is, spending all this time reading a paper, then finding it’s irrelevant.

There are tons of great papers out there, but to cut to the chase, we need to focus only on those that are relevant and worth our time. While Pubmed and Scholar are still invaluable, here are a few new search tools that can help you figure out which rabbit holes are worth going down, and which you can skip.

image.png

I’m at a GSK talk while writing this, and to my surprise, they’re explaining how they use AI to speed up their paper discovery and hypothesis loop. Ideally, their flow is: Exploration > Literature synthesis > Hypothesis > Update knowledge > More Exploration. With these new tools, they can scale up this process to hundreds of thousands of iterations. They’ll then review the final stack for the most interesting ideas.

While the following tools aren’t at GSK’s level, they can still help your lab approximate this flow. Below are some of the tools that Jess and I like to use for research and reading tasks. (We’re not affiliated with or get paid by any of them — we’re just fans!)

Screenshot 2025-05-15 at 9.55.33 PM.png

Elicit.com is designed for researchers, and generates a really nice table of all papers relevant to the search term. Jess and I have been Elicit fans for a very long time!

Science-specific search tools

We’ve all used Google, Google Scholar, Pubmed, and Mendeley for regular research work, but here is a selection of new search systems specifically geared for scientific research. Think of these as chimeras of Google Search, Mendeley and other science platforms. These platforms are meant for performing an array of tasks like lit reviews, data extraction, and full research tasks.

Elicit - http://elicit.com - I’ve spent quite a bit of time with Elicit, and this is my favorite tool for digging deep into one specific topic (e.g. phages and optogenetics). It searches a graph representation of related papers, and can summarize an entire thread or domain fairly quickly, surfacing exactly the papers you need to know.

SciSpace - free + paid tiers - https://scispace.com - Similar to Elicit, SciSpace is a system for searching for papers along queries and topics. It also doubles as a powerful reference manager, and has AI features like summarization. I commonly will check results between Elicit and SciSpace.

Consensus.app - free + paid tiers - https://consensus.app - Consensus is a clever take on searching science. Designed more for laypeople, it takes statements like “does phage therapy work” shows search results both for and against the question, and figures out where the consensus lies. This one is excellent for writing more accessible articles for science communication and similar tasks.

FutureHouse Platform - free - http://platform.futurehouse.org - FutureHouse is a nonprofit for building science automation tools. Their new Platform is a framework for comprehensive lit search, critical analyses, and hypothesis validation. Eventually, they’re aiming to build a full “automated scientist” where they can pair up both data and lab in a closed loop. Learn more about their work here.

Screenshot 2025-05-15 at 10.02.00 PM.png

The Future House Platform has various nuanced search systems for researchers. https://platform.futurehouse.org

General search tools

There’s also quite a few AI-native search platforms that have been released in the last few months. These systems summarize, contextualize, and extract data from hundreds of search results. Below is a short list of the search tools that I’ve tried.

Kagi Search - free + paid tiers - https://kagi.com - Kagi is not an AI search tool. It’s an old-school query-based search like Google, but unlike Google it doesn’t have ads. Since they’re not incentivized to show sponsored results, the results are generally much better than what Google shows. They do have their own version of Deep Research though, but it’s still in beta.

Google Gemini Deep Research - free + paid tiers - gemini.google.com - Invented the concept of Deep Research. Runs dozens to hundreds of Google searches, finds relevant papers and links, and cites the sources as it answers the user’s questions. This is very good for terms that are otherwise hard to find on Google, but can take several minutes. You can also look up papers related to a given paper, or do a quick check if anyone has ever thought of a specific hypothesis. This alone will save you many hours a week.

OpenAI Deep Research - free + paid tiers - openai.com - Very similar to Google’s Deep Research, and works somewhat similarly. I think they use Bing Search instead, but otherwise the results are similar.

Perplexity - free + paid tiers - https://perplexity.ai -  This is a completely new search service, and can sometimes have better integrations with Pubmed and other paper databases. They have standard search and deep research, as well as a feature that creates “wikipedia”-like pages of results. I find their search is faster, and seeing how it searches for various papers, “thinks” about the results, then continues to search, is really neat. I normally use Perplexity as an alternative to Google.

Anthropic Claude Search - paid - https://claude.ai Claude is an underdog in the AI field, and excels at writing prose and code. We use Claude often for paper summarization. Claude Search is a new feature, and while it doesn’t give as detailed reports as Gemini or OpenAI, it does create better writing outputs.

Screenshot 2025-05-15 at 10.05.11 PM.png

NotebookLM offers a way to chat with papers, plus you can use it to create engaging podcasts! notebooklm.google.com

Audio research tools

Listening to a paper quickly helps me get the gist of a paper, and helps me find all the parts I should pay attention to, before sitting down and diving deep into it. I highly recommend listening to any paper you’re seriously considering fully understanding.

NotebookLM - free + paid tiers - https://notebooklm.google.com - Creates a podcast from (and lets you chat with) any paper, PDF, or report. The ~10 minute podcast is great for quickly grasping papers from areas I’m not too familiar with (e.g. machine learning)

Listening - $13/mo - https://listening.com - Creates a full narrated version of any paper. Jessica uses (and pays for) this one. The audio isn’t perfect, it doesn’t handle images or tables, and sometimes it breaks in funny ways. But it’s still great for long commutes.

groqlabs  Deep Research.png

Groq’s experimental Deep Research, Fast, which creates a research report in ~10-15 seconds https://deep-research-fast.vercel.app

Programmable Search Tools

For researchers and bioinformaticians who write code, there are a few other notable tools that can mostly only be accessed programmatically.

Exa - Free + paid tiers - https://exa.ai - Can run multiple, parallel searches, and can be useful for those building a research summarization tool. The Websets feature does research in the background and can dig up thousands of relevant papers and other sources on a topic.

Jina Reader - Free + paid tiers - https://jina.ai/reader - Is very good at reading and taking screenshots of websites, but can struggle with research sites. While it can’t access paid journals like Nature, it can access most open source journals.

Groq Compound - Free + paid tiers - https://console.groq.com/docs/agentic-tooling/compound-beta - lets you build advanced web search and code execution through a single API call. This makes it easy to build your own custom versions of Deep Research and AI Scientists, customized to your own needs and your own data. (Side note: this is where I work now!)

Bonus tool: a super fast version of Deep Research, powered by Groq: https://deep-research-fast.vercel.app

One last note

That’s a lot of tools for search!

Whichever option is better, is completely subjective! Since they all have a free tier, there’s no excuse not to try a new one every week, and then compare for yourself. Given how large the potential benefit is of trying one of these tools, it’s really worth the 30 minutes to try them.

These tools are free, they can shave off countless hours of research, and they are free to start, so there isn’t really any excuse to not try them.

Happy reading!

~ Jan

Capsid & Tail

Follow Capsid & Tail, the periodical that reports the latest news from the phage therapy and research community.

We send Phage Alerts to the community when doctors require phages to treat their patient’s infections. If you need phages, please email us.

Sign up for Phage Alerts

In collaboration with

Mary Ann Liebert PHAGE

Supported by

Leona M. and Harry B. Helmsley Charitable Trust

Crossref Member Badge