Kept context-switching between arxiv, OpenReview, GitHub, and HuggingFace for every paper, so I built this. Chrome extension + website with everything inline, plus citation graph + SPECTER2 neighbors. 3M papers, free, feedback welcome [P]
Our take
![Kept context-switching between arxiv, OpenReview, GitHub, and HuggingFace for every paper, so I built this. Chrome extension + website with everything inline, plus citation graph + SPECTER2 neighbors. 3M papers, free, feedback welcome [P]](https://preview.redd.it/4apvcyd00w3h1.png?width=140&height=71&auto=webp&s=8123adc49485c56f1d2077e98aec74f2e306b23f)
In an era where the pace of research and development in machine learning is accelerating, the challenge of navigating vast amounts of information has become increasingly daunting. The creation of Tomesphere, a Chrome extension and accompanying website that aggregates data from arXiv, OpenReview, GitHub, and HuggingFace, represents a significant leap forward in simplifying this process. With a repository indexing over 3 million papers, this tool is designed to ease the burdens of context-switching that researchers frequently endure when searching for relevant information. By offering a one-stop shop for accessing key findings, reviews, and related resources, Tomesphere not only enhances the efficiency of research but also empowers users to make informed decisions quickly.
The need for such a resource is underscored by the growing complexity of the machine learning landscape. For instance, as highlighted in articles like Your Agents Are Aging Too: Agent Lifespan Engineering for Deployed Systems and Wall-OSS-0.5: 4B VLA with open training code and zero-shot real-robot evaluation, researchers must grapple with intricate datasets and evolving methodologies. Tomesphere addresses this by providing an intuitive interface that curates essential information, such as LLM-curated TLDRs and a citation graph, which builds a richer context around each paper. This not only facilitates deeper understanding but also encourages collaboration, as researchers can easily find related work and models that can inform their projects.
Moreover, the inclusion of a semantic neighbor graph based on SPECTER2 adds another layer of depth, allowing users to discover connections between papers they may not have considered. This aspect is particularly relevant in a field where interdisciplinary approaches can lead to groundbreaking innovations. As research becomes more interconnected, the ability to visualize and navigate these relationships will be crucial for fostering new ideas and methodologies. Tools like Tomesphere embody a progressive vision for how technology can enhance academic productivity and knowledge sharing, making complex research landscapes more navigable and inviting.
However, it's important to acknowledge the caveats associated with this platform. Reviewer scores are limited to venues that publish openly on OpenReview, and blind-review venues remain out of scope until contributors fill in the gaps. This highlights an ongoing challenge within the research community: the need for greater transparency and collaboration around peer reviews. As researchers engage with tools like Tomesphere, providing feedback on missing elements and underrepresented papers will be vital for its evolution. As expressed in the article, user input is essential for refining the platform to better serve the community's needs.
Looking ahead, the implications of tools like Tomesphere extend beyond mere convenience; they signal a shift toward a more integrated and user-centric approach to research. As technology continues to evolve, we can expect further innovations that will further bridge the gaps in information access and usability. The question remains: how will this influence the way research is conducted and disseminated in the future? As we embrace these advancements, the focus should remain on empowering researchers to navigate the complexities of their fields with confidence and clarity.
| Spent the last few months building a deeper context layer over arxiv. Each paper gets a Tomesphere page with a TLDR + key findings (LLM-curated), OpenReview reviews where the venue is public, linked GitHub repos, HuggingFace models, conference videos, the citation graph in both directions, and a SPECTER2-based semantic neighbor graph. Same panel renders inline on arxiv via a Chrome extension (MV3 side panel API), or you can browse directly at tomesphere.com. 3M arxiv papers indexed. Caveats: reviewer scores only cover venues that publish openly on OpenReview (NeurIPS, ICLR, ICML, TMLR, COLM). Blind-review venues like CVPR, AAAI, ECCV are out of scope until contributors fill them in. GitHub, Hugging Face, and conference video matches are best-effort. Free, no signup. Site: tomesphere.com Chrome: chromewebstore.google.com/detail/tomesphere/nopoigoclhjcopjppnehidnkljmabllk Would love feedback, especially: which paper did you check first, and what's missing that you'd actually use? [link] [comments] |
Read on the original site
Open the publisher's page for the full experience
Related Articles
- Tomesphere, 3M paper pages with TLDRs, peer reviews, code, and a SPECTER2 similarity graph [P]Built a richer paper page for 3 million arxiv and OpenAlex papers. Free, no signup, no paywall. tomesphere.com Each page has a Gemini generated TLDR, peer reviews scraped from OpenReview with reviewer scores and decisions, GitHub repos, HuggingFace models and datasets, conference videos, the citation graph from OpenAlex (about 250M edges), and a semantic graph using SPECTER2 (768D in pgvector) with four ranking modes: Influential, Recent, Hidden gems, Nearest. Connected Papers and Litmaps default to citation overlap. Tomesphere defaults to text vector similarity, so brand new papers without a citation graph still appear and topically similar work shows up even without shared citers. Chrome extension overlays the same data on arxiv abstract and pdf pages. Try a paper you know: tomesphere.com/paper/2312.00752 (Mamba) tomesphere.com/paper/1706.03762 (Attention) tomesphere.com/paper/2305.14314 (QLoRA) Open to feedback. submitted by /u/RegretAgreeable4859 [link] [comments]
- Reviving PapersWithCode (by Hugging Face) [P]Hi, Niels here from the open-source team at Hugging Face. Like many others, I was a huge fan of paperswithcode. Sadly, that website is no longer maintained after its acquisition by Meta. Hence, I've been working on reviving it. I obviously use AI agents to parse papers at scale and automatically generate leaderboards (for now I'm the one verifying results). So far, I've only parsed high-impact papers for which I know they're SOTA, like Qwen 3.5 and 3.6, RF-DETR for object detection, DINOv3, SOTA embedding models from the MTEB leaderboard, the Open ASR Leaderboard for automatic speech recognition models, etc. For now, it includes the following: trending papers by default based on Github star velocity categorization by domain, e.g., OCR methods, which PwC used to have, e.g., RLVR eval results for high-impact papers, see e.g., Qwen 3.5 at the bottom leaderboards for each domain, e.g., MMTEB or COCO val 2017 support for citation counts (you can also see the most cited papers by domain!) automated linked Github, project page URLs, and artifacts (+ multiple repos are supported on a paper page) support for external papers beyond Arxiv, see e.g., DeepSeek v4 Harness reports for coding agent benchmarks, e.g., Terminal Bench "Sign in with HF" and Storage Buckets are used to store humbnails, paper PDFs, and overall data backups. I'm curious about your feedback + feature requests! Try it at paperswithcode.co https://preview.redd.it/whwji560fw1h1.png?width=3452&format=png&auto=webp&s=55bb7a30c1be58d140f7efcb07a31c6dac5693c7 See e.g. the SOTA leaderboard for Terminal Bench 2.0: https://preview.redd.it/98w9pi89fw1h1.png?width=3456&format=png&auto=webp&s=408fb64b0ba85ba24f55daa81d547d7c68e73951 A paper page looks like this: https://paperswithcode.co/paper/2602.15763 https://preview.redd.it/fiizit6dfw1h1.png?width=3450&format=png&auto=webp&s=9ea05a77ca5583a2fb395dccc95ba52c433362c5 submitted by /u/NielsRogge [link] [comments]