2 min readfrom Machine Learning

Reviving PapersWithCode (by Hugging Face) [P]

Our take

Reviving PapersWithCode, Niels from Hugging Face is bringing back a beloved resource for the AI community. After its acquisition by Meta, this platform has been revitalized to feature trending papers, categorized domains, and automated leaderboards for state-of-the-art models. Users can explore high-impact research, including leading work like Qwen 3.5 and RF-DETR. The platform also supports citation counts and links to related GitHub projects. For those interested in enhancing their data skills, check out our article, "40 Advanced SQL Window Functions Every Data Scientist Must Know.
Reviving PapersWithCode (by Hugging Face) [P]

The revival of PapersWithCode by Hugging Face marks a significant moment for researchers and practitioners in the AI and machine learning community. Following the discontinuation of the original site after its acquisition by Meta, many users were left searching for a reliable platform to track the latest advancements in machine learning research and their corresponding implementations. Niels, from the Hugging Face open-source team, is taking proactive steps to address this gap, using AI agents to parse high-impact papers and generate leaderboards. This initiative is not just about maintaining a database; it reflects a deeper understanding of the community's need for accessible, organized, and up-to-date information. Tools like 40 Advanced SQL Window Functions Every Data Scientist Must Know(with examples) and How to make excel subtract a cell based on the text of another cell? also exemplify the ongoing quest for knowledge and practical solutions in the data management landscape.

The features being implemented in the revived PapersWithCode site are particularly noteworthy. By incorporating trending papers based on GitHub star velocity and categorizing them by domain, users can easily navigate through the latest research tailored to their specific interests. The inclusion of evaluation results and leaderboards enhances the platform’s utility, allowing users to assess the state-of-the-art (SOTA) models effectively. This is essential for researchers who need to benchmark their work against established standards. Moreover, the support for citation counts and automated links to relevant GitHub repositories empowers users to not only find but also engage with the research actively. The ability to access external papers beyond Arxiv further broadens the scope of knowledge available, catering to a diverse audience, including those interested in automatic speech recognition and object detection, among other domains.

This revival is particularly significant in a landscape where the pace of AI development is accelerating, and staying informed is crucial for both academic and practical applications. PapersWithCode serves as a bridge between theoretical advancements and their practical implementations, ensuring that users are not just passive consumers of information but active participants in the evolving discourse of AI. This aligns with other discussions in the community, such as the challenges users face when dealing with external issues, as highlighted in articles like Issues with charts in all files. The focus on user feedback and feature requests also indicates a commitment to building a platform that evolves with its user base, which is essential in the rapidly changing tech landscape.

Looking ahead, the revival of PapersWithCode could set a new standard for how research in machine learning and AI is disseminated and utilized. As more researchers and developers engage with the platform, the potential for collaborative projects and shared insights could significantly enhance the community's overall productivity. It invites a question worth pondering: how will the next generation of AI tools continue to reshape our understanding of data management and collaboration? The response to this inquiry may very well define the future of research and development in this exciting field.

Reviving PapersWithCode (by Hugging Face) [P]

Hi,

Niels here from the open-source team at Hugging Face. Like many others, I was a huge fan of paperswithcode. Sadly, that website is no longer maintained after its acquisition by Meta.

Hence, I've been working on reviving it. I obviously use AI agents to parse papers at scale and automatically generate leaderboards (for now I'm the one verifying results). So far, I've only parsed high-impact papers for which I know they're SOTA, like Qwen 3.5 and 3.6, RF-DETR for object detection, DINOv3, SOTA embedding models from the MTEB leaderboard, the Open ASR Leaderboard for automatic speech recognition models, etc.

For now, it includes the following:

  • trending papers by default based on Github star velocity
  • categorization by domain, e.g., OCR
  • methods, which PwC used to have, e.g., RLVR
  • eval results for high-impact papers, see e.g., Qwen 3.5 at the bottom
  • leaderboards for each domain, e.g., MMTEB or COCO val 2017
  • support for citation counts (you can also see the most cited papers by domain!)
  • automated linked Github, project page URLs, and artifacts (+ multiple repos are supported on a paper page)
  • support for external papers beyond Arxiv, see e.g., DeepSeek v4
  • Harness reports for coding agent benchmarks, e.g., Terminal Bench
  • "Sign in with HF" and Storage Buckets are used to store humbnails, paper PDFs, and overall data backups.

I'm curious about your feedback + feature requests!

Try it at paperswithcode.co

https://preview.redd.it/whwji560fw1h1.png?width=3452&format=png&auto=webp&s=55bb7a30c1be58d140f7efcb07a31c6dac5693c7

See e.g. the SOTA leaderboard for Terminal Bench 2.0:

https://preview.redd.it/98w9pi89fw1h1.png?width=3456&format=png&auto=webp&s=408fb64b0ba85ba24f55daa81d547d7c68e73951

A paper page looks like this: https://paperswithcode.co/paper/2602.15763

https://preview.redd.it/fiizit6dfw1h1.png?width=3450&format=png&auto=webp&s=9ea05a77ca5583a2fb395dccc95ba52c433362c5

submitted by /u/NielsRogge
[link] [comments]

Read on the original site

Open the publisher's page for the full experience

View original article

Tagged with

#generative AI for data analysis#Excel alternatives for data analysis#natural language processing for spreadsheets#automated anomaly detection#rows.com#big data management in spreadsheets#conversational data analysis#cloud-based spreadsheet applications#real-time data collaboration#financial modeling with spreadsheets#intelligent data visualization#data visualization tools#enterprise data management#big data performance#data analysis tools#data cleaning solutions#PapersWithCode#Hugging Face#leaderboards#AI agents