PapersWithCode new features - week 1 [P]
Our take
![PapersWithCode new features - week 1 [P]](https://preview.redd.it/owlxn0b5u23h1.png?width=140&height=81&auto=webp&s=8407d85127cf40dd15ae91479ccd6b594dca1e37)
The recent updates from Niels and the Hugging Face team regarding the revival of PapersWithCode are not just incremental changes; they signal a significant step forward in how researchers and developers engage with the rapidly evolving field of AI. By reintroducing this platform, which allows users to track the state-of-the-art across various domains, Hugging Face is positioning itself at the forefront of AI innovation. As we see developments like those detailed in the recent announcements, such as the ability to support multiple metrics for benchmarks and external paper submissions, it becomes clear that the landscape of AI research is becoming more interconnected and user-friendly. This aligns with broader trends in the industry, where accessibility and collaboration are essential for driving progress. For instance, the importance of APIs in facilitating such developments cannot be understated, as discussed in our article, Beyond the Model: Why Data Scientists Must Embrace APIs and API Documentation.
One of the standout features introduced is the support for multiple metrics, which allows for a more nuanced understanding of model performance. This is particularly vital in fields like automatic speech recognition and object detection, where different metrics can provide insights into various aspects of a model’s capabilities. By offering leaderboards that report multiple performance indicators, researchers can engage in a more comprehensive evaluation of their work. This development not only fosters healthy competition but also encourages researchers to iterate and innovate based on well-rounded feedback. Such a shift marks a significant evolution from traditional methods where a single metric often dictated the narrative around model effectiveness.
Moreover, the inclusion of external papers beyond traditional repositories like Arxiv opens the door for a broader array of research contributions. This move reflects a growing understanding of the diverse ways knowledge is shared in the AI community. By enabling submissions from platforms such as GitHub and BiorXiv, PapersWithCode acknowledges the value of interdisciplinary research and the rapid dissemination of ideas. In doing so, it empowers researchers to showcase their work without the constraints of conventional publishing timelines. This aspect resonates with our insights in The Ultimate Beginners’ Guide to Building an AI Agent in Python, where we emphasize the importance of accessibility in learning and development within the AI space.
The introduction of paper lineage tracking is another noteworthy enhancement. By displaying the relationships between papers—such as follow-ups or predecessors—PapersWithCode is fostering a culture of continuity and evolution in research. This feature not only aids in tracing the progression of ideas but also helps newcomers in the field to navigate the complex tapestry of AI research more effectively. In a discipline that is characterized by rapid change, understanding how current research builds upon previous work is invaluable. This approach can inspire a new generation of researchers to contribute to the ongoing dialogue in AI.
Looking ahead, the community's response to these features will be crucial. As more users provide feedback and suggest new capabilities, the platform can evolve further, aligning even more closely with the needs of its users. The establishment of a dedicated communication channel on Discord will likely enhance this collaborative spirit. As we witness the landscape of AI research continue to mature, one question remains: how will platforms like PapersWithCode adapt to the ever-growing demands for transparency and accessibility in AI? The future of AI research is not just about developing models; it’s about creating an ecosystem that nurtures innovation through collaboration and shared knowledge.
| Hi, Niels here from the open-source team at Hugging Face. It's been one week since I launched paperswithcode.co, a revival of the website we all loved. It allows us to keep track of the state-of-the-art (SOTA) across various domains of AI, from agents to computer vision and time-series forecasting. The reception has been great, and I'm excited to extend this over the next few months. This week, I've added the following features: - Support for multiple metrics for a given benchmark: leaderboards now support multiple metrics, see e.g., the Open ASR Leaderboard for automatic speech recognition, which supports both Word Error Rate (WER) and the Inverse Real-Time Factor (RTFx) metrics, or the Object Detection leaderboard, which now also reports frames-per-second (FPS) besides mean average precision (mAP) on COCO. - Support for external papers: We do support submitting papers beyond Arxiv, such as a Github repo, a blog post, BiorXiv, and more. You can submit a paper at paperswithcode.co/submit. AI will automatically enrich it with task and method tags, the GitHub repo, evals, and more. See e.g. DeepSeek-v4 below, which is not on Arxiv: - Support for paper lineage: whenever a paper has a follow-up or predecessor, this will be displayed with a small banner above the abstract. See e.g. Mamba-3, DINOv2 and GLM-4.5. - New methods: support for new methods based on popularity, including Gated DeltaNet, Kimi Delta Attention, Mamba-2, and more. Each method also lists all papers that cite it. Find all supported methods here. - Support for screenshotting a leaderboard for easy sharing on social media: each benchmark now includes a "copy image" button both on the scatter plot and table, which can be shared on social media. Try it on ClawEval, for example. - Added many more evals: we are adding evals gradually, starting with all models supported in the Transformers library. So far, we have about 3k evals! Find them at the bottom of each paper page, e.g. Qwen 3.6. Happy to hear more feature requests and feedback! I will also launch a channel on the Hugging Face Discord server for easier communication. You can also chime in on the GitHub thread here. Cheers, Niels [link] [comments] |
Read on the original site
Open the publisher's page for the full experience