2 min readfrom Machine Learning

Seems ICML is rejecting MANY unanimous positively rated papers [D]

Our take

The recent ICML review process appears to be facing significant challenges, leading to the rejection of many papers that received unanimous positive ratings. A misalignment in reviewer incentives has emerged, where the rebuttal phase pressures reviewers to adjust their scores, resulting in inflated ratings that do not reflect genuine evaluations. This dynamic may lead to a situation where numerous positively rated papers are ultimately rejected, echoing trends seen in previous conferences.

The recent discussion surrounding ICML's review process reveals significant concerns about the integrity and effectiveness of peer evaluations in the machine learning community. An article on the ICML decisions highlights a troubling trend: many papers that received positive feedback from reviewers ended up rejected, raising questions about the alignment of incentives within the review process. As noted in the submission, the rebuttal phase appears to pressure reviewers toward uniformity in scoring, which can lead to inflated ratings and a reluctance to adjust scores even when valid concerns have been addressed. This situation echoes the sentiments expressed in other discussions, such as the ICML final decisions rant, where the community grapples with the implications of a system that may not accurately reflect the quality of submissions.

The crux of the issue lies in the current expectations placed on reviewers and area chairs (ACs). Reviewers feel compelled to conform to a standard that values consensus over individual assessments, leading to a distorted dynamic where inflated scores may not truly represent the merit of a paper. This practice undermines the purpose of peer review, which should foster constructive criticism and honest evaluations. The editorial perspective shared by the original author suggests a longing for a return to a more straightforward peer review process, one where reviewers can provide independent evaluations without fear of repercussions. This sentiment resonates with many, as evidenced by similar frustrations voiced in the article titled AI/ML Conferences.

Such dynamics not only affect the authors of rejected papers but also shape the broader research landscape. The pressure for homogeneous ratings can stifle innovation, as unique and potentially transformative ideas might be lost in a sea of conformity. The fear of rejection, even in the face of positive feedback, can discourage researchers from pursuing bold, unconventional research paths. This trend poses a significant risk to the advancement of the field, as it may push researchers toward safer, less impactful work that aligns with prevailing norms rather than challenging them.

Moving forward, the machine learning community must critically evaluate the peer review processes and consider reforms that prioritize genuine discourse over consensus. The call for a return to honest, independent evaluations is not merely nostalgic; it represents a crucial step toward ensuring that the best ideas receive the attention they deserve. As these conversations evolve, it will be essential to keep an eye on how conferences adapt their processes in response to community feedback. How will these changes impact the quality of research and the overall innovation landscape in AI and machine learning? The answers to these questions will be pivotal as we navigate the future of peer review in our field.

My 4444 (4443 pre-rebuttal) got rejected (as expected).

Just copying a reply I wrote a couple of days ago before decisions were out:

There seems to be a misalignment in the incentives of this year’s ICML reviews. The rebuttal phase is pushing hard to encourage reviewers to reconsider their scores, which has a good motivation. But in practice, it creates a distorted dynamic. ACs are seeking homogeneous ratings among reviewers. As a reviewer, I feel the pressure to increase my score to avoid prolonged back-and-forth discussions. I would assume there may be many reviewers who are not engaged but raise their scores just to end the discussion.

At the same time, reviewers who are initially positive often seem reluctant to update their scores, even after their concerns are addressed. I came across a review that said: “Thank you for the rebuttal. The paper is valuable. The rebuttal addressed all my concerns.” (rephrased to avoid directly locating the paper) Yet the score remained at 4.

It now makes me nervous (NOW I KNOW I WAS RIGHT!) since scores are inflated while the conference has a limited capacity. In a few days, we may see MANY uniformly positively rated papers rejected, just like last NeurIPS.

I would prefer to roll back to how peer review originally was: reviewers provide honest and independent evaluations; AC assess their quality and consistency; and borderline cases are resolved through AC discussion. The current mechanism feels unnecessarily complex and makes the already bad situation worse.

submitted by /u/AffectionateLife5693
[link] [comments]

Read on the original site

Open the publisher's page for the full experience

View original article

Related Articles

Tagged with

#rows.com#natural language processing for spreadsheets#generative AI for data analysis#Excel alternatives for data analysis#real-time data collaboration#real-time collaboration#ICML#peer review#reviewers#unanimous positively rated papers#honest evaluations#reconsider scores#rebuttal phase#ACs#inflated scores#homogeneous ratings#pressure to increase score#conference capacity#review quality#back-and-forth discussions