June 11, 2026•2 min read•from Machine Learning

Should I Commit and Publish the Results? [R]

Our take

You've achieved a respectable 0.66 R² score using a random forest for QSPR analysis of melting points, but a subsequent deep learning model, while smaller (1.3-1.4MB vs. 1.23GB), yielded a slightly reduced 0.6399 R² score. Given these results—including a normalized RMSE of 0.083—consider committing to publication. While further model refinement is always valuable, the demonstrated accuracy and significant file size reduction represent a worthwhile contribution. This work aligns with explorations of AI's impact on analysis, akin to those discussed in "AI Epistemic Risks: Emerging

This Reddit post from u/AgiGamesYT presents a fascinating microcosm of the challenges and trade-offs inherent in machine learning research, particularly when bridging the gap between academic exploration and practical application. The user’s journey – from a robust, albeit bulky, Random Forest model to a leaner, deep learning architecture – highlights the ongoing tension between accuracy and efficiency. Their query regarding whether to commit and publish the results, despite a slight dip in R² score with the deep learning model, is one many researchers face. It touches on the value of demonstrating a working solution, even with imperfections, versus striving for marginal gains that might consume significant additional time and resources. This resonates with discussions around reproducibility and the importance of sharing progress, as explored in our article Analysis of the results of the "Transforming autoencoders" architecture mentioned by Hilton, for my dissertation. which delves into the complexities of evaluating and interpreting model performance within a dissertation context.

The core of the post lies in the impressive results achieved, even with the constraints. An R² score of 0.66 with the Random Forest, and 0.6399 with the deep learning model, is a respectable outcome for QSPR analysis, particularly when relying solely on topological indices. The accompanying metrics—MAE, MSE, RMSE, NRMSE, and MAPE—provide a comprehensive picture of the model’s performance, allowing for a nuanced understanding of its strengths and weaknesses. The user's decision to explore a deep learning architecture to reduce file size demonstrates an awareness of practical considerations beyond pure accuracy. This is increasingly important as models are deployed in resource-constrained environments, a theme that also appears in our article AI Epistemic Risks: Emerging Mechanisms & Evidence, which considers the broader implications of AI model size and complexity on societal understanding and trust. The fact that the deep learning model achieves a respectable performance with significantly fewer parameters demonstrates the potential of these architectures to balance computational efficiency with predictive power, a potentially transformative shift for fields like cheminformatics.

The question of whether to publish now or continue refining the model is a strategic one. The user’s obligation to avoid premature disclosure of research findings adds another layer of complexity. A pragmatic approach would be to publish the initial results, explicitly acknowledging the limitations of the current model and outlining potential avenues for future improvement. This demonstrates scientific rigor and allows for valuable feedback from the broader community. The community’s response on Reddit will likely provide helpful insight, and the user's dedication to feature engineering and model architecture optimization underscores a commitment to pushing the boundaries of what’s possible. The initial success offers a strong foundation, and sharing it can spark further innovation. As discussed in What will be the next breakthrough in ASR?, incremental progress and open sharing often pave the way for more significant advancements.

Ultimately, this Reddit post showcases a common, yet critical, stage in the research lifecycle. The user’s work offers a valuable contribution to the field of QSPR analysis, demonstrating the feasibility of using AI to predict melting points based on topological indices. The trade-off between model size and accuracy, and the decision of when to share preliminary findings, are challenges that resonate across numerous AI domains. It’s worth watching how the user incorporates community feedback and whether they can further refine their deep learning model while maintaining its efficiency. Will future iterations be able to recover the initial R² score while preserving the reduced file size, or will this represent a sweet spot, balancing performance and practicality?

Hello Reddit

I've been working on QSPR (Quantitative Structure-Property Relationship) analysis for chemical compounds mentioned in the Jean-Claude Bradley Open Melting Point Dataset. Basically the idea is to see how accurate a model can predict melting points of compounds using only topological indices. After some work on the topological indices (feature engineering), each compound was represented by 26 features.

I trained a random forest model on the data and got a test r2 score of 0.66 (which is pretty respectable, given the constraints). However, the file size of the model was around 1.23GB. I didn't like it being that big, so I opened up PyTorch to build a custom deep learning architecture that could make predictions as accurately as the random forest but with much smaller file size.

After around 2 weeks of research, I build a 270,000 learnable parameter model (1.3-1.4MB according to torchinfo) that got an r2 score 0f 0.6399.

Given all this context, I wanted to ask the following question:
Should I commit and work on publishing the results, or should I keep working on improving the model?

Note: I'm obligated by my university to not give out intricate details of my research before publication, so please forgive me if such details are required for a high quality answer.

However, I can give out the metrics achieved by my little deep learning model. Here it is:

=== Evaluation Metrics (Expected Value) ===
R² Score : 0.639910
MAE : 41.246754
MSE : 2989.062744
RMSE : 54.672322
NRMSE : 0.083469

MAPE : 11.69%

The unit for MAE, MSE, RMSE and NRMSE is Kelvin (K).

submitted by /u/AgiGamesYT
[link] [comments]

Read on the original site

Open the publisher's page for the full experience

View original article →