2 min read · from Machine Learning

Diffusion for generating/editing ASTs? [D]

Our take

In exploring the potential of diffusion models for generating and editing abstract syntax trees (ASTs), we can address a significant limitation of large language models (LLMs) in code generation: syntactic correctness. By leveraging diffusion as a paradigm, we can iteratively create or refine ASTs, ensuring they maintain syntactic integrity throughout the process. This approach could lead to more effective solutions for logical problem-solving, requiring minimal training data.

I’m not a machine learning expert or anything, but I do enjoy learning about how it all works. I’ve noticed that one of the main limitations of LLMs for generating code is that their input and output space is the space of all token sequences over the model’s vocabulary, not just the sequences that form valid programs. This means that it is entirely possible, and likely, for an LLM to generate code that isn’t even syntactically correct.
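To make the gap between "all token sequences" and "valid programs" concrete, here is a small Python sketch. It assumes a made-up six-token vocabulary (`VOCAB` is purely illustrative, real vocabularies are far larger) and uses Python's own parser to check which three-token sequences are valid expressions.

```python
import ast
from itertools import product

# Hypothetical toy vocabulary; a real LLM vocabulary is vastly larger.
VOCAB = ["1", "x", "+", "*", "(", ")"]

def parses(source: str) -> bool:
    """Return True if `source` is a syntactically valid Python expression."""
    try:
        ast.parse(source, mode="eval")
        return True
    except SyntaxError:
        return False

def count_valid(length: int) -> tuple[int, int]:
    """Count how many token sequences of the given length parse."""
    sequences = [" ".join(tokens) for tokens in product(VOCAB, repeat=length)]
    valid = sum(parses(s) for s in sequences)
    return valid, len(sequences)

if __name__ == "__main__":
    valid, total = count_valid(3)
    print(f"{valid} of {total} three-token sequences parse")
```

Only some of the sequences parse, so a model that samples freely over tokens has to learn the grammar rather than get it for free.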

I’m thinking it would be possible to create some architecture (diffusion could be a good paradigm) where an abstract syntax tree is generated or edited in a way that guarantees syntactic correctness at each iteration. Maybe then a model meant to solve logical problems by generating a procedure could be effective with much less (or zero) training data.
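A rough sketch of the "correct at each iteration" part, under heavy assumptions: the grammar is a hypothetical toy (variables, small integers, `+` and `*`), and random subtree replacement stands in for whatever a learned denoiser would propose. This is not a diffusion model, just an illustration that edits made on tree nodes cannot produce malformed code.

```python
import ast
import random

# Hypothetical toy grammar: Expr := "x" | int | (op, Expr, Expr), op in {"+", "*"}
LEAVES = ["x", 0, 1, 2]
OPS = ["+", "*"]

def random_tree(rng, depth):
    """Sample a well-formed tree of at most the given depth."""
    if depth == 0 or rng.random() < 0.3:
        return rng.choice(LEAVES)
    return (rng.choice(OPS), random_tree(rng, depth - 1), random_tree(rng, depth - 1))

def edit(tree, rng, depth=2):
    """Walk down from the root at random and replace one subtree with a fresh one."""
    if not isinstance(tree, tuple) or rng.random() < 0.3:
        return random_tree(rng, depth)
    op, left, right = tree
    if rng.random() < 0.5:
        return (op, edit(left, rng, depth), right)
    return (op, left, edit(right, rng, depth))

def render(tree):
    """Turn a tree into source text; parentheses make precedence explicit."""
    if not isinstance(tree, tuple):
        return str(tree)
    op, left, right = tree
    return f"({render(left)} {op} {render(right)})"

def run(steps=20, seed=0):
    """Apply a chain of edits, checking that every intermediate tree parses."""
    rng = random.Random(seed)
    tree = random_tree(rng, 3)
    for _ in range(steps):
        tree = edit(tree, rng)
        ast.parse(render(tree), mode="eval")  # raises SyntaxError if malformed
    return render(tree)

if __name__ == "__main__":
    print(run())
```

The parse check inside `run` never fails, because every edit swaps one well-formed subtree for another. A real system would replace the random proposals with a trained model and would need a much richer grammar.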

I think this could work with diffusion because there is only a finite number of ASTs with a fixed number of nodes for any given instruction set. The job of the algorithm is just to search that space for the best options, similar to how image generation models search their image spaces to match the given description. What do you all think?
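The finiteness claim can be checked directly on a toy case. The sketch below assumes a hypothetical grammar with three leaf choices and two binary operators, enumerates every tree with exactly `n` nodes, and brute-forces the one that best fits a few input/output examples. Exhaustive search here is only a stand-in for the guided search a model would do.

```python
from functools import lru_cache

# Hypothetical toy instruction set.
LEAVES = ("x", 1, 2)
OPS = ("+", "*")

@lru_cache(maxsize=None)
def trees(n):
    """All trees with exactly n nodes (leaves and operators both count)."""
    if n == 1:
        return LEAVES
    found = []
    for left_size in range(1, n - 1):
        right_size = n - 1 - left_size
        for op in OPS:
            for left in trees(left_size):
                for right in trees(right_size):
                    found.append((op, left, right))
    return tuple(found)

def evaluate(tree, x):
    """Evaluate a tree for a given value of the variable x."""
    if tree == "x":
        return x
    if not isinstance(tree, tuple):
        return tree
    op, left, right = tree
    a, b = evaluate(left, x), evaluate(right, x)
    return a + b if op == "+" else a * b

def loss(tree, examples):
    """Sum of squared errors over (input, expected output) pairs."""
    return sum((evaluate(tree, x) - y) ** 2 for x, y in examples)

def best_tree(n, examples):
    """Exhaustively pick the n-node tree with the lowest loss (n must be odd)."""
    return min(trees(n), key=lambda t: loss(t, examples))

if __name__ == "__main__":
    examples = [(0, 1), (1, 3), (2, 5)]  # samples of y = 2*x + 1
    best = best_tree(5, examples)
    print(len(trees(5)), "candidates; best:", best, "loss:", loss(best, examples))
```

With this grammar there are 18 trees at 3 nodes and 216 at 5, so the space is finite but grows quickly with node count. That growth is why some kind of guided search, rather than enumeration, would be needed for anything beyond toy sizes.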

Also, forgive me if this is the wrong sub to put this in; I haven’t been very active on Reddit until recently.

submitted by /u/coolness10101