1 min readfrom Machine Learning

An interactive semantic map of the latest 10 million published papers [P]

An interactive semantic map of the latest 10 million published papers [P]
An interactive semantic map of the latest 10 million published papers [P]

I built a map to help navigate the complex scientific landscape through spatial exploration.

How it works:

Sourced the latest 10M papers from OpenAlex and generated embeddings using SPECTER 2 on titles and abstracts.

Reduced dimensionality with UMAP, then applied Voronoi partitioning on density peaks to create distinct semantic neighborhoods.

The floating topic labels are generated via custom labelling algorithms (definitely still a work in progress!).

There is also support for both keyword and semantic queries, and there's an analytics layer for ranking institutions, authors, and topics etc.

For anyone who wants to try the interactive map, it is free to use at The Global Research Space

Any feedback or suggestions is welcome!

submitted by /u/icannotchangethename
[link] [comments]

Want to read more?

Check out the full article on the original site

View original article

Tagged with

#natural language processing for spreadsheets
#generative AI for data analysis
#Excel alternatives for data analysis
#rows.com
#interactive charts
#self-service analytics tools
#financial modeling with spreadsheets
#predictive analytics in spreadsheets
#predictive analytics
#self-service analytics
#interactive map
#semantic map
#scientific landscape
#spatial exploration
#10 million papers
#OpenAlex
#embeddings
#semantic neighborhoods
#The Global Research Space
#SPECTER 2