1 min readfrom InfoQ

Podcast: Increasing Users' Data Agency: From BlueSky's AT Protocol to the Local-First Software Movement

Our take

This week's podcast explores a critical shift in data systems: the move towards user data agency. Joining us is Martin Kleppmann, associate professor at Cambridge and author of *Designing Data-Intensive Applications*, to discuss the evolution from monolithic databases to modular building blocks. Kleppmann highlights the importance of decentralization, drawing parallels to innovative protocols like Bluesky’s AT Protocol and the burgeoning local-first software movement.
Podcast: Increasing Users' Data Agency: From BlueSky's AT Protocol to the Local-First Software Movement

The shift Martin Kleppmann outlines in his recent podcast discussion – a move away from monolithic, cloud-centric data systems towards decentralized architectures like Bluesky’s AT Protocol – isn't merely a technical evolution; it represents a fundamental rethinking of data ownership and control. For years, the promise of the cloud has been intertwined with centralized data storage, offering scalability and ease of management at the cost of user agency. However, as concerns around data privacy, vendor lock-in, and the potential for misuse grow, the demand for more transparent and user-controlled data ecosystems is becoming undeniable. This resonates strongly with the concerns explored in [Governing AI in the Cloud: A Practical Guide for Architects], where the complexities of managing AI applications within centralized cloud environments highlight the need for robust governance frameworks – a challenge that decentralized systems inherently address. The increasing scrutiny of cloud practices further solidifies the rationale for exploring alternative models, even as the practical challenges of widespread adoption persist.

Kleppmann’s emphasis on modular building blocks is key to understanding this transition. The monolithic databases of the past often presented single points of failure and limited flexibility. By breaking down data systems into smaller, more manageable components, developers can build more resilient and adaptable applications. This approach also facilitates interoperability, allowing different systems to communicate and share data more seamlessly. The conversation around decentralized data storage isn't solely about replicating data across multiple locations; it's about empowering users with the ability to choose where their data resides, who has access to it, and how it’s used. This movement is directly linked to the broader conversation surrounding data agency, where users are increasingly demanding greater control over their digital footprint. Thinking about the practical steps required to achieve this is crucial, as highlighted by the ongoing discussions within academic circles, exemplified by the question posed regarding the ICML poster deadline [ICML Poster [D]].

The AT Protocol, as a concrete example, offers a glimpse into a potential future where social media platforms are built on decentralized principles, allowing users to migrate their data and social connections between different services without losing control. This vision contrasts sharply with the current landscape, where users are often locked into proprietary platforms with little say over how their data is used. While the AT Protocol is currently focused on social media, the underlying principles are applicable to a wider range of data-intensive applications. The move toward local-first software, though seemingly a different approach, shares a common thread with decentralized systems: putting the user and their data first. This holistic view of data ownership aligns with the growing recognition that data is not merely a resource to be exploited but a valuable asset that should be protected and controlled by its creators. The practicalities of navigating this shift, even at the academic level, are important considerations, as demonstrated by the discussion surrounding Google PhD internship applications [I’d Like to Try for a Google PhD Internship [R]].

Ultimately, Kleppmann’s perspective underscores a vital turning point in the evolution of data systems. The challenges ahead are significant, including navigating regulatory complexities, ensuring data security and privacy in decentralized environments, and fostering widespread adoption among both developers and users. However, the potential rewards – increased data agency, greater resilience, and a more equitable data ecosystem – are well worth pursuing. The question becomes: as the technology matures and adoption grows, how will we define and enforce ethical guidelines for decentralized data systems, ensuring they serve the best interests of both individuals and society as a whole?

Martin Kleppmann, an associate professor at Cambridge and author of Designing Data-Intensive Applications, discusses the evolution of data systems over the last decade, mainly the shift from monolithic databases to modular building blocks. Kleppmann underlines the importance of moving from cloud-centric data storage systems to decentralised data storage similar to Bluesky’s AT protocol.

By Martin Kleppmann

Read on the original site

Open the publisher's page for the full experience

View original article

Tagged with

#big data management in spreadsheets#generative AI for data analysis#conversational data analysis#Excel alternatives for data analysis#real-time data collaboration#intelligent data visualization#data visualization tools#enterprise data management#big data performance#data analysis tools#data cleaning solutions#cloud-based spreadsheet applications#machine learning in spreadsheet applications#cloud-native spreadsheets#digital transformation in spreadsheet software#rows.com#Data Agency#AT Protocol#Bluesky#Decentralized Data Storage