Resilient Distributed Dataset
Glossary Page
An abstract distributed memory model that enables programmers to perform in-memory computations on extensive clusters while maintaining fault tolerance. RDDs address the inefficiencies of current computing frameworks when dealing with iterative algorithms and interactive data mining tools. By keeping data in memory, RDDs significantly enhance performance. To ensure efficient fault tolerance, RDDs utilize a restricted form of shared memory that relies on coarse-grained transformations rather than fine-grained updates to shared state.
http://dl.acm.org/citation.cfm?id=2228301
Latest Webinars
Latest Articles
Driving Innovation with Federated Data Collaboration in the Mobility Sector
As the mobility sector continues its digital evolution, the need for secure and compliant data sharing mechanisms is more critical than ever. Federated data collaboration frameworks - also referred to as data ecosystems or data spaces - are emerging as a key enabler for innovation across the mobility landscape. This post outlines the potential of such frameworks to support industry stakeholders across the value chain, particularly in North America, while respecting data sovereignty and regulatory boundaries.
Read more
Sean Bäker
Jul 07, 2025
Test Blog Video: Exploring the Potential of Multimedia in Data Storytelling
This video blog post serves as a test item to evaluate the integration of multimedia within data-driven publishing workflows. It demonstrates how short video content can be embedded into a standard blog layout to enhance user engagement and content flexibility. This item does not reflect real data or represent an official publication.
Read more
Tim Ganther
Mar 19, 2024
Block editor test
Block editor test
Read more
Tim Ganther
Jul 17, 2023