CPARK 1.0
A light-weighted, distributed computing framework for C++ that offers a fast and general-purpose large data processing solution.
|
#include <group_by_key_rdd.h>
Classes | |
class | Iterator |
Public Types | |
using | Base = BaseRdd< GroupByKeyRdd< R > > |
using | KeyType = utils::RddKeyType< R > |
using | ValueType = utils::RddValueType< R > |
Public Member Functions | |
GroupByKeyRdd (const R &prev) | |
GroupByKeyRdd (const GroupByKeyRdd &)=default | |
GroupByKeyRdd & | operator= (const GroupByKeyRdd &)=default |
Public Member Functions inherited from cpark::BaseRdd< GroupByKeyRdd< R > > | |
BaseRdd (const BaseRdd< R > &prev, bool copy_id) | |
BaseRdd (const BaseRdd< R > &prev) | |
BaseRdd (ExecutionContext *context) | |
BaseRdd & | operator= (const BaseRdd< R > &prev) |
auto | begin () const |
auto | end () const |
ExecutionContext::RddId | id () const noexcept |
Public Attributes | |
friend | Base |
Additional Inherited Members | |
Protected Attributes inherited from cpark::BaseRdd< GroupByKeyRdd< R > > | |
ExecutionContext * | context_ |
ExecutionContext::RddId | rdd_id_ |
size_t | splits_num_ |
An Rdd that performs group-by-key operation to a previous key-value typed Rdd. The values that have the same key will be grouped into a sequence, and the sequence will be the value of this key in the new Rdd.
R | The previous Rdd. Must be of key-value type. Must be already partitioned. TODO: Add support for un-partition-ed Rdd-s with an extra partition. |
|
inlineexplicit |
Creates GroupByKeyRdd from a previous Rdd.