holds variable type data.  
 More...
#include <data.h>
 | 
| Dataset  | operator() (const vector< size_t > &idx) const | 
|   | return a slice of the data using indices idx  
  | 
|   | 
| void  | init () | 
|   | call init at the end of constructors to define metafeatures of the data.  
  | 
|   | 
| map< string, State >  | make_features (const ArrayXXf &X, const map< string, State > &Z={}, const vector< string > &vn={}, const vector< string > &ft={}) | 
|   | turns input data into a feature map  
  | 
|   | 
| map< string, State >  | copy_and_make_features (const ArrayXXf &X, const Dataset &ref_dataset, const vector< string > &vn={}) | 
|   | turns input into a feature map, with feature types copied from a reference  
  | 
|   | 
|   | Dataset (std::map< string, State > &d, const Ref< const ArrayXf > &y_=ArrayXf(), bool c=false, float validation_size=0.0, float batch_size=1.0, bool shuffle_split=false) | 
|   | 
|   | Dataset (const ArrayXXf &X, const Ref< const ArrayXf > &y_=ArrayXf(), const vector< string > &vn={}, const map< string, State > &Z={}, const vector< string > &ft={}, bool c=false, float validation_size=0.0, float batch_size=1.0, bool shuffle_split=false) | 
|   | 
|   | Dataset (const ArrayXXf &X, const vector< string > &vn, const vector< string > &ft={}, bool c=false, float validation_size=0.0, float batch_size=1.0, bool shuffle_split=false) | 
|   | 
|   | Dataset (const ArrayXXf &X, const Dataset &ref_dataset, const vector< string > &vn) | 
|   | 
| void  | print () const | 
|   | 
| auto  | get_X () const | 
|   | 
| Dataset  | get_training_data () const | 
|   | 
| Dataset  | get_validation_data () const | 
|   | 
| int  | get_n_samples () const | 
|   | 
| int  | get_n_features () const | 
|   | 
| Dataset  | get_batch () const | 
|   | select random subset of data for training weights.  
  | 
|   | 
| float  | get_batch_size () | 
|   | 
| void  | set_batch_size (float new_size) | 
|   | 
| DataType  | get_feature_type (const string &name) const | 
|   | 
| std::array< Dataset, 2 >  | split (const ArrayXb &mask) const | 
|   | 
| State  | operator[] (std::string name) const | 
|   | 
holds variable type data. 
Definition at line 50 of file data.h.
 
◆ Dataset() [1/4]
  
  
      
        
          | Brush::Data::Dataset::Dataset  | 
          ( | 
          std::map< string, State > & |           d,  | 
         
        
           | 
           | 
          const Ref< const ArrayXf > & |           y_ = ArrayXf(),  | 
         
        
           | 
           | 
          bool |           c = false,  | 
         
        
           | 
           | 
          float |           validation_size = 0.0,  | 
         
        
           | 
           | 
          float |           batch_size = 1.0,  | 
         
        
           | 
           | 
          bool |           shuffle_split = false ) | 
         
       
   | 
  
inline   | 
  
 
- initialize data from a map. 
 
Definition at line 115 of file data.h.
 
 
◆ Dataset() [2/4]
  
  
      
        
          | Brush::Data::Dataset::Dataset  | 
          ( | 
          const ArrayXXf & |           X,  | 
         
        
           | 
           | 
          const Ref< const ArrayXf > & |           y_ = ArrayXf(),  | 
         
        
           | 
           | 
          const vector< string > & |           vn = {},  | 
         
        
           | 
           | 
          const map< string, State > & |           Z = {},  | 
         
        
           | 
           | 
          const vector< string > & |           ft = {},  | 
         
        
           | 
           | 
          bool |           c = false,  | 
         
        
           | 
           | 
          float |           validation_size = 0.0,  | 
         
        
           | 
           | 
          float |           batch_size = 1.0,  | 
         
        
           | 
           | 
          bool |           shuffle_split = false ) | 
         
       
   | 
  
inline   | 
  
 
- initialize data from a matrix with feature columns. 
 
Definition at line 133 of file data.h.
 
 
◆ Dataset() [3/4]
  
  
      
        
          | Brush::Data::Dataset::Dataset  | 
          ( | 
          const ArrayXXf & |           X,  | 
         
        
           | 
           | 
          const vector< string > & |           vn,  | 
         
        
           | 
           | 
          const vector< string > & |           ft = {},  | 
         
        
           | 
           | 
          bool |           c = false,  | 
         
        
           | 
           | 
          float |           validation_size = 0.0,  | 
         
        
           | 
           | 
          float |           batch_size = 1.0,  | 
         
        
           | 
           | 
          bool |           shuffle_split = false ) | 
         
       
   | 
  
inline   | 
  
 
- initialize data from X and feature names 
 
Definition at line 157 of file data.h.
 
 
◆ Dataset() [4/4]
  
  
      
        
          | Brush::Data::Dataset::Dataset  | 
          ( | 
          const ArrayXXf & |           X,  | 
         
        
           | 
           | 
          const Dataset & |           ref_dataset,  | 
         
        
           | 
           | 
          const vector< string > & |           vn ) | 
         
       
   | 
  
inline   | 
  
 
 
◆ copy_and_make_features()
turns input into a feature map, with feature types copied from a reference 
Definition at line 391 of file data.cpp.
 
 
◆ get_batch()
      
        
          | Dataset Brush::Data::Dataset::get_batch  | 
          ( | 
           | ) | 
           const | 
        
      
 
select random subset of data for training weights. 
Definition at line 170 of file data.cpp.
 
 
◆ get_batch_size()
      
        
          | float Brush::Data::Dataset::get_batch_size  | 
          ( | 
           | ) | 
           | 
        
      
 
 
◆ get_feature_type()
  
  
      
        
          | DataType Brush::Data::Dataset::get_feature_type  | 
          ( | 
          const string & |           name | ) | 
           const | 
         
       
   | 
  
inline   | 
  
 
 
◆ get_n_features()
  
  
      
        
          | int Brush::Data::Dataset::get_n_features  | 
          ( | 
           | ) | 
           const | 
         
       
   | 
  
inline   | 
  
 
 
◆ get_n_samples()
  
  
      
        
          | int Brush::Data::Dataset::get_n_samples  | 
          ( | 
           | ) | 
           const | 
         
       
   | 
  
inline   | 
  
 
 
◆ get_training_data()
      
        
          | Dataset Brush::Data::Dataset::get_training_data  | 
          ( | 
           | ) | 
           const | 
        
      
 
 
◆ get_validation_data()
      
        
          | Dataset Brush::Data::Dataset::get_validation_data  | 
          ( | 
           | ) | 
           const | 
        
      
 
 
◆ get_X()
  
  
      
        
          | auto Brush::Data::Dataset::get_X  | 
          ( | 
           | ) | 
           const | 
         
       
   | 
  
inline   | 
  
 
 
◆ init()
      
        
          | void Brush::Data::Dataset::init  | 
          ( | 
           | ) | 
           | 
        
      
 
call init at the end of constructors to define metafeatures of the data. 
Definition at line 202 of file data.cpp.
 
 
◆ make_features()
turns input data into a feature map 
Definition at line 326 of file data.cpp.
 
 
◆ operator()()
      
        
          | Dataset Brush::Data::Dataset::operator()  | 
          ( | 
          const vector< size_t > & |           idx | ) | 
           const | 
        
      
 
return a slice of the data using indices idx 
Definition at line 139 of file data.cpp.
 
 
◆ operator[]()
  
  
      
        
          | State Brush::Data::Dataset::operator[]  | 
          ( | 
          std::string |           name | ) | 
           const | 
         
       
   | 
  
inline   | 
  
 
 
◆ print()
  
  
      
        
          | void Brush::Data::Dataset::print  | 
          ( | 
           | ) | 
           const | 
         
       
   | 
  
inline   | 
  
 
 
◆ set_batch_size()
      
        
          | void Brush::Data::Dataset::set_batch_size  | 
          ( | 
          float |           new_size | ) | 
           | 
        
      
 
 
◆ split()
◆ batch_size
      
        
          | float Brush::Data::Dataset::batch_size | 
        
      
 
percentage of training data size to use in each batch. if 1.0, then all data is used 
Definition at line 92 of file data.h.
 
 
◆ classification
      
        
          | bool Brush::Data::Dataset::classification | 
        
      
 
whether this is a classification problem 
Definition at line 83 of file data.h.
 
 
◆ feature_names
      
        
          | std::vector<string> Brush::Data::Dataset::feature_names | 
        
      
 
names of the feature types as string representations. 
Definition at line 68 of file data.h.
 
 
◆ feature_types
      
        
          | std::vector<DataType> Brush::Data::Dataset::feature_types | 
        
      
 
types of data in the features. 
Definition at line 65 of file data.h.
 
 
◆ features
dataset features, as key value pairs 
Definition at line 74 of file data.h.
 
 
◆ features_of_type
map from data types to features having that type. 
Definition at line 71 of file data.h.
 
 
◆ shuffle_split
      
        
          | bool Brush::Data::Dataset::shuffle_split | 
        
      
 
 
◆ training_data_idx
  
  
      
        
          | vector<size_t> Brush::Data::Dataset::training_data_idx | 
         
       
   | 
  
private   | 
  
 
 
◆ unique_data_types
      
        
          | std::vector<DataType> Brush::Data::Dataset::unique_data_types | 
        
      
 
keeps track of the unique data types in the dataset. 
Definition at line 62 of file data.h.
 
 
◆ use_batch
      
        
          | bool Brush::Data::Dataset::use_batch | 
        
      
 
 
◆ use_validation
      
        
          | bool Brush::Data::Dataset::use_validation | 
        
      
 
 
◆ validation_data_idx
  
  
      
        
          | vector<size_t> Brush::Data::Dataset::validation_data_idx | 
         
       
   | 
  
private   | 
  
 
 
◆ validation_size
      
        
          | float Brush::Data::Dataset::validation_size | 
        
      
 
percentage of original data used for train. if 0.0, then all data is used for train and validation 
Definition at line 87 of file data.h.
 
 
◆ Xref
      
        
          | std::optional<std::reference_wrapper<const ArrayXXf> > Brush::Data::Dataset::Xref | 
        
      
 
 
      
        
          | ArrayXf Brush::Data::Dataset::y | 
        
      
 
length N array, the target label 
Definition at line 80 of file data.h.
 
 
The documentation for this class was generated from the following files: