Random data generators work plan

Introduction

  • This org-mode file contains a work plan list and a unit test list that correspond to (and hopefully reflect) the development of the Python package RandomDataGenerators.

Work plan [100%] [5/5]

Random strings generation

Random words generation

  • [X] Using “data loader”
  • [X] Having word type / kind filter
  • [X] Handling failed expectations

Random pet names generation

  • [X] Using “data loader”
  • [X] Having species filter
  • [X] Having weighted sampling
  • [X] Handling failed expectations
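
The three pet-name items above (species filter, weighted sampling, failed expectations) can be illustrated with a minimal standard-library sketch. It does not use the package: `PET_NAMES`, `sample_pet_names`, the names, and the counts are all hypothetical, and raising `ValueError` for an unknown species is an assumption about what "handling failed expectations" might look like.

```python
import random

# Hypothetical stand-in for the pet-name data: species -> {name: count}.
# The names and counts are made up for illustration only.
PET_NAMES = {
    "dog": {"Max": 30, "Bella": 25, "Rex": 5},
    "cat": {"Luna": 40, "Milo": 20},
}


def sample_pet_names(n, species=None, weighted=True, rng=None):
    """Draw n names with replacement, optionally filtered by species."""
    rng = rng or random.Random()
    if species is None:
        pools = list(PET_NAMES.values())
    elif species in PET_NAMES:
        pools = [PET_NAMES[species]]
    else:
        # Assumed failure mode: reject an unknown species explicitly.
        known = ", ".join(sorted(PET_NAMES))
        raise ValueError(f"Unknown species {species!r}; expected one of: {known}")

    names, counts = [], []
    for pool in pools:
        names.extend(pool.keys())
        counts.extend(pool.values())

    # With weights=None, random.choices samples uniformly.
    weights = counts if weighted else None
    return rng.choices(names, weights=weights, k=n)
```

Passing a seeded `random.Random` instance as `rng` makes the draws reproducible, which helps when the same call is reused in unit tests.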

Random pretentious job title

  • [X] English
  • [X] Bulgarian
  • [X] Number of words
  • [X] Handling failed expectations
  • [X] Random generation with different number of words and languages in one signature spec

Random data frame [77%] [7/9]

  • [X] Number of rows and columns
  • [X] Using named columns
  • [X] Column names generator spec utilization
  • [X] Tie up the generator specs and the column names
  • [X] Row names / keys
  • [ ] Both long and wide format
  • [ ] Ensure that data frames can be generated with:
    • Large number of rows and columns
    • Few non-NA elements
  • [X] Experiment with “thousands of random data frames” generations
  • [X] Experiment with utilization for random sparse matrix generation
    • SSparseMatrix objects
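
As a rough illustration of the open items above (long and wide format, many rows and columns, few non-NA elements), the sketch below generates a random table directly in long format and converts it to wide format on demand. It uses only the standard library; `random_long_frame` and `long_to_wide` are hypothetical names, not package functions, and `None` stands in for NA.

```python
import random


def random_long_frame(n_rows, n_cols, n_values, rng=None):
    """Return n_values long-format records (row, column, value) in distinct cells."""
    rng = rng or random.Random()
    total = n_rows * n_cols
    if n_values > total:
        raise ValueError("n_values cannot exceed n_rows * n_cols")
    # Sampling from a range avoids materializing every cell of a large table.
    cells = rng.sample(range(total), n_values)
    return [(cell // n_cols, cell % n_cols, rng.random()) for cell in cells]


def long_to_wide(records, n_rows, n_cols):
    """Expand long-format records into a dense row-major table; None marks NA."""
    wide = [[None] * n_cols for _ in range(n_rows)]
    for i, j, value in records:
        wide[i][j] = value
    return wide
```

Keeping the long format as the primary representation means the cost scales with the number of non-NA values rather than with rows times columns; the same (row, column, value) triplets are also the usual input for building a sparse matrix.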

Unit tests [50%] [1/2]

Random functions [100%] [4/4]

  • [X] random_string
    • [X] Basic usages
    • [X] Usages with regex specs
  • [X] random_word
    • [X] Basic usages
    • [X] Kind/type spec failure
  • [X] random_pet_name
    • [X] Basic usages
    • [X] Species spec failure
  • [X] random_pretentious_job_title
    • [X] Basic usages
    • [X] Random language
    • [X] Random number of words
    • [X] Language spec failure

Random data frame aspects [75%] [3/4]

  • [X] Simple usages
  • [X] Large number of columns and “insufficient” column names
  • [X] Thousands of data frames tests
  • [ ] Find a way to incorporate the “thousands of data frame tests” into one or more unit tests.
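
One possible way to fold the "thousands of data frames" experiment into a single unit test is to loop over seeds inside one test method and use `unittest`'s `subTest`, so that each failing seed is reported separately. This is only a sketch: `make_frame` is a hypothetical stand-in for the real generator, and the checked property (rectangular shape) is a placeholder for whatever invariants the package should guarantee.

```python
import random
import unittest


def make_frame(seed):
    """Hypothetical stand-in for a random data frame generator."""
    rng = random.Random(seed)
    n_rows, n_cols = rng.randint(1, 20), rng.randint(1, 10)
    return [[rng.random() for _ in range(n_cols)] for _ in range(n_rows)]


class ThousandsOfFramesTest(unittest.TestCase):
    def test_frames_are_rectangular(self):
        # One test method, many seeded cases; subTest labels each failure by seed.
        for seed in range(2000):
            with self.subTest(seed=seed):
                frame = make_frame(seed)
                self.assertGreaterEqual(len(frame), 1)
                self.assertEqual(len({len(row) for row in frame}), 1)


def run_suite():
    """Run the test case programmatically and return the unittest result object."""
    suite = unittest.defaultTestLoader.loadTestsFromTestCase(ThousandsOfFramesTest)
    return unittest.TextTestRunner(verbosity=0).run(suite)
```

Because every case is derived from its seed, a failure can be reproduced by calling `make_frame` with the reported seed.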

Documentation [50%] [1/6]

  • It is considered helpful to have:
    • Overall document with examples of all package functions
    • Wolfram Function Repository (WFR) style document for each function

Overall document with all functions

random_string WFR style

random_word WFR style

random_pet_name WFR style

random_pretentious_job_title WFR style

random_data_frame WFR style