- This org-mode file contains a work plan list and a unit test list that correspond to (and hopefully reflect) the development of the Python package RandomDataGenerators.
- [X] Using “data loader”
- [X] Having word type / kind filter
- [X] Handling failed expectations
- [X] Using “data loader”
- [X] Having species filter
- [X] Having weighted sampling
- [X] Handling failed expectations
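The filter, weighted sampling, and failed-expectation items above can be illustrated with a minimal sketch. It does not use the package itself: the data table, the function name `sample_pet_names`, and the choice to warn and fall back to all species are all assumptions made for illustration, not the package's documented behavior.

```python
import random
import warnings

# Hypothetical toy data: (name, species, weight). Not the package's data set.
PET_NAMES = [
    ("Max", "dog", 50),
    ("Bella", "dog", 30),
    ("Luna", "cat", 40),
    ("Oliver", "cat", 10),
]


def sample_pet_names(size, species=None, weighted=True, rng=None):
    """Sample names, optionally filtered by species and weighted by count."""
    rng = rng or random.Random()
    known = {s for _, s, _ in PET_NAMES}
    if species is not None and species not in known:
        # Failed expectation (assumed policy): warn and fall back to all species.
        warnings.warn(f"Unknown species {species!r}; using all species.")
        species = None
    rows = [r for r in PET_NAMES if species is None or r[1] == species]
    names = [r[0] for r in rows]
    # With weights=None, random.choices samples uniformly.
    weights = [r[2] for r in rows] if weighted else None
    return rng.choices(names, weights=weights, k=size)
```

Passing a seeded `random.Random` instance as `rng` keeps such a sampler reproducible inside unit tests.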
- [X] English
- [X] Bulgarian
- [X] Number of words
- [X] Handling failed expectations
- [X] Random generation with different number of words and languages in one signature spec
- [X] Number of rows and columns
- [X] Using named columns
- [X] Column names generator spec utilization
- [X] Tie up the generator specs and the column names
- [X] Row names / keys
- [ ] Both long and wide format
- [ ] Ensure the ability to generate data frames with:
  - Large number of rows and columns
  - Few non-NA elements
- [X] Experiment with “thousands of random data frames” generations
- [X] Experiment with utilization for random sparse matrix generation
  - SSparseMatrix objects
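One possible way to think about the "large number of rows and columns, few non-NA elements" case, and about its link to sparse matrices, is to generate only the filled cells in long (triplet) form and expand to wide form on demand. The sketch below uses the standard library only. The function names `random_sparse_triplets` and `to_wide` are hypothetical, and it involves neither the package's data frame generator nor SSparseMatrix.

```python
import random


def random_sparse_triplets(n_rows, n_cols, n_values, rng=None):
    """Return {(row, col): value} with n_values distinct filled cells."""
    rng = rng or random.Random()
    total = n_rows * n_cols
    if n_values > total:
        raise ValueError("n_values exceeds the number of cells")
    # Sample distinct flat cell indices without materializing the full grid.
    cells = rng.sample(range(total), n_values)
    # divmod maps a flat index back to its (row, col) pair.
    return {divmod(c, n_cols): rng.gauss(0, 1) for c in cells}


def to_wide(triplets, n_rows, n_cols):
    """Expand triplets into dense rows, using None for missing cells."""
    return [[triplets.get((i, j)) for j in range(n_cols)] for i in range(n_rows)]
```

The triplet dictionary is effectively the long format, and `to_wide` produces the wide one, so the same sketch also touches the "both long and wide format" item.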
- [X] random_string
  - [X] Basic usages
  - [X] Usages with regex specs
- [X] random_word
  - [X] Basic usages
  - [X] Kind/type spec failure
- [X] random_pet_name
  - [X] Basic usages
  - [X] Species spec failure
- [X] random_pretentious_job_title
  - [X] Basic usages
  - [X] Random language
  - [X] Random number of words
  - [X] Language spec failure
- [X] Simple usages
- [X] Large number of columns and “insufficient” column names
- [X] Thousands of data frames tests
- The script thousands-of-random-data-frames.py generates thousands of tests.
- [ ] Find a way to incorporate the “thousands of data frame tests” into one or more unit tests.
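One candidate approach for the open item above is a single `unittest` test method that loops over seeds and wraps each generation in `subTest`, so a failure reports the offending seed. This is only a sketch: `make_random_frame` is a hypothetical stand-in for the real generator, and the contents of thousands-of-random-data-frames.py are not assumed here.

```python
import random
import unittest


def make_random_frame(rng):
    """Hypothetical stand-in for a random data frame generator."""
    n_rows = rng.randint(1, 20)
    n_cols = rng.randint(1, 8)
    columns = [f"col{j}" for j in range(n_cols)]
    return {c: [rng.random() for _ in range(n_rows)] for c in columns}


class ThousandsOfFramesTest(unittest.TestCase):
    def test_many_seeded_frames(self):
        for seed in range(1000):
            # subTest records each failing seed without stopping the loop.
            with self.subTest(seed=seed):
                frame = make_random_frame(random.Random(seed))
                lengths = {len(v) for v in frame.values()}
                # All columns must have the same number of rows.
                self.assertEqual(len(lengths), 1)
                self.assertTrue(1 <= len(frame) <= 8)
```

Seeding each generation makes any failure reproducible by rerunning with the reported seed.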
- It is considered helpful to have:
  - Overall document with examples of all package functions
  - Wolfram Function Repository (WFR) style document for each function