Skip to content

RobertoCorti/skfeaturellm

Repository files navigation

SKFeatureLLM

SKFeatureLLM Logo

SKFeatureLLM is a Python library that brings the power of Large Language Models (LLMs) to feature engineering for tabular data, wrapped in a familiar scikit-learn–style API. The library aims to leverage LLMs' capabilities to automatically generate and implement meaningful features for your machine learning tasks.

📑 Table of Contents

🌟 Key Features

  • 🤖 LLM-powered feature engineering
  • 🔌 Model-agnostic: works with any LLM provider (OpenAI, Anthropic, etc.)
  • 🛠 Scikit-learn compatible API
  • 📊 Comprehensive feature evaluation and reporting
  • 🎯 Support for both supervised and unsupervised feature engineering

📦 Installation

To install the latest release of SKFeatureLLM from PyPI:

pip install skfeaturellm

This will install the library and its core dependencies for end users.

✅ Running Tests

To run the test suite, ensure pytest is installed and execute:

poetry run pytest

Tests are located in the tests/ directory and cover the core functionality of SKFeatureLLM.

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🤝 Contributing

We welcome contributions! Here's how you can help:

  1. Report Bugs: If you find a bug, please open an issue with a detailed description.
  2. Suggest Features: Have an idea for a new feature? Open an issue to discuss it.
  3. Submit Pull Requests: We love PRs! Here's how to submit one:
    • Fork the repository
    • Create a new branch for your feature
    • Make your changes
    • Submit a pull request

Development Setup

  1. Clone the repository:
git clone https://github.com/yourusername/skfeaturellm.git
cd skfeaturellm
  1. Install development dependencies:
pip install -e ".[dev]"
  1. Run tests:
pytest
  1. Format code:
black .
isort .

Code Style

We use:

  • Black for code formatting
  • isort for import sorting
  • pylint for linting
  • mypy for type checking

Please ensure your code passes all checks before submitting a PR.

👤 Author

📚 Documentation

Our documentation is hosted on Read the Docs and includes:

For any questions or issues, please open an issue on our GitHub repository.

About

skfeaturellm is a Python library that brings the power of Large Language Models (LLMs) to feature engineering for tabular data, wrapped in a familiar scikit-learn–style API.

Topics

Resources

License

Stars

Watchers

Forks

Packages

 
 
 

Contributors