Developer’s Guide
Poetry Setup Guide
To manage our development environment and configure our project packaging, it is most convenient to use Poetry. Poetry uses the pyproject.toml and poetry.lock files to manage our packages/dependencies and make it easy to automatically install them to a virtual environment. It also makes it easy to build and install our package locally as well as publish it to PyPI, obviating the need for a setup.py file.
Additional Poetry documentation:
Requirement: Python version 3.8 or greater
Installation
It is possible to install Poetry using Pip, but it is recommended to instead install it via a script using the following command:
curl -sSL https://install.python-poetry.org | python3 -
This will install Poetry to ~/.poetry/bin, which should be automatically prepended to your PATH by modifying your ~/.profile, ~/.bash_profile, and/or ~/.bashrc. If you are using a shell other than Bash, you will have to add it to your PATH manually.
Environment Setup
All of the following commands should be run in the root of our project directory as that is where our .python-version, pyproject.toml, and poetry.lock files are located (for use with Pyenv).
If you use Pyenv, change the line in your ~/.bashrc, etc.:
eval “$(pyenv init -)”
to:
[ $POETRY_ACTIVE ] || eval “$(pyenv init -)”
On MacOS, Poetry virtual environments are installed to ~/Library/Caches/pypoetry/virtualenvs. To instead install to a local .venv directory, first run the command:
poetry config settings.virtualenvs.in-project true
When creating a Python virtual environment, Poetry will automatically install the version of Python of whatever python executable appears first on your PATH.
Development Workflow
1. Make changes
To work on a fix or feature, switch to the feature branch in the BEE repo (see Contribute).
2. Create a Virtual Environment and Install Dependencies
After making your changes create a Python 3.x virtual environment and install our project dependencies (including developer dependencies):
poetry install
3. Activate the Virtual Environment
To activate the virtual environment (‘exit’ or EOF to deactivate):
poetry shell
4. Start the BEE components
beeflow core start
5. Test
Attempt to write tests that cover all the new/modified lines on your feature branch. Test files are in the beeflow/tests
folder and follow the naming convention test_MODULE_NAME.py
. You may need to create a new file if one doesn’t exist for the module you are working on. Make sure your test function begins with test_
; test_FUNCTION_NAME
is a good naming convention.
Some useful features of pytest
to write your tests:
@pytest.mark.parametrize
: This allows you to run the same test with slight variations which can be useful to increase line coverage or the robustness of your test. See How to parametrize fixtures and test functions.tmp_path
: Many actions in the codebase create files. Do not let these files be left around at the end of the test.pytest
provides a temporary directory that will automatically be cleaned up at the end of the test and can be accessed withtmp_path
. See How to use temporary directories and files in tests.mocker
: If a function you are testing calls functions that cannot reasonably be called during the test; e.g.input
, you can tellpytest
to ignore that function or create a dummy ‘mocked’ function to behave in a way you specify usingmocker
. See pytest-mock: Usage.
See also Running Tests
6. Commit Changes
If you’re done making changes, follow the git workflow specified in Contribute.
7. Continue Development
If you want to continue making changes, add them and then pause any running workflows:
beeflow pause $ID
Stop the bee components:
beeflow core stop
Now you can repeat steps 2 to 5.
Dependency and Package Management with Poetry
Update Dependencies
To update the package dependencies and generate a new poetry.lock (tracked):
poetry update
Add a New Dependency
To add a new dependency to pyproject.toml:
poetry add <package>
Remove a Dependency
To remove a dependency from pyproject.toml:
poetry remove <package>
Build the Package
To build the package as a tarball and a wheel (by default):
poetry build
Check the Validity of pyproject.toml
poetry check
Publish the Package to a Remote Repository
poetry publish
Running Tests
BEE includes unit and integration tests that can be run on a local system.
To run the unit tests, make sure to install beeflow with poetry install -E cloud_extras
; the -E cloud_extras
option forces Poetry to install extra dependencies required for some of the cloud API tests. After loading a shell with poetry shell
, you can run the unit tests with pytest beeflow/tests
.
Some useful pytest options
-k EXPRESSION
: Allows you to only run tests that match a keyword expression. This is useful when writing a test case as you can run only that test. You can also run a test file for a specific module when working on an enhancement to quickly ensure the most relevant tests still pass.--durations 0
: This will show the durations of all tests run that are >= 0.005s. Since tests run on CI it is best to keep them as fast as possible. A test that takes over 1s is slow in this context.--cov beeflow --cov-report term-missing
: This will check test line coverage for each file. It is useful to ensure lines being added/modified in a feature branch have test coverage. See pytest-cov’s documentation.
See How to invoke pytest for even more options when running pytest
.
Integration tests
For the integration tests, you’ll first have to start beeflow with beeflow core start
(see Command Line Interface). Then, making sure that you have Charliecloud loaded in your environment, you can run ./ci/integration_test.py
to run the tests. This must be done from the root of BEE repository. The integration tests will create a directory ~/.beeflow-integration
to be used for storing temporary files as well as inspecting failure results. The script itself includes a number of options for running extra tests, details of which can be found through --help
and other command line options. Running the script without any options will run the default test suite. Some tests are disabled by default due to runtime or environment constraints and need to be specified in a comma-separated list with --tests
(-t
) to be run. Run the script with just --show-tests
(-s
) to see a list of all possible tests.
Git Workflow
See Contribute for more information