Repository for the Vinnova project "Mixture of Experts models Tailored for Fleet Intelligence"
This repository is designed for collaborative development with multiple people working simultaneously:
architecture/- Shared, reusable model components (backbone, experts, router, MoE)training/- Shared training utilities and frameworkspipelines/- Experiment-specific implementations that use the shared modules
python3 -m venv .venv
source .venv/bin/activate
pip3 install -e . -U- Shared Components: Add reusable model components to
architecture/or training utilities totraining/ - New Experiments: Create new experiment directories in
research_pipelines/(e.g.,research_pipelines/zod_fl/,research_pipelines/cifar10/) - Experiment Structure: Each experiment should have:
config.py- Experiment-specific configurationdataset.py- Dataset loading and preprocessingmodel.py- Model building using shared componentstrain.py- Training script, preferably using shared trainer
- Established Experiments: Established experiments can be moved to
production_pipelines/(with good test coverage). Add aREADME.mdwith key findings and learnings.