rl_salamandra_alignment
- rl_salamandra_alignment package
- Subpackages
- rl_salamandra_alignment.distributed_configs package
- rl_salamandra_alignment.templates package
- rl_salamandra_alignment.trl_scripts package
- Submodules
- rl_salamandra_alignment.trl_scripts.alignprop module
- rl_salamandra_alignment.trl_scripts.bco module
- rl_salamandra_alignment.trl_scripts.chat module
- rl_salamandra_alignment.trl_scripts.cpo module
- rl_salamandra_alignment.trl_scripts.ddpo module
- rl_salamandra_alignment.trl_scripts.dpo module
- rl_salamandra_alignment.trl_scripts.dpo_online module
- rl_salamandra_alignment.trl_scripts.dpo_vlm module
- rl_salamandra_alignment.trl_scripts.gkd module
- rl_salamandra_alignment.trl_scripts.grpo module
- rl_salamandra_alignment.trl_scripts.kto module
- rl_salamandra_alignment.trl_scripts.nash_md module
- rl_salamandra_alignment.trl_scripts.orpo module
- rl_salamandra_alignment.trl_scripts.reward_modeling module
- rl_salamandra_alignment.trl_scripts.sft module
- rl_salamandra_alignment.trl_scripts.sft_video_llm module
- rl_salamandra_alignment.trl_scripts.sft_vlm module
- rl_salamandra_alignment.trl_scripts.xpo module
- Module contents
- rl_salamandra_alignment.utils package
- Submodules
- rl_salamandra_alignment.cli module
- rl_salamandra_alignment.convert_dataset module
- rl_salamandra_alignment.generate_scripts module
generate_all_job_files()generate_distributed_run_script()generate_eval_scripts_for_one_training()generate_harness_eval_script()generate_launch_script()generate_local_eval_script()generate_one_job_set()generate_one_training_job()generate_slurm_preamble()get_config_ids()get_output_dir()get_script_args_string()replace_in_template()setup_macro_output_dir_tree()setup_micro_output_dir_tree()
- Module contents
- Subpackages