Installation
============

Downloading brioche:
--------------------

Brioche can be downloaded directly from the github page as shown below.

.. code-block:: console

   git clone https://github.com/plantinformatics/brioche.git


after downloading you can navigate to the brioche folder. 

.. code-block:: console

   cd brioche

Here you will have a folder structure laid out like the following:

.. image:: Images/Brioche_github_downloaded.png
   :alt: Brioche downloaded from github
   :width: 300px
   :align: center


Software dependencies:
----------------------

Brioche uses Nextflow as a workflow manager. Brioche also uses conda as a package manager and git for recording relevant metadata. Each of these are required to be installed either as modules on the HPC system 
or directly by the user.

e.g., 

.. code-block:: console

   module load Nextflow 


or to download nextflow


.. code-block:: console

   wget -qO- https://get.nextflow.io | bash 


The same applies for conda

.. code-block:: console

   module load Miniconda3 


or to download conda we navigate to the desired location for storing conda (usually your home directory), download,unpack and initialise conda

.. code-block:: console

   wget https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh -O /path/to/desired/directory/miniconda/miniconda.sh
   bash /path/to/desired/directory/miniconda/miniconda.sh -b -u -p /path/to/desired/directory/miniconda/
   /path/to/desired/directory/miniconda/bin/conda init bash


Finally, for git

.. code-block:: console

   module load git


or (asssuming Debian/Ubuntu Linux)

.. code-block:: console

   apt-get install git


Brioche is currently written to work within **slurm** HPC systems so will not run out of the box on other systems. 
If the user is interested in working with other HPC management systems individual .sh files will need to have their headers 
updated and the nextflow.config parameters updated manually.


During the first run of Brioche, the main conda environment will be built automatically from the conda_environment.yml file provided in the brioche directory. 
NOTE: this means that the launch partition of Brioche (e.g., example_run_script_brioche.sh) needs to have access to the internet so set the partition to one which can download. 


Some additional software is required for extended functionality in Brioche. This can be found in the Additional_functions folder and will be installed as a conda environment using the file brioche-vcf.yaml 
This software environment is used to do additional insilico and other functionality. 

This will install when running these scripts, or you can install it manually by running build_brioche-vcf_env.sh


File layout:
------------

Brioche comes with a number of different files and folders but only a small number are important for the user to be aware of. 

e.g., the following folders are usefull for the user 

example_files/

example_files contains example formatted files for each optional and mandatory input and parameter file Brioche can use

.. image:: Images/example_files_folder.png
   :alt: example_files folder
   :width: 300px
   :align: center


Additional_functions/

Additional_functions contains a collection of helpful bash and R scripts designed to expand the regular use of Brioche through 1. helping create input files from diverse datatypes (e.g.,run_convert_vcf_to_brioche_input.sh)
and 2. Running specialised Brioche runs including testing the parameter space for marker uniqueness (e.g., Run_brioche_optimise.sh), or insilico adding reference genome genotypes (e.g., Run_insilico_add.sh)

Each of these scripts contains a short description of how to run in the file and what settings to change but additional details can be found in the relevant usecase described in this User guide.

.. image:: Images/Additional_functions.png
   :alt: Additional_functions folder
   :width: 300px
   :align: center


The following files are useful for the user:

params.config

This file contains a list of all parameters that can be changed in Brioche allowing for greater control of a run

nextflow.config

This file contains additional details about resource allocation for the different types of runs Brioche can do. 
This can be edited to allocated great/less memory, CPUs etc per individual run if extremely large/small datasets
and reference genomes are being used and standard allocations are not efficient.


example_run_script_brioche.sh

This file contains a simple run script for how to run Brioche. It can be opened, the slurm settings updated and then launched with sbatch commands.


Introduction:
~~~~~~~~~~~~~

To return to the Introduction page go to :doc:`Introduction <introduction>`


Run setup:
~~~~~~~~~~

For setting up a run go to :doc:`Setting up a run <setting_up_a_run>`