Help

1. Installation

  • For Windows:

Install PERL from https://www.perl.org
Test installation by typing perl -v in the cmd (command prompt) window

Install Bioperl by following the instructions at bioperl.org or follow the steps:

I) Open a cmd window

II) Type cpan to enter the CPAN shell. If CPAN is not recognized, you may have to set the PATH to C:\Strawberry\perl\lib (or the location where CPAN.pm is installed).

III) At the cpan> prompt, type install CPAN to upgrade to the latest version.
IV) Quit (by typing q) and reload CPAN. You may be asked some configuration questions; accepting default questions should work fine.

V) At the cpan> prompt, type o conf prefer_installer MB to tell CPAN to prefer to use Build.PL scripts for installation, and the type o conf commit to save the choice.

VI) At the cpan> prompt, type
install Module::Build

VII) At the cpan> prompt, type
install Test::Harness

VIII) At the cpan> prompt, type
 install Test::Most

2. Finish the install with BioPerl from GitHub
Install the current version of Bioperl manually using a ZIP file from the GitHub repository:
https://cpan.metacpan.org/authors/id/C/CJ/CJFIELDS/BioPerl-1.007001.tar.gz

I) Extract the archive using 7zip or WinRAR.

II) In a cmd window go to the directory you extracted the .rar file to. E.g. if you extracted to directory C:\Downloads\bioperl type cd and the path link of the folder.

III) At the prompt type:
perl Build.pl
and few questions will be asked, answer the questions as per your requirement.

IV) Type

perl Build test

All the tests should pass, but if they don’t, your usage of Bioperl may or may not be affected by the failures, so you can choose to continue anyway.

3) Type:

perl Build install
to install bioperl.
Install R in windows

  • Add Rscript.exe and its bin to PATH

Start -> Control panel -> System and security -> Advanced system settings-> Click “environment variables”

Or

Start -> Type “View advanced system settings” -> Click “environment variables” and edit ->
Under “System variables”, select Path -> Click on edit à Click new -> Click browse -> Browse to C:/Program files/R/R(version)/bin à Ok
Also add the “exe” browse to upto bin; add the location and manually add /Rscript.exe.

  • Install packages: Open R GUI and type install.packages(type="source")

Source of igraph install.packages("igraph")
User can also download binaries of package and unzip it R library folder.

  • Download and install other dependent packages (if any).

2. Supported Platforms

Currently tested on Windows 10 x64 Version. But as Perl is platform independent it should work on Linux, Unix and Mac (Although test runs need to be performed).
Windows cmd: Type perl PBIT_n.pl or type PBIT_n.pl to execute the program.
3. Dependencies

For PBIT the following must be installed.
I.) PERL v5.8.3 and above
II.)Bioperl and its libraries (https://metacpan.org/pod/release/CJFIELDS/BioPerl-1.007001/BioPerl.pm)
III.) BLAST+ v2.10 (https://ftp.ncbi.nlm.nih.gov/blast/executables/LATEST/) 
IV.) Python v3.7 and above (https://www.python.org/downloads/)
i) pip v19.0 and above (https://pypi.org/project/pip/)
ii) COBRA 0.13.3 and above (https://pypi.org/project/cobra/)
iii) Libraries: sbml, pandas, xlwt, xlxswriter, and xlrd (install using pip)
V)  R and its packages (https://cran.r-project.org/bin/windows/base/)
i) igraph(https://igraph.org/r/#downloads)

IMPORTANT:
Once Perl, BioPerl, Python, BLAST+ have been installed and working, go to the Perl command line and enter cpan by giving the command – cpan and you should be in the cpan mode of Perl.
Enter the following commands to install the respective modules which will be needed for PBIT. Install the modules/libraries one at a time.

  • install Inline
  • install Inline::Python
  • install Data::Table
  • install HTML::TokeParser
  • install Parse::RecDescent
  • install Devel::Leak::Module
  • install Regexp::Common
  • install Number::Format
  • install GD::Graph
  • install Statistics::Basic
  • install Statistics::Descriptive
  • install Math::Utils

4. List of available modules and submodules


Figure: Modules and submodules of PBIT-offline

5. Selection of modules

  • Comparative genomics

This module has 6 submodules. All the sub-modules can be executed at once or individually. They can also be linked together by entering the module and submodule numbers as comma separated values. The screening module can also be linked to annotation module.

Input: FASTA formatted protein sequence files from UniProtKB
Parameters: Parameters such as the e-value (to be entered in float or integer) and percent identity or alignment length cut-off (to be entered in float or integer)

Output files: ‘queryname.bls’ – for each query sequence
‘queryname.fasta’ - for each query sequence
‘Ouputformodule_.fasta’ – non-homologous sequences for first three submodules (see figure) and homologous sequences for last three submodule (see figure).

  • Annotation

Annotation module has 2 submodules, these submodules are independent and have no pipeline connectivity between each other.
Input: FASTA formatted protein sequence files from UniProtKB
Parameters: Choice of submodules

Output files:  ‘queryname.txt’ – for each query sequence
queryname.html’ - for each query sequence
‘out_func.xls’ or ‘out_path.xls’ – annotated results of the query sequence

  • Host Pathogen Interaction

Host Pathogen Interaction is an individual module and has no pipeline connectivity between any modules/submodules.
Input: FASTA formatted protein sequence files from UniProtKB
Parameters: e-value (to be entered in float or integer) and percent identity (to be entered in float or integer)

Output files: ‘queryname.bls’ – for each query sequence
‘queryname.fasta’ - for each query sequence
‘Ouputformodule_.fasta’ – non-homologous sequences for first three submodules (see figure) and homologous sequences for last three submodule (see figure).

  • Systems Biology

Systems Biology has 2 submodules and are independent of each other, no pipeline has been formed.
Input: SBML 2 genome scale metabolic model in JSON format
reaction ids (for FVA) [optional]
gene ids in .txt format (for individual gene knockouts) [optional]
Parameters: choice of submodules
reaction constraints (bounds) [optional]

Output files: ‘Model_Associations.csv’ contains gene-reaction association of the metabolic model,
‘FVA_Model.xls’ contains flux variability analysis results,
‘FVA_Filtered.xls’ contains reaction ids with equal minimum and maximum flux values
‘FVA_Final.xls’ contains genes id associated with FVA_Filtered.xls reactions.
‘growth_sgd.xls’ contains growth rates under simulation of single gene deletion of entire model
‘Gene_Knockout.txt’ contains growth rates of model under individual gene knockout for queried genes

  • Network Analysis

Input: Proteome (edge list as comma separated file)
Parameters: Choice of module
Output files: Text file with list of shortlisted proteins

After the tool has been executed. The next time the user executes PBIT, the files of the previous results will be shifted to a new folder which will be named according to the format - D/M/Y and time conventions according to Linux so that the files can be stored and no overlapping of results can take place.

6. Warnings

  • Erroneous installation of Perl/Bioperl/Blast+/Python and their respective libraries can lead to failure in working of the tool.
  • After installation, few test runs should be done by the user to make sure that the tools installed are working without any errors.
  • Depending on the systems configuration the tool can execute in a certain amount of time, usually being few seconds to minutes.
  • All input files must be kept in the working directory along with PBIT executables.
  • User must execute PBIT as an administrator as permissions are needed to move or create files.
  • The user must have an active internet connection if they want to execute the Annotation module.
  • The user must enter the choices and syntax as instructed.

 

| © 2016, Biomedical Informatics Centre, NIRRH |
National Institute for Research in Reproductive Health, Jehangir Merwanji Street, Parel, Mumbai-400 012
Tel: 91-22-24192104, Fax No: 91-22-24139412