An Excel file for molecular biologists' day-to-day use. It provides functions ranging from basic ReverseComplement and Translate to more sophisticated Needleman alignment and motif searches.

ATG_molbio_excel

Overview

Excel VBA (Visual Basic for Applications) is a feature of Microsoft Excel for extending its functionality and automating tasks. This repository contains code that adds molecular biology functions to Excel.

Excel file

Download the "ATG_molbio_excel.xlsb" file and enable macros when opening it. The workbook has been tested on a Mac and should also work on Windows.

Usage

  • Basic functions: Reverse, Complement, ReverseComplement, gcContent, and Translate:
=Reverse(DNA/RNA)

=Complement(DNA/RNA)
=Complement_Relaxed(DNA/RNA)

=ReverseComplement(DNA/RNA)
=ReverseComplement_Relaxed(DNA/RNA)

=gcContent(DNA/RNA)

=Translate3LettersAA(DNA/RNA)
=Translate(DNA/RNA)

Reverse_and_Complement GC_and_Translate
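
To cross-check the workbook's results outside Excel, the basic functions can be sketched in Python. This is a minimal sketch, not the repository's VBA code. The function names only mirror the worksheet functions, and the handling of RNA, lowercase letters, incomplete codons, and unknown codons is an assumption. Translation uses the standard genetic code, and `gc_content` returns a fraction, whereas the workbook may report a percentage.

```python
# Hypothetical Python counterparts of the worksheet functions (not the VBA source).
from itertools import product

# Assumption: U is treated like T, and the complement is written as DNA.
COMPLEMENT = str.maketrans("ACGTUacgtu", "TGCAAtgcaa")

def reverse(seq):
    return seq[::-1]

def complement(seq):
    return seq.translate(COMPLEMENT)

def reverse_complement(seq):
    return complement(seq)[::-1]

def gc_content(seq):
    """Fraction of G and C bases (0.0 for an empty sequence)."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq) if seq else 0.0

# Standard genetic code, listed in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(seq):
    """One-letter translation of frame 1; '*' marks a stop codon."""
    seq = seq.upper().replace("U", "T")
    # Assumption: trailing bases that do not form a full codon are ignored,
    # and codons containing non-ACGT letters are reported as 'X'.
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )
```

For example, `reverse_complement("ATGC")` gives `"GCAT"` and `translate("ATGGCCTAA")` gives `"MA*"`.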

  • PCR: produce the amplicon sequence from the Fwd, Rev, and Template sequences:
=PCR(Fwd, Rev, Template)	
=PCR(Fwd, Rev, Template, "Length")	

PCR
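
The idea behind an in-silico PCR can be sketched in Python as follows. This is an illustration under stated assumptions, not the workbook's implementation: primers must match the template exactly, the forward primer is searched on the given strand, the reverse primer is searched as its reverse complement downstream of the forward site, and an empty string is returned when no product is found. The name `pcr` and these conventions are hypothetical.

```python
def reverse_complement(seq):
    """Reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]

def pcr(fwd, rev, template):
    """Return the amplicon bounded by the two primers, or '' if none is found."""
    template = template.upper()
    fwd = fwd.upper()
    # The reverse primer anneals to the top strand as its reverse complement.
    rev_site = reverse_complement(rev)

    start = template.find(fwd)
    if start == -1:
        return ""
    end = template.find(rev_site, start)
    if end == -1:
        return ""
    return template[start:end + len(rev_site)]

template = "TTACGTACGATTTCCCGGGAAAGCTAGCTAAA"
amplicon = pcr("ACGTACGA", "TAGCTAGC", template)
```

Here `amplicon` is `"ACGTACGATTTCCCGGGAAAGCTAGCTA"`, and `len(amplicon)` is 28, which is presumably the kind of value the "Length" option reports.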

  • Needleman alignment: align two sequences:
=NeedleAlignmnet(Sequence1,Sequence2)

Needle
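
The function name points to the Needleman-Wunsch global alignment algorithm, which can be sketched in Python as below. The scoring values (match +1, mismatch -1, linear gap penalty -1) and the name `needleman_wunsch` are illustrative assumptions. The workbook's own defaults are set in its "Info" worksheet and are not reproduced here.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-1):
    """Globally align a and b; return (aligned_a, aligned_b, score)."""
    n, m = len(a), len(b)

    def sub(i, j):
        # Substitution score for aligning a[i-1] with b[j-1].
        return match if a[i - 1] == b[j - 1] else mismatch

    # score[i][j] is the best score for aligning a[:i] with b[:j].
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            score[i][j] = max(
                score[i - 1][j - 1] + sub(i, j),  # align two letters
                score[i - 1][j] + gap,            # gap in b
                score[i][j - 1] + gap,            # gap in a
            )

    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub(i, j):
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i -= 1
            j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b)), score[n][m]
```

When several alignments share the best score, this sketch returns only one of them; which one depends on the order of the checks in the traceback.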

  • Motif Search: Count or list the occurrences of a motif in a sequence (searching both strands). For example, this can be used to list all targetable SpyCas9 (NNNNNNNNNNNNNNNNNNNNNGG) or AspCas12a (TTTNNNNNNNNNNNNNNNNNNNNNN) sites:
=Motifs(Sequence, Motif, "Count")
=Motifs(Sequence, Motif, "List")

Motifs
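
A two-strand motif search with degenerate bases can be sketched in Python as follows. This is not the workbook's code. The name `find_motifs`, the support for the full IUPAC nucleotide alphabet (the examples above only require N), the 0-based top-strand coordinates, and the reporting of overlapping hits are all assumptions made for illustration.

```python
import re

# IUPAC nucleotide codes expressed as regular-expression character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[GC]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}

def reverse_complement(seq):
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]

def find_motifs(sequence, motif):
    """List (strand, start, match) for every motif hit on both strands.

    start is the 0-based position of the leftmost base of the hit on the
    given (top) strand; match is read 5'->3' on the strand where it was found.
    """
    sequence = sequence.upper()
    # A lookahead makes the search report overlapping hits as well.
    body = "".join(IUPAC[base] for base in motif.upper())
    pattern = re.compile("(?=(" + body + "))")

    hits = [("+", m.start(), m.group(1)) for m in pattern.finditer(sequence)]
    bottom = reverse_complement(sequence)
    for m in pattern.finditer(bottom):
        top_start = len(sequence) - (m.start() + len(motif))
        hits.append(("-", top_start, m.group(1)))
    return hits

hits = find_motifs("ATGGACCTA", "NGG")
```

Here `hits` is `[("+", 1, "TGG"), ("-", 5, "AGG")]`, and `len(hits)` gives the count. Note that a palindromic motif is reported once per strand in this sketch.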

Alternatively, the "ListMotifs" worksheet provides a clickable button that finds the matching motifs in a sequence and lists them.

ListMotifs

  • Sequence Distances: Calculate the Hamming distance or the edit distance (a.k.a. Levenshtein distance) between two sequences.

Distances
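
The two distances can be sketched in Python as below. The function names are hypothetical and the code is not taken from the workbook. The Hamming distance counts differing positions and is only defined for sequences of equal length; the edit distance counts the minimum number of substitutions, insertions, and deletions.

```python
def hamming_distance(a, b):
    """Number of positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("Hamming distance requires sequences of equal length")
    return sum(x != y for x, y in zip(a, b))

def edit_distance(a, b):
    """Levenshtein distance, computed row by row with dynamic programming."""
    # prev[j] is the distance between the processed prefix of a and b[:j].
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            curr.append(min(
                prev[j] + 1,             # delete x
                curr[j - 1] + 1,         # insert y
                prev[j - 1] + (x != y),  # substitute (free if equal)
            ))
        prev = curr
    return prev[-1]
```

For example, `hamming_distance("GATTACA", "GACTATA")` is 2 and `edit_distance("ACGT", "AGT")` is 1.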

  • Alignment Decoration: Change the font color of mismatches between two sequences according to their type (transition, transversion, or indel). Refer to the "Info" sheet to specify the desired font color for matches and mismatches.

AlignmentDecoration
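
The classification behind the coloring can be sketched in Python as follows. A transition swaps two purines (A, G) or two pyrimidines (C, T), a transversion swaps a purine and a pyrimidine, and an indel is a column with a gap. The function names, the use of "-" as the gap character, and the "other" label for non-ACGT letters are assumptions for illustration, not the workbook's macro.

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}

def classify_column(x, y):
    """Classify one column of a pairwise DNA alignment."""
    x, y = x.upper(), y.upper()
    if x == y:
        return "match"
    if x == "-" or y == "-":
        return "indel"
    pair = {x, y}
    if pair <= PURINES or pair <= PYRIMIDINES:
        return "transition"
    if pair <= PURINES | PYRIMIDINES:
        return "transversion"
    return "other"  # assumption: ambiguous or non-DNA letters

def classify_alignment(aligned_a, aligned_b):
    """Classify every column of two aligned sequences of equal length."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    return [classify_column(x, y) for x, y in zip(aligned_a, aligned_b)]

labels = classify_alignment("AACG-T", "AGAGTT")
```

Here `labels` is `["match", "transition", "transversion", "match", "indel", "match"]`; each label would then be mapped to the font color chosen for that category.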

  • CasOFFinder Decoration: Cas-OFFinder is a computational tool for finding genome-wide off-target candidates of RNA-guided proteins. The designated worksheet runs a macro that visualizes the differences between the crRNA and the (off-target) DNA.

CasOFFinderDecoration

Important to know

  • The "Relaxed" variants of the basic functions accept non-DNA letters; this is intentional.
  • Microsoft Excel limits each cell to 32,767 characters, so longer sequences do not fit in a single cell.
  • The default parameters of the Needleman function are adjustable via the "Info" worksheet.
  • Font colors for matches and for the different types of mismatches are adjustable by changing the font color of the designated cells in the "Info" worksheet.

AlignmentFontColors

Contact

Please report bugs to Amir.Taheri.Ghahfarokhi@Gmail.com
