Lists (2)
Sort Name ascending (A-Z)
- All languages
- AMPL
- C
- C#
- C++
- CAP CDS
- CSS
- Clojure
- Common Workflow Language
- Cuda
- Cython
- D
- Dart
- Dockerfile
- Euphoria
- Go
- HTML
- Handlebars
- Haskell
- HyPhy
- Java
- JavaScript
- Jupyter Notebook
- Kotlin
- Lua
- M4
- Makefile
- Markdown
- Mojo
- Nextflow
- Nim
- Objective-C
- Objective-C++
- Perl
- PostScript
- PowerShell
- PureBasic
- Python
- R
- Roff
- Ruby
- Rust
- Scala
- Scilab
- Shell
- Svelte
- Swift
- Tcl
- TeX
- TypeScript
- Vue
- WDL
- Zig
Starred repositories
Compute hash digests for DNA sequences in a FASTA file
VIRify: detection of phages and eukaryotic viruses from metagenomic and metatranscriptomic assemblies
Access a database of word frequencies, in various natural languages.
💎 An easy-to-use workflow for generating context specific genome-scale metabolic models and predicting metabolic interactions within microbial communities directly from metagenomic data
Assessing the quality of metagenome-derived genome bins using machine learning
SemiBin: metagenomics binning with self-supervised deep learning
software to identify primers that can distinguish genomes
Plant foundation DNA large language models (LLMs) trained on different models
rga: ripgrep, but also search in PDFs, E-Books, Office documents, zip, tar.gz, etc.
Mikado is a lightweight Python3 pipeline whose purpose is to facilitate the identification of expressed loci from RNA-Seq data * and to select the best models in each locus.
tcsenpai / multi1
Forked from bklieger-groq/g1multi1: create o1-like reasoning chains with multiple AI providers (and locally). Supports LiteLLM as backend too for 100+ providers at once.
g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains
Export and Share your ChatGPT conversation history
BAM Statistics, Feature Counting and Annotation
MetaWRAP - a flexible pipeline for genome-resolved metagenomic data analysis
use the cigar string in the PAF to correct target and query endpoint issues
Panacus is a tool for computing statistics for GFA-formatted pangenome graphs
[ICLR 2024] DNABERT-2: Efficient Foundation Model and Benchmark for Multi-Species Genome
Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification
A rapid phylogeny-based method for accurate community profiling of large-scale metabarcoding datasets
This is the 2009 version of TestU01, a software library, implemented in the ANSI C language, and offering a collection of utilities for the empirical statistical testing of uniform random number ge…
The code from NIST SP-800-22 for testing random-number generators, along with docs for reference
DNABERT: pre-trained Bidirectional Encoder Representations from Transformers model for DNA-language in genome
Unix, R and python tools for genomics and data science
📠 Pretty and fast csv viewer for cli with cjk/emoji support.