Skip to content
View iNeil77's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report iNeil77

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

Source Code Data Augmentation for Deep Learning: A Survey.

59 1 Updated Jun 15, 2024

Code to create bugged python scripts for OpenAssistant Training, maintained by https://twitter.com/Cyndesama

Jupyter Notebook 21 5 Updated Jul 23, 2023

A simple code complexity analyser without caring about the C/C++ header files or Java imports, supports most of the popular languages.

Python 1,826 248 Updated Jun 24, 2024

BigCodeBench: Benchmarking Code Generation Towards AGI

Python 186 22 Updated Sep 17, 2024

Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.

Python 610 41 Updated Jul 26, 2024

Recipes to train reward model for RLHF.

Python 659 57 Updated Sep 23, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Python 2,063 202 Updated Sep 23, 2024

A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering

671 90 Updated Jul 9, 2024

Train Models Contrastively in Pytorch

Python 508 37 Updated Aug 26, 2024

Document aligner which uses neural technologies to search matches across bilingual documents

Python 7 3 Updated Jun 9, 2022

Data creation, training and eval scripts for the IRCoder paper

Python 9 1 Updated May 31, 2024

pyan is a Python module that performs static analysis of Python code to determine a call dependency graph between functions and methods. This is different from running the code and seeing which fun…

Python 627 124 Updated Oct 3, 2021

Run code inference-only benchmarks quickly using vLLM

Python 7 Updated Sep 15, 2024

A Source Code Tokenizer

Python 13 4 Updated Apr 10, 2024
12 Updated Aug 16, 2023

Repository for "SecurityEval Dataset: Mining Vulnerability Examples to Evaluate Machine Learning-Based Code Generation Techniques" published in MSR4P&S'22.

Python 55 13 Updated Nov 4, 2023

MOSS-RLHF

Python 1,271 96 Updated Mar 3, 2024

evol augment any dataset online

Python 55 7 Updated Aug 3, 2023

[TMLR] A curated list of language modeling researches for code and related datasets.

1,409 97 Updated Sep 21, 2024

Large Language Models Meet NL2Code: A Survey

HTML 35 11 Updated Jul 19, 2024

Python SQL Parser and Transpiler

Python 6,433 657 Updated Sep 23, 2024

⚒️ Tree-sitter custom toolkit for extracting function and class from raw source file

Python 38 6 Updated Jul 1, 2024

Machine Learning Engineering Open Book

Python 11,044 661 Updated Sep 19, 2024

😎 Curated list of awesome things regarding WebAssembly (wasm) ecosystem.

8,761 500 Updated Jun 21, 2024

Control the quality of your labeled data with the Python tools you already know.

Python 211 15 Updated Sep 12, 2024

Collection of important articles to be treated as a textbook

Jupyter Notebook 560 28 Updated Apr 5, 2024

☠️ Ground-truth dataset for vulnerability prediction (known research datasets and data sources included such as NVD, CVE Details and OSV); tools to automatically update the data are provided.

Jupyter Notebook 80 26 Updated Sep 2, 2023

Collected solutions from Google Code Jam programming competition (2008-2020).

59 9 Updated Sep 19, 2024

Problem statements on System Design and Software Architecture as part of Arpit's System Design Masterclass

Python 1,974 421 Updated Dec 8, 2023
Next