Skip to content
View pudo's full-sized avatar

Organizations

@bundestag @pdfminer @opensanctions

Block or report pudo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

374 results for source starred repositories
Clear filter

Task tracking for the crawlers we're working on

6 Updated Jan 26, 2024

How can we improve name matching in screening tools?

Jupyter Notebook 11 Updated Apr 17, 2024

A super-fast lookup service for canonical names

Python 5 Updated Apr 2, 2024

Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends

Python 1,297 147 Updated Oct 4, 2024

Data cleaning and validation functions for names, languages, identifiers, etc.

Python 8 3 Updated Aug 26, 2024

The Startup CTO's Handbook, a book covering leadership, management and technical topics for leaders of software engineering teams

10,145 489 Updated May 5, 2024

A library that provides an embeddable, persistent key-value store for fast storage.

C++ 28,387 6,293 Updated Oct 4, 2024

LevelDB is a fast key-value storage library written at Google that provides an ordered mapping from string keys to string values.

C++ 36,303 7,802 Updated Aug 23, 2024

Bootstrap components built with React

TypeScript 22,373 3,589 Updated Oct 4, 2024

Rapid fuzzy string matching in Python using various string metrics

C++ 2,653 118 Updated Sep 23, 2024

Validate National ID Numbers

Python 5 Updated Oct 27, 2022

Column store implementation for ftm data based on clickhouse

Python 4 Updated Oct 2, 2024

Mini-metadata format for media content exchange

Python 7 Updated Jan 25, 2023

A collection to manage resources on Hetzner Cloud

Python 105 37 Updated Sep 23, 2024

Main code for a work space localized in Berlin

SCSS 5 2 Updated May 30, 2024

PyPi module for Graphlet AI Knowledge Graph Factory

Python 27 1 Updated Apr 1, 2023

This is a converter to FTM for zakupki.gov.ru leaked data

Python 1 Updated Aug 4, 2022

Russian companies registry

Python 7 Updated Nov 7, 2022

A curated list of threat modeling resources (Books, courses - free and paid, videos, tools, tutorials and workshops to practice on ) for learning Threat modeling and initial phases of security review.

Dockerfile 1,380 252 Updated Aug 2, 2024

Memray is a memory profiler for Python

Python 13,193 394 Updated Oct 4, 2024

Guidance for BODS schema development and related things

Ruby 4 1 Updated Jul 18, 2024

Platform for journalists to search, analyse, categorise and share unstructured data

Scala 53 3 Updated Oct 4, 2024

Map Open Sanctions into Senzing format.

Python 4 1 Updated Aug 22, 2024

A dataset with political datasets

R 621 87 Updated Sep 7, 2024

Loading OpenSanctions into Neo4J and Linkurious

Python 26 6 Updated Feb 5, 2024

Fast lookup server for NSRL and other hash database used in digital forensic

Python 41 7 Updated Jun 16, 2022

Import synonames (multilingual variants of first names from Wikidata) to Solr managed synonyms graph

Python 6 1 Updated Oct 4, 2020

The Toolkit API, app, and browser extension. Start preserving now.

TypeScript 45 4 Updated Oct 1, 2024

API for OpenSanctions with support for entity search and bulk matching of data collections. Supports Reconciliation API spec.

Python 67 27 Updated Oct 4, 2024

An open database of international sanctions data, persons of interest and politically exposed persons

HTML 484 116 Updated Oct 4, 2024
Next