
Gryffin (beta)

Gryffin is a large-scale web security scanning platform. It is not yet another scanner; it was written to solve two specific problems with existing scanners: coverage and scale.

Better coverage translates to fewer false negatives. Inherent scalability translates to the capability of scanning, and supporting, a large elastic application infrastructure. Simply put, it means the ability to go from scanning 1,000 applications today to 100,000 applications tomorrow by straightforward horizontal scaling.

Coverage

Coverage has two dimensions: one during the crawl and the other during fuzzing. In the crawl phase, coverage means finding as much of the application's footprint as possible. In the scan (fuzzing) phase, it means testing each discovered part of the application in depth for the applied set of vulnerabilities.

Crawl Coverage

Today a large number of web applications are template-driven, meaning that the same code or path generates millions of URLs. A security scanner only needs one of the million URLs generated by the same code or path. Gryffin's crawler does just that.

Page Deduplication

At its heart, Gryffin has a deduplication engine that compares each new page with the pages already seen. If the HTML structure of the new page is similar to one already seen, it is classified as a duplicate and is not crawled further.
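
The sketch below illustrates the idea with a simhash-style fingerprint over the page's tag structure. It is an illustrative stand-in, not the actual engine from Gryffin's html-distance package; the tokenizer dependency (golang.org/x/net/html) and the distance threshold are assumptions.

// Structural page deduplication sketch: fingerprint the tag structure of a page
// with a 64-bit simhash, then treat pages whose fingerprints are within a small
// Hamming distance as duplicates of the same template.
package main

import (
	"fmt"
	"hash/fnv"
	"math/bits"
	"strings"

	"golang.org/x/net/html"
)

// fingerprint builds a simhash from the sequence of HTML tag names, ignoring
// text content, so template-driven pages collapse to the same shape.
func fingerprint(page string) uint64 {
	var weights [64]int
	z := html.NewTokenizer(strings.NewReader(page))
	for {
		tt := z.Next()
		if tt == html.ErrorToken {
			break // end of document (or parse error)
		}
		if tt != html.StartTagToken && tt != html.SelfClosingTagToken {
			continue
		}
		name, _ := z.TagName()
		h := fnv.New64a()
		h.Write(name)
		v := h.Sum64()
		for i := 0; i < 64; i++ {
			if v&(1<<uint(i)) != 0 {
				weights[i]++
			} else {
				weights[i]--
			}
		}
	}
	var fp uint64
	for i, w := range weights {
		if w > 0 {
			fp |= 1 << uint(i)
		}
	}
	return fp
}

// similar reports whether two fingerprints are within the given Hamming distance.
func similar(a, b uint64, maxDist int) bool {
	return bits.OnesCount64(a^b) <= maxDist
}

func main() {
	seen := fingerprint(`<html><body><div class="story"><h1>A</h1><p>text one</p></div></body></html>`)
	candidate := fingerprint(`<html><body><div class="story"><h1>B</h1><p>other text</p></div></body></html>`)
	fmt.Println("duplicate?", similar(seen, candidate, 2)) // true: same template, different content
}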

DOM Rendering and Navigation

A large number of applications today are rich applications, heavily driven by client-side JavaScript. To discover links and code paths in such applications, Gryffin's crawler uses PhantomJS for DOM rendering and navigation.
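
The hypothetical sketch below shows one way a Go crawler can shell out to PhantomJS for rendering. render.js is an assumed helper script (not part of Gryffin) that loads the URL, lets client-side JavaScript run, and prints one discovered URL per line.

// Sketch: run PhantomJS to render a page and collect the links produced by its
// client-side JavaScript. Assumes phantomjs is on PATH and render.js exists.
package main

import (
	"bufio"
	"fmt"
	"log"
	"os/exec"
)

func renderedLinks(url string) ([]string, error) {
	cmd := exec.Command("phantomjs", "render.js", url)
	out, err := cmd.StdoutPipe()
	if err != nil {
		return nil, err
	}
	if err := cmd.Start(); err != nil {
		return nil, err
	}
	var links []string
	sc := bufio.NewScanner(out)
	for sc.Scan() {
		links = append(links, sc.Text()) // each line is a URL discovered after DOM rendering
	}
	if err := cmd.Wait(); err != nil {
		return nil, err
	}
	return links, sc.Err()
}

func main() {
	links, err := renderedLinks("https://example.com/")
	if err != nil {
		log.Fatal(err)
	}
	for _, l := range links {
		fmt.Println(l)
	}
}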

Scan Coverage

As Gryffin is a scanning platform and not a scanner, it does not have its own fuzzer modules, even for fuzzing common web vulnerabilities like XSS and SQL Injection.

It is not wise to reinvent the wheel where you do not have to. At production scale at Yahoo, Gryffin uses open-source and custom fuzzers. Some of these custom fuzzers may be open-sourced in the future, and may or may not become part of the Gryffin repository.

For demonstration purposes, Gryffin comes integrated with sqlmap and Arachni. It does not endorse them, or any other scanner, in particular.

The philosophy is to improve scan coverage by fuzzing for just what you need.

Scale

While Gryffin is available as a standalone package, it's primarily built for scale.

Gryffin is built on the publisher-subscriber model. Each component is either a publisher, a subscriber, or both. This allows Gryffin to scale horizontally by simply adding more publisher or subscriber nodes.
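
A minimal sketch of that layout on NSQ (the queue listed in the prerequisites below) is shown here. The topic name, channel name, and message body are illustrative assumptions, not Gryffin's actual wire format.

// Publisher-subscriber sketch on NSQ using github.com/nsqio/go-nsq.
// A crawler node publishes work; fuzzer nodes subscribe and share the load.
package main

import (
	"log"

	nsq "github.com/nsqio/go-nsq"
)

func main() {
	cfg := nsq.NewConfig()

	// Publisher side: a crawler node pushes a URL for other nodes to fuzz.
	producer, err := nsq.NewProducer("127.0.0.1:4150", cfg)
	if err != nil {
		log.Fatal(err)
	}
	if err := producer.Publish("fuzz", []byte(`{"url":"https://example.com/item?id=1"}`)); err != nil {
		log.Fatal(err)
	}

	// Subscriber side: a fuzzer node consumes the same topic. Adding more
	// consumers on the "workers" channel spreads the work horizontally.
	consumer, err := nsq.NewConsumer("fuzz", "workers", cfg)
	if err != nil {
		log.Fatal(err)
	}
	consumer.AddHandler(nsq.HandlerFunc(func(m *nsq.Message) error {
		log.Printf("fuzzing %s", m.Body)
		return nil
	}))
	if err := consumer.ConnectToNSQLookupd("127.0.0.1:4161"); err != nil {
		log.Fatal(err)
	}
	select {} // block; the consumer runs until the process exits
}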

Operating Gryffin

Prerequisites

  1. Go
  2. PhantomJS v2
  3. sqlmap (for fuzzing SQLi)
  4. Arachni (for fuzzing XSS and other web vulnerabilities)
  5. NSQ (see the example startup commands after this list):
    • nsqlookupd running on ports 4160 and 4161
    • nsqd running on ports 4150 and 4151
    • nsqd started with --max-msg-size=5000000
  6. Kibana and Elasticsearch, for dashboarding
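
For a single-host setup, NSQ can be started roughly as follows (ports and the message-size flag taken from the list above; adjust addresses for a distributed deployment):

nsqlookupd
nsqd --lookupd-tcp-address=127.0.0.1:4160 --max-msg-size=5000000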

Installation

go get github.com/yahoo/gryffin/...

Run

Example 1: A site with 1M+ URLs

A typical site with millions of URLs, such as news.yahoo.com, is scanned to show the importance of crawl deduplication. TBD: Link to news/finance scan video

Example 2: A rich app

TBD: Link to Flickr scan video

TODO

  1. Mobile browser user agent
  2. Preconfigured Docker images
  3. Redis for sharing state across machines
  4. Instructions for running Gryffin (distributed or standalone)
  5. Documentation for html-distance
  6. Implement a JSON-serializable cookie jar.
  7. Identify duplicate URL patterns based on the simhash result.

Credits

License

Code licensed under the BSD-style license. See LICENSE file for terms.
