<!doctype html><script>window.REMOTE=1</script><script src="https://archive.org/~tracey/slides/eveal.js/eveal.js"></script><title>Archive TV and Captions, textAV</title>
by [traceypooh](https://twitter.com/tracey_pooh)

git clone https://github.com/traceypooh/textAV; open textAV/index.html
- recording 50 - 100 channels
- 24 x 7
- around the world
- since 2000
- 2 million news shows
- search captions
- reading the "lower thirds"
- compare networks
- editorial?
- angle?
- http://archive.org/~tracey/tv/comey.htm
- http://archive.org/~tracey/tv/sessions.htm
- crop third every second
- tesseract (OCR)
- simhash
- similarity hash
- phrases nearly equal?
- grouping ~repeated instances
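
The simhash grouping step above can be sketched roughly as follows. This is a minimal illustration, not the code the Archive runs: the 32-bit FNV-1a token hash, the word-level tokenizer, the `maxDistance` threshold of 3, and the sample phrases are all assumptions made for the example.

```javascript
// 32-bit FNV-1a string hash (a stand-in; any well-mixed token hash would do).
function fnv1a(str) {
  let h = 0x811c9dc5;
  for (let i = 0; i < str.length; i++) {
    h ^= str.charCodeAt(i);
    h = Math.imul(h, 0x01000193);
  }
  return h >>> 0;
}

// Simhash: every token votes +1/-1 on each bit; the sign of the total sets the bit.
function simhash(text) {
  const tokens = text.toLowerCase().split(/[^a-z0-9]+/).filter(Boolean);
  const votes = new Array(32).fill(0);
  for (const token of tokens) {
    const h = fnv1a(token);
    for (let bit = 0; bit < 32; bit++) {
      votes[bit] += ((h >>> bit) & 1) ? 1 : -1;
    }
  }
  let out = 0;
  for (let bit = 0; bit < 32; bit++) {
    if (votes[bit] > 0) out |= 1 << bit;
  }
  return out >>> 0;
}

// Number of differing bits between two 32-bit fingerprints.
function hamming(a, b) {
  let x = (a ^ b) >>> 0;
  let n = 0;
  while (x) {
    n += x & 1;
    x >>>= 1;
  }
  return n;
}

// Greedy grouping: a phrase joins the first group whose fingerprint is close enough.
function groupSimilar(phrases, maxDistance = 3) {
  const groups = [];
  for (const phrase of phrases) {
    const hash = simhash(phrase);
    const match = groups.find((g) => hamming(g.hash, hash) <= maxDistance);
    if (match) match.members.push(phrase);
    else groups.push({ hash, members: [phrase] });
  }
  return groups.map((g) => g.members);
}

// Hypothetical OCR output from three consecutive lower-third crops.
const lowerThirds = [
  "BREAKING NEWS: Comey testifies",
  "breaking news   comey testifies",
  "Weather: sunny and mild",
];
console.log(groupSimilar(lowerThirds));
```

Phrases that differ only in case, punctuation, or spacing get identical fingerprints here; OCR noise inside a word changes that token's votes, so the threshold would need tuning against real crops.
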
https://twitter.com/tvThirdEye
- CNN now
- expand to MSNBC, FOXNEWS, BBCNEWS
- launching soon
- ccextractor
- OCR caption glyphs (European DVB subtitles are bitmaps)
- tesseract
- avoid repeated / rolling windows
- compare two images:
- how to cook
- how to cook for humans
- some deduping and simhash
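
One way to avoid repeated / rolling caption text, sketched under assumptions: each OCR pass yields one string, and a reading that merely extends the previous one (as in "how to cook" then "how to cook for humans") should replace it rather than be stored twice. The function name `collapseGrowingCaptions` is hypothetical, and this exact-prefix check is simpler than the simhash comparison mentioned above.

```javascript
// Collapse successive OCR readings of a caption that is still being painted on.
function collapseGrowingCaptions(readings) {
  const out = [];
  for (const raw of readings) {
    const text = raw.trim().replace(/\s+/g, " "); // normalize whitespace
    if (!text) continue; // skip empty OCR results
    const last = out[out.length - 1];
    if (last !== undefined && text.startsWith(last)) {
      out[out.length - 1] = text; // same caption, grown (or repeated): keep the longer form
    } else if (last !== undefined && last.startsWith(text)) {
      continue; // shorter re-read of a caption already captured
    } else {
      out.push(text); // genuinely new caption
    }
  }
  return out;
}

const captionReadings = [
  "how to cook",
  "how to cook for humans",
  "how to cook for humans",
];
console.log(collapseGrowingCaptions(captionReadings));
```

Exact prefix matching breaks as soon as OCR misreads a character, which is where a fuzzy comparison such as simhash would take over.
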
- compare two images:
- coming next week
- Trump administration, too
- allow CC searching subsets
- browsing easier
- find most watched or cited pieces
- little JSON annotations
- arbitrary start/end
- auto expands each clip to a "synthetic" document
- indexed into Elasticsearch
- JSON Patch (RFC 6902) for changes
- track play counts, some referers
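
To illustrate "JSON Patch for changes": a change to one clip's annotations could be expressed as an RFC 6902 patch document. The sketch below assumes annotations live in one JSON object keyed by `"start|end"`. `applyPatch` is a hypothetical helper covering only the `add`, `replace`, and `remove` operations, not a full RFC 6902 implementation and not any particular library's API.

```javascript
// Apply a subset of RFC 6902 (add / replace / remove) to a deep copy of doc.
// Not handled: move, copy, test, and the empty path "" (whole document).
function applyPatch(doc, patch) {
  const result = JSON.parse(JSON.stringify(doc));
  for (const { op, path, value } of patch) {
    // JSON Pointer (RFC 6901): split on "/", then unescape ~1 -> "/" and ~0 -> "~".
    const keys = path
      .split("/")
      .slice(1)
      .map((k) => k.replace(/~1/g, "/").replace(/~0/g, "~"));
    const last = keys.pop();
    let parent = result;
    for (const k of keys) parent = parent[k];

    if (Array.isArray(parent)) {
      const i = last === "-" ? parent.length : Number(last); // "-" means end of array
      if (op === "add") parent.splice(i, 0, value);
      else if (op === "replace") parent[i] = value;
      else if (op === "remove") parent.splice(i, 1);
      else throw new Error(`unsupported op: ${op}`);
    } else {
      if (op === "add" || op === "replace") parent[last] = value;
      else if (op === "remove") delete parent[last];
      else throw new Error(`unsupported op: ${op}`);
    }
  }
  return result;
}

const clipAnnotations = { "268.1|269.1": { subject: ["Criminal Activity"] } };
const clipPatch = [
  { op: "add", path: "/268.1|269.1/subject/-", value: "Crime" },
  { op: "add", path: "/268.1|269.1/collection", value: ["nancy_pelosi_archive"] },
];
console.log(JSON.stringify(applyPatch(clipAnnotations, clipPatch), null, 2));
```

Storing the patches rather than whole documents keeps a replayable history of who changed which clip.
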
{
"268.1|269.1": {
"subject": [
"Criminal Activity",
"Crime"
],
"factcheck": [
"http://www.factcheck.org/2016/07/factchecking-trumps-big-speech/"
]
},
"266.7|267.2": {
"ad_id": "PolAd_DonaldTrump_d9dsn",
"type": "campaign",
"race": "PRES",
"cycle": "2016",
"message": "pro",
"sponsor": [
"Republican National Cmte"
],
"sponsor_type": "PAC",
"subject": [
"Job Accomplishments"
],
"person": [
"Donald Trump"
]
},
"268.1|269.1": {
"collection": [
"nancy_pelosi_archive"
],
"subject": [
"Voting"
]
}
}
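
Expanding each annotated clip into a "synthetic" document might look roughly like the sketch below. It assumes the `"start|end"` keys hold seconds into the show, as in the sample above. The `expandClips` name, the `EXAMPLE_SHOW_ID` identifier, and the shape of the `id` field are hypothetical, chosen only for illustration.

```javascript
// Turn a { "start|end": fields } map into one flat document per clip,
// ready to be sent to a search index.
function expandClips(showId, annotations) {
  return Object.entries(annotations).map(([range, fields]) => {
    const [start, end] = range.split("|").map(Number);
    return {
      id: `${showId}#start/${start}/end/${end}`, // hypothetical id scheme
      show: showId,
      start,
      end,
      ...fields, // subject, person, sponsor, factcheck, ...
    };
  });
}

const sampleAnnotations = {
  "266.7|267.2": {
    ad_id: "PolAd_DonaldTrump_d9dsn",
    type: "campaign",
    person: ["Donald Trump"],
  },
  "268.1|269.1": {
    subject: ["Criminal Activity", "Crime"],
  },
};

for (const clipDoc of expandClips("EXAMPLE_SHOW_ID", sampleAnnotations)) {
  console.log(JSON.stringify(clipDoc));
}
```

Because each clip becomes its own document, a search on `person` or `subject` can return just the annotated seconds instead of the whole show.
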
- popcorn
- https://archive.org/pop
- https://github.com/mozilla/popcorn-editor
- Ted Nelson likes transclusion!
- more realtime experiments
- ES6 JS