GHOSTMACHINE: The NSA's cloud analytics platform
What is HADOOP?
An open-source software framework for storage and large-scale data processing originally derived from academic papers by Google. This system underlays many of the largest data-processing projects in the world today, including the NSA.
TUSKATTIRE
This is the NSA’s system for cleaning and processing call-related data (DNR or Dialed Number Recognition).
FASCIA limit
The top line indicates the current limit of the amount of data the NSA's FASCIA location database can ingest — about 5 billion records per day.
JUGGERNAUT
The NSA’s signal-processing system for ingesting telephony information, including SS7 signaling — a technical term for the method by which cell-phone networks communicate with each other.
How much data does FASCIA collect?
This indicates that more than 27 terrabytes of location data was collected by the database over a period of about seven months.
OPC/DPC pairs
These refer to the originating and destination points that typically transfer traffic from one provider’s internal network to another’s.
STORMBREW
The code name for a sigad, or signal-collection location, that collects data from 27 OPC/DPC pairs.
FAIRVIEW
This is a sigad that collects data from 860 OPC/DPC pairs.
> https:// www.washingtonpost.com/apps/g/page/world/ghostmachine-the-nsas-cloud-analytics-platform/644/#document/p4/a135362