Security

note

The product described here is in constant evolution, this document was last updated on: 2024-02-20

For context on what this product does what the main components are, and high-level external dependencies, see the Integrates page.

Threat Model

Components

Integrates is intended to be run in the cloud on AWS EC2, behind Cloudflare, the AWS ELB, the AWS EKS Ingress Controller, and within the AWS VPC. The other components that Integrates uses can be accessed over the internet, but they require authentication, for instance, AWS Backup, AWS CloudWatch, AWS DynamoDB, AWS OpenSearch, AWS S3, Compute, and Secrets, which can be accessed using AWS IAM prod_integrates; and Cloudflare, which can be accessed using Secrets.

Identifier	Description
Cloudflare	Domain Name, Firewall, CDN
AWS ELB	Load Balancer
AWS EKS	K8s (Ingress Controller, Deployment Manager)
AWS EC2	Physical Machine Instances
AWS CloudWatch	Logs
AWS DynamoDB	Main Database
AWS OpenSearch	Search Engine
AWS S3	Cloud Blob Storage
AWS VPC	Virtual Private Cloud and Networking
AWS Backup	Backups
Compute	Job Queues and Schedules Manager
Secrets	Credentials and secrets of other services
AWS IAM prod_integrates	`prod_integrates` AWS IAM role

Entry Points

Entry points define the interfaces through which potential attackers can interact with the application or supply it with data.

Identifier	Description	Trust Levels
API	The API is intended to be used by anyone, but some endpoints require an API token or an active session	Anonymous, Authenticated
Front	The FrontEnd is accessible by anyone, but some views require an active session	Anonymous, Authenticated
Retrieves	The Visual Studio Code extension can be installed by anyone, but it requires an API token	Anonymous, Authenticated
Forces	The CI/CD agent can be ran by anyone, but it requires an API token	Anonymous, Authenticated

Exit Points

Exit points might prove useful when attacking Integrates.

Identifier	Description
API	Data and errors returned from the API

Assets

Something an attacker may be interested in:

Identifier	Description	Trust Levels
Cloudflare	Domain Name, Firewall, CDN	Admin
Data	AWS Backup, AWS CloudWatch,AWS DynamoDB, AWS OpenSearch, AWS S3	Authenticated, Admin
Backups	Backups of the previous assets	Admin
Secrets	Credentials and secrets of other services	Admin
AWS IAM prod_integrates	`prod_integrates` IAM role	Admin
Availability	Integrates should be available all the time	Anonymous, Authenticated, Admin

Trust Levels

Access rights that the application recognizes on external entities:

Identifier	Description
Anonymous	Any user on the internet
Authenticated	Any user with either an API token or a valid session
Admin	Some Integrates Developers or and instance of AWS IAM prod_integrates

Data Flow

The Integrates API entry point is only accessible through the Cloudflare component.
The Cloudflare component receives all the traffic from the users and acts as a firewall and CDN.

It routes traffic directed to the Front to the corresponding Blob in the AWS S3 component, and routes traffic directed to the API to the Load Balancer in the AWS ELB component.
The AWS ELB component routes traffic to any healthy instance in the AWS EC2 component.
The AWS EC2 component routes the traffic to the container running the API HTTP server.
The API HTTP server processes the request, and verifies the authentication and authorization of it, changing the trust level to Authenticated if legit, or keeping the trust level as Anonymous if the requested action does not require any access control. Otherwise, the request is rejected and a response is sent back to the user.
The API HTTP server fulfills the request by retrieving or updating the state in the corresponding Data assets, which are accessed using the Secrets component.
Now and then, some Data assets are replicated by the Backups component.

Threat Categorization

Spoofing

An attacker may want to impersonate our users.

Mitigation:
- We don't store login credentials but instead use OAuth2, which means that the security of the login is delegated to the provider that the user chooses, or controlled by the organization the user works for.
- Only OAuth2 accounts with a validated email address are allowed.
An attacker may claim to be who they are not.

Mitigation:
- Unless a cross-site-request-forgery, side-channel attack, or a similar vulnerability exists, an attacker would not be able to impersonate other users.
  
  This is guaranteed because all users are given an authentication token after they authenticate successfully in the application, which contains (in encrypted form) the user's identity.
An Attacker may use an expired authentication token to gather information about the user.

Mitigation:
- The session token payload is encrypted.
An attacker may use a left-over session of a user to impersonate that user.

Mitigation:
- There is a limit after which the user's session expires.
- No concurrent sessions are allowed
- API tokens can be revoked.

Tampering

An attacker may tamper with an authentication token (session token or API token).

Mitigation:
- Authentication tokens use signed JWT (JSON Web Tokens), and the signature is validated by the server before trusting its contents.
An attacker may submit invalid data to the API.

Mitigation:
- All input validation controls happen server-side.
- The protocol we use for the API is GraphQL, which enforces basic types (boolean, integer, date, ...) and structure on the incoming requests.
- Some fields are added special validations using regular expressions, and so on.
- Communication with the Database is done using the appropriate libraries for the task, which handle the specific Database language (escaping, data types, etc) correctly.
Man in the Middle.

Mitigation:
- Only secure communication protocols are accepted by the API server.
- The Strict-Transport-Security response header is set.
An attacker may tamper with the data anyway.

Mitigation:
- We have backups for the most important tables in the Database.

Repudiation

An attacker may repudiate who they are.

Mitigation:
- Requests to the API require an authentication token, which includes the user identity and can only be obtained by authenticating first. Therefore, there is a strong claim that the bearer of the authentication token is the identity it claims to be.
An attacker may perform destructive actions anyway.

Mitigation:
- Logs are sent to AWS Cloudwatch identifying who did what and when.
- In many cases, the "Historic State" is stored in the Database, allowing retrieval of every modification the data has had over time.

Information disclosure

An attacker may use data from a breach.

Mitigation:
- Secrets are rotated every two weeks.
- Information is encrypted at rest.
- Secrets are stored in the database using proper cryptography mechanisms.
An attacker may access the information they would normally not have access to.

Mitigation:
- Apart from the authentication system, the authorization system forbids access unless explicitly granted.

Denial of service

Symmetric Denial of Service.

Mitigation:
- We use Cloudflare as a proxy, CDN, and firewall, and have configured a rate limit per IP.
Asymmetric Denial of Service.

Mitigation:
- Pagination is implemented for all endpoints that can return high amounts of information.
- The backend evaluates the depth and breadth of provided GraphQL queries and refuses to evaluate them if they are too deep or too wide.
- Upon failure of some API server instances, Kubernetes can heal the cluster by spinning new instances.
- Upon increased demand, Kubernetes can spin new instances up to a max capacity.
- We use monitoring solutions to inspect in real-time the slow queries happening in the application, and proactively optimize them before an attacker can exploit them in practice.

Elevation of privilege

An attacker may want to elevate their current role within the application.

Mitigation:
- The API functions in the source code are easy to annotate with a "decorator", which is a small line of code that enforces the authorization automatically before executing any logic.
- The access control of every role is centralized and as-code, and can only be changed by changing the source code and redeploying the application, so it's easy to audit changes that are made to it.
- Who has which role happens dynamically through being granted access by another higher-ranked role. Care is taken to avoid an elevation of privileges by computing permissions matrices, where it's easy to see which role can do what.
- We practice minimum privilege where it makes sense.

Lateral movement

An attacker may use their current privileges in one system to gain access to another system.

Mitigation:
- The API server has different layers of access control, allowing a logical division of the data by customer.

Design Principles

Least Privilege

Integrates has many roles so that every identity can be granted one with the minimum amount of permissions.

Fail-Safe Defaults

The application denies access by default to all non-corporate emails.

Corporate emails can login but do not have access to anything unless someone else with the right privileges grants it.

Economy of Mechanism

Complete Mediation

Authentication tokens only carry the identity of the user, and authorization happens many times within one request every time a function is executed.

Open Design

Integrates is Free and Open Source Software, anyone can read its source code and understand the internals.
The technical and user documentation is also public.

Separation of Privilege

Roles are per user, group, and organization, allowing maximum segregation.

Least Common Mechanism

Almost all storage mechanisms are layered by organization, group, or user, and therefore locating them requires specifying this information, which creates a logical barrier.

Psychological Acceptability

The platform lets users know that they should use a work email in order to get access.
API technical errors are not displayed.

Threat Model​

Components​

Entry Points​

Exit Points​

Assets​

Trust Levels​

Data Flow​

Threat Categorization​

Spoofing​

Tampering​

Repudiation​

Information disclosure​

Denial of service​

Elevation of privilege​

Lateral movement​

Design Principles​

Least Privilege​

Fail-Safe Defaults​

Economy of Mechanism​

Complete Mediation​

Open Design​

Separation of Privilege​

Least Common Mechanism​

Psychological Acceptability​