You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/README.md
-2Lines changed: 0 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -8,15 +8,13 @@ The Sendit application is an on demand application that works in two stages to o
8
8
- anonymization is performed (status "PROCESSING"), meaning removing/replacing fields in the header and image data, .
9
9
- when status "DONEPROCESSING" is achieved for all in the queue, the researcher triggers the final job to send data to storage (status "SENT")
10
10
11
-
12
11
## Preparation
13
12
The base of the image is distributed via [sendit-base](scripts/docker/README.md). This image has all dependencies for the base so we can easily bring the image up and down.
14
13
15
14
## Deployment
16
15
17
16
-[Setup](setup.md): Basic setup (download and install) of a new application for a server.
18
17
-[Configuration](config.md): How to configure the application before starting it up.
19
-
-[Application](application.md): Details about the application infrastructure.
20
18
-[Start](start.md): Start it up!
21
19
-[Interface](interface.md): A simple web interface for monitoring batches.
Copy file name to clipboardExpand all lines: docs/application.md
+16Lines changed: 16 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,11 +1,27 @@
1
1
# Application
2
+
3
+
## Overview
4
+
Sendit generally works as follows:
5
+
6
+
1. datasets are mapped to the application container `/data` folder, and are processed in batches. Each folder associated with an accession number is considered a batch.
7
+
2. Header data is extracted for cleaning using the [deid](https://www.github.com/pydicom/deid) module. Sendit stores key/values for header data.
8
+
3. Sendit uses the [stanford open modules](https://www.github.com/vsoch/som) to interact with the Stanford (DASHER) API and retrieve anonymous ids for the data.
9
+
4. The data structure with headers is updated with the identifiers from DASHER, and the images are anonymized using this data structure per a specific customization defined in a deid recipe.
10
+
5. The final images and metadata are uploaded to Google Cloud again using the [stanford open modules](https://www.github.com/vsoch/som).
11
+
12
+
From the above, you can see that sendit is akin to a glue to hold several APIs and customizations together.
13
+
14
+
## Sendit Containers
15
+
2
16
This application lives in a docker-compose orchestration of images running on `STRIDE-HL71`. This application has the following components (each a Docker image):
3
17
4
18
-**uwsgi**: is the main python application with Django (python)
19
+
-**postgres**: is a postgres image database
5
20
-**nginx**: is a web server to make a status web interface for Research IT
6
21
-**worker**: is the same image as uwsgi, but configured to run a distributed job queue called [celery](http://www.celeryproject.org/).
7
22
-**redis**: is the database used by the worker, with serialization in json.
0 commit comments