jianjun66
diff --git a/ground_truth_labeling_jobs/ground_truth_object_detection_tutorial/object_detection_tutorial.ipynb b/ground_truth_labeling_jobs/ground_truth_object_detection_tutorial/object_detection_tutorial.ipynb
Lines changed: 27 additions & 53 deletions
@@ -47,7 +47,7 @@
  "\n",
  "#### Prerequisites\n",
  "To run this notebook, you can simply execute each cell in order. To understand what's happening, you'll need:\n",
- "* An S3 bucket you can write to -- please provide its name in the following cell. The bucket must be in the same region as this SageMaker Notebook instance. You can also change the `EXP_NAME` to any valid S3 prefix. All the files related to this experiment will be stored in that prefix of your bucket. <mark>IMPORTANT: Your S3 bucket must allow public ACL access. This was the default S3 behavior until 11/2018, but not anymore. To enable public ACL access, [follow these AWS instructions](https://docs.aws.amazon.com/AmazonS3/latest/user-guide/block-public-access-bucket.html) and **unmark** all the checkboxes in Step 5.</mark>\n",
+ "* An S3 bucket you can write to -- please provide its name in the following cell. The bucket must be in the same region as this SageMaker Notebook instance. You can also change the `EXP_NAME` to any valid S3 prefix. All the files related to this experiment will be stored in that prefix of your bucket. \n",
  "* Familiarity with Python and [numpy](http://www.numpy.org/).\n",
  "* Basic familiarity with [AWS S3](https://docs.aws.amazon.com/s3/index.html).\n",
  "* Basic understanding of [AWS Sagemaker](https://aws.amazon.com/sagemaker/).\n",
@@ -69,6 +69,7 @@
  "from collections import Counter\n",
  "from datetime import datetime\n",
  "import itertools\n",
+ "import base64\n",
  "import glob\n",
  "import json\n",
  "import random\n",
@@ -101,21 +102,7 @@
  "region = boto3.session.Session().region_name\n",
  "s3 = boto3.client('s3')\n",
  "bucket_region = s3.head_bucket(Bucket=BUCKET)['ResponseMetadata']['HTTPHeaders']['x-amz-bucket-region']\n",
- "assert bucket_region == region, \"Your S3 bucket {} and this notebook need to be in the same region.\".format(BUCKET)\n",
- "\n",
- "# Test that the bucket allows public-read files.\n",
- "!echo \"test\" > test\n",
- "!aws s3 cp test s3://{BUCKET}/test\n",
- "try:\n",
- " s3.put_object_acl(\n",
- " ACL='public-read',\n",
- " Bucket=BUCKET,\n",
- " Key=f'test')\n",
- "except botocore.exceptions.ClientError:\n",
- " print('\\n\\n!!!!!!!!!! READ THIS !!!!!!!!\\n'\n",
- " 'Your bucket has wrong permissions. Please read the instructions for these cells carefully and adjust your bucket permissions as described.'\n",
- " ' Otherwise, we will be unable to upload an instruction template that is readable by the annotators to your bucket.')\n",
- " raise"
+ "assert bucket_region == region, \"Your S3 bucket {} and this notebook need to be in the same region.\".format(BUCKET)"
  ]
  },
  {
@@ -283,7 +270,7 @@
  "metadata": {},
  "outputs": [],
  "source": [
- "# Plot 6 samples in the given class.\n",
+ "# Plot sample images.\n",
  "def plot_bbs(ax, bbs, img):\n",
  " '''Add bounding boxes to images.'''\n",
  " ax.imshow(img)\n",
@@ -297,31 +284,19 @@
  " rec = plt.Rectangle((xmin, ymin), xmax-xmin, ymax-ymin, fill=None, lw=4, edgecolor='blue')\n",
  " ax.add_patch(rec)\n",
  " \n",
- "plt.figure(facecolor='white', dpi=100, figsize=(3, 9))\n",
+ "plt.figure(facecolor='white', dpi=100, figsize=(3, 7))\n",
  "plt.suptitle('Please draw a box\\n around each {}\\n like the examples below.\\n Thank you!'.format(CLASS_NAME), fontsize=15)\n",
- "for fid_id, (fid, bbs) in enumerate([list(fids2bbs.items())[idx] for idx in [1, 3, 4]]):\n",
+ "for fid_id, (fid, bbs) in enumerate([list(fids2bbs.items())[idx] for idx in [1, 3]]):\n",
  " !aws s3 cp s3://open-images-dataset/test/{fid}.jpg .\n",
  " img = imageio.imread(fid + '.jpg')\n",
  " bbs = [[float(a) for a in annot[1:]] for annot in bbs]\n",
- " ax = plt.subplot(3, 1, fid_id+1)\n",
+ " ax = plt.subplot(2, 1, fid_id+1)\n",
  " plot_bbs(ax, bbs, img)\n",
  " plt.axis('off')\n",
  " \n",
- "plt.savefig('instructions.png', dpi=200)\n",
- "!aws s3 cp instructions.png s3://{BUCKET}/{EXP_NAME}/instructions.png\n",
- "try:\n",
- " s3.put_object_acl(\n",
- " ACL='public-read',\n",
- " Bucket=BUCKET,\n",
- " Key=f'{EXP_NAME}/instructions.png')\n",
- "except botocore.exceptions.ClientError:\n",
- " print('\\n\\n!!!!!!!!!! READ THIS !!!!!!!!\\n'\n",
- " 'Could not make the instructions file public-readable in your S3 bucket. Annotators will not be able to see the instructions.'\n",
- " ' You must change your bucket access settings, as described at the beginning of this notebook (instructions for the first cell), '\n",
- " ' and then rerun this cell before continuing.')\n",
- " assert 1 == 0, 'Please change your bucket permissions'\n",
- "\n",
- "instructions_uri = 'https://s3.{}.amazonaws.com/{}/{}/instructions.png'.format(bucket_region, BUCKET, EXP_NAME)"
+ "plt.savefig('instructions.png', dpi=60)\n",
+ "with open('instructions.png', 'rb') as instructions:\n",
+ " instructions_uri = base64.b64encode(instructions.read()).decode('utf-8').replace('\\n', '')"
  ]
  },
  {
@@ -332,7 +307,6 @@
  "source": [
  "from IPython.core.display import HTML, display\n",
  "\n",
- "TEST_TEMPLATE = True\n",
  "def make_template(test_template=False, save_fname='instructions.template'):\n",
  " template = r\"\"\"<script src=\"https://assets.crowd.aws/crowd-html-elements.js\"></script>\n",
  " <crowd-form>\n",
@@ -358,7 +332,7 @@
  "\n",
  " </full-instructions>\n",
  " <short-instructions>\n",
- " <img src=\"{instructions_uri}\" style=\"max-width:100%\">\n",
+ " <img src=\"data:image/png;base64,{instructions_uri}\" style=\"max-width:100%\">\n",
  " </short-instructions>\n",
  " </crowd-bounding-box>\n",
  " </crowd-form>\n",
@@ -367,8 +341,6 @@
  " labels_str=str(CLASS_LIST) if test_template else '{{ task.input.labels | to_json | escape }}')\n",
  " with open(save_fname, 'w') as f:\n",
  " f.write(template)\n",
- " if test_template is False:\n",
- " print(template)\n",
  "\n",
  " \n",
  "make_template(test_template=True, save_fname='instructions.html')\n",
@@ -398,14 +370,14 @@
  "3. Enter the desired name for your private workteam.\n",
  "4. Select \"Create a new Amazon Cognito user group\" and click \"Create private team.\"\n",
  "5. The AWS Console should now return to `AWS Console > Amazon SageMaker > Labeling workforces`.\n",
- "5. Click on \"Invite new workers\" in the \"Workers\" tab.\n",
- "6. Enter your own email address in the \"Email addresses\" section and click \"Invite new workers.\"\n",
- "7. Click on your newly created team under the \"Private teams\" tab.\n",
- "8. Select the \"Workers\" tab and click \"Add workers to team.\"\n",
- "9. Select your email and click \"Add workers to team.\"\n",
- "10. The AWS Console should again return to `AWS Console > Amazon SageMaker > Labeling workforces`. Your newly created team should be visible under \"Private teams\". Next to it you will see an `ARN` which is a long string that looks like `arn:aws:sagemaker:region-name-123456:workteam/private-crowd/team-name`. Copy this ARN in the cell below.\n",
- "11. You should get an email from `no-reply@verificationemail.com` that contains your workforce username and password. \n",
- "12. In `AWS Console > Amazon SageMaker > Labeling workforces`, click on the URL in `Labeling portal sign-in URL`. Use the email/password combination from Step 11 to log in (you will be asked to create a new, non-default password).\n",
+ "6. Click on \"Invite new workers\" in the \"Workers\" tab.\n",
+ "7. Enter your own email address in the \"Email addresses\" section and click \"Invite new workers.\"\n",
+ "8. Click on your newly created team under the \"Private teams\" tab.\n",
+ "9. Select the \"Workers\" tab and click \"Add workers to team.\"\n",
+ "10. Select your email and click \"Add workers to team.\"\n",
+ "11. The AWS Console should again return to `AWS Console > Amazon SageMaker > Labeling workforces`. Your newly created team should be visible under \"Private teams\". Next to it you will see an `ARN` which is a long string that looks like `arn:aws:sagemaker:region-name-123456:workteam/private-crowd/team-name`. Copy this ARN into the cell below.\n",
+ "12. You should get an email from `no-reply@verificationemail.com` that contains your workforce username and password. \n",
+ "13. In `AWS Console > Amazon SageMaker > Labeling workforces > Private`, click on the URL under `Labeling portal sign-in URL`. Use the email/password combination from the previous step to log in (you will be asked to create a new, non-default password).\n",
  "\n",
  "That's it! This is your private worker's interface. When we create a verification task in [Verify your task using a private team](#Verify-your-task-using-a-private-team-[OPTIONAL]) below, your task should appear in this window. You can invite your colleagues to participate in the labeling job by clicking the \"Invite new workers\" button.\n",
  "\n",
@@ -497,7 +469,7 @@
  " \"PreHumanTaskLambdaArn\": prehuman_arn,\n",
  " \"MaxConcurrentTaskCount\": 200, # 200 images will be sent at a time to the workteam.\n",
  " \"NumberOfHumanWorkersPerDataObject\": 5, # We will obtain and consolidate 5 human annotations for each image.\n",
- " \"TaskAvailabilityLifetimeInSeconds\": 21600, # Your worteam has 6 hours to complete all pending tasks.\n",
+ " \"TaskAvailabilityLifetimeInSeconds\": 21600, # Your workteam has 6 hours to complete all pending tasks.\n",
  " \"TaskDescription\": task_description,\n",
  " \"TaskKeywords\": task_keywords,\n",
  " \"TaskTimeLimitInSeconds\": 300, # Each image must be labeled within 5 minutes.\n",
@@ -1965,7 +1937,7 @@
  "cell_type": "markdown",
  "metadata": {},
  "source": [
- "## Create Endpoint\n",
+ "### Create Endpoint\n",
  "\n",
  "The next cell creates an endpoint that can be validated and incorporated into production applications. This takes about 10 minutes to complete."
  ]
@@ -2005,12 +1977,12 @@
  ]
  },
  {
- "cell_type": "code",
- "execution_count": null,
+ "cell_type": "markdown",
  "metadata": {},
- "outputs": [],
  "source": [
- "print('Endpoint creation ended with EndpointStatus = {}'.format(status))"
+ "### Perform inference\n",
+ "\n",
+ "The following cell transforms the image into the appropriate format for realtime prediction, submits the job, receives the prediction from the endpoint, and plots the result."
  ]
  },
  {
@@ -2044,6 +2016,8 @@
  "cell_type": "markdown",
  "metadata": {},
  "source": [
+ "### Clean up\n",
+ "\n",
  "Finally, let's clean up and delete this endpoint."
  ]
  },