Did not do part 2, but completing the commit anyway even if it is of no use.

2024-03-17 22:16:52 -07:00
parent 56c521ae2e
commit 582233502d
1273 changed files with 9072 additions and 0 deletions

Tooling/LICENSE Normal file

@@ -0,0 +1,21 @@
MIT License
Copyright (c) 2024 visa-lab
Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including without limitation the rights
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software, and to permit persons to whom the Software is
furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice shall be included in all
copies or substantial portions of the Software.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
SOFTWARE.

Tooling/README.md Normal file

@@ -0,0 +1,2 @@
# CSE546-Cloud-Computing
This is a public repository to facilitate project development for CSE546 Cloud Computing students.


@@ -0,0 +1,101 @@
Image,Results
test_00,Paul
test_01,Emily
test_02,Bob
test_03,German
test_04,Emily
test_05,Gerry
test_06,Gerry
test_07,Ranil
test_08,Bill
test_09,Wang
test_10,Paul
test_11,Emily
test_12,Bob
test_13,German
test_14,Emily
test_15,Gerry
test_16,Gerry
test_17,Ranil
test_18,Bill
test_19,Wang
test_20,Paul
test_21,Emily
test_22,Bob
test_23,German
test_24,Emily
test_25,Gerry
test_26,Gerry
test_27,Ranil
test_28,Bill
test_29,Wang
test_30,Paul
test_31,Emily
test_32,Bob
test_33,German
test_34,Emily
test_35,Gerry
test_36,Gerry
test_37,Ranil
test_38,Bill
test_39,Wang
test_40,Paul
test_41,Emily
test_42,Bob
test_43,German
test_44,Emily
test_45,Gerry
test_46,Gerry
test_47,Ranil
test_48,Bill
test_49,Wang
test_50,Paul
test_51,Emily
test_52,Bob
test_53,German
test_54,Emily
test_55,Gerry
test_56,Gerry
test_57,Ranil
test_58,Bill
test_59,Wang
test_60,Paul
test_61,Emily
test_62,Bob
test_63,German
test_64,Emily
test_65,Gerry
test_66,Gerry
test_67,Ranil
test_68,Bill
test_69,Wang
test_70,Paul
test_71,Emily
test_72,Bob
test_73,German
test_74,Emily
test_75,Gerry
test_76,Gerry
test_77,Ranil
test_78,Bill
test_79,Wang
test_80,Paul
test_81,Emily
test_82,Bob
test_83,German
test_84,Emily
test_85,Gerry
test_86,Gerry
test_87,Ranil
test_88,Bill
test_89,Wang
test_90,Paul
test_91,Emily
test_92,Bob
test_93,German
test_94,Emily
test_95,Gerry
test_96,Gerry
test_97,Ranil
test_98,Bill
test_99,Wang

File diff suppressed because it is too large.

Tooling/model/Readme.md Normal file

@@ -0,0 +1,19 @@
## Face Recognition Model Repository with Inference Code and Pretrained Weights
### Folder structure:
- facenet_pytorch: This is a repository for Inception Resnet (V1) models in pytorch, pretrained on VGGFace2 and CASIA-Webface.
- face_recognition.py: Model Inference code
- data.pt: Saved model weights
### Prerequisites
- You need to install the PyTorch CPU version to use the code.
```
pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cpu
```
### Sample Output:
```
(cse546) kjha9@en4113732l:~/git/Project-1/part-2/model$ python3 face_recognition.py ../../face_images_1000/test_000.jpg
Paul
(cse546) kjha9@en4113732l:~/git/Project-1/part-2/model$
```

Tooling/model/data.pt Normal file

Binary file not shown.

Tooling/model/face_recognition.py Normal file

@@ -0,0 +1,36 @@
__copyright__ = "Copyright 2024, VISA Lab"
__license__ = "MIT"
import sys
import torch
from PIL import Image
from facenet_pytorch import MTCNN, InceptionResnetV1

mtcnn = MTCNN(image_size=240, margin=0, min_face_size=20)  # MTCNN for face detection
resnet = InceptionResnetV1(pretrained='vggface2').eval()   # Inception Resnet to convert a face image to an embedding

test_image = sys.argv[1]

def face_match(img_path, data_path):  # img_path = photo to identify, data_path = location of data.pt
    # compute the embedding of the given image
    img = Image.open(img_path)
    face, prob = mtcnn(img, return_prob=True)  # cropped face tensor and detection probability
    if face is None:
        raise ValueError(f"no face detected in {img_path}")
    emb = resnet(face.unsqueeze(0)).detach()   # detach so no gradients are tracked
    saved_data = torch.load(data_path)         # load the saved [embeddings, names] pair
    embedding_list = saved_data[0]             # saved embeddings
    name_list = saved_data[1]                  # corresponding names
    dist_list = []                             # distance to each saved embedding; the minimum identifies the person
    for emb_db in embedding_list:
        dist_list.append(torch.dist(emb, emb_db).item())
    idx_min = dist_list.index(min(dist_list))
    return (name_list[idx_min], min(dist_list))

result = face_match(test_image, 'data.pt')
print(result[0])
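The script expects `data.pt` to hold a two-element list, `[embedding_list, name_list]`. A hedged sketch of how such a file could be produced (the reference photo paths and names below are illustrative placeholders, not part of this commit):
```python
import torch
from PIL import Image
from facenet_pytorch import MTCNN, InceptionResnetV1

mtcnn = MTCNN(image_size=240, margin=0, min_face_size=20)
resnet = InceptionResnetV1(pretrained='vggface2').eval()

embedding_list, name_list = [], []
# 'known/paul.jpg' etc. are placeholder reference photos, one per identity
for name, path in [('Paul', 'known/paul.jpg'), ('Emily', 'known/emily.jpg')]:
    face, prob = mtcnn(Image.open(path), return_prob=True)
    if face is not None:
        embedding_list.append(resnet(face.unsqueeze(0)).detach())
        name_list.append(name)

torch.save([embedding_list, name_list], 'data.pt')
```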


@@ -0,0 +1,14 @@
__pycache__
.vscode
.ipynb_checkpoints
runs
build
dist
*.egg-info
*tmp*
.coverage
htmlcov
test.py
.cache
.ipython
.local


@@ -0,0 +1,25 @@
language: python
matrix:
  include:
    - name: "Python 3.7 on Linux"
      python: 3.7
      dist: xenial
    - name: "Python 3.6 on Linux"
      python: 3.6
      dist: xenial
    - name: "Python 3.5 on Linux"
      python: 3.5
      dist: xenial
install: pip3 install -r tests/travis_requirements.txt || pip3 install --user -r tests/travis_requirements.txt
script:
  - python3 --version || python --version
  - echo "import tests.travis_test" > test.py
  - coverage run --source models,examples test.py && coverage report
after_success:
  - codecov


@@ -0,0 +1,21 @@
MIT License
Copyright (c) 2019 Timothy Esler
Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including without limitation the rights
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software, and to permit persons to whom the Software is
furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice shall be included in all
copies or substantial portions of the Software.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
SOFTWARE.


@@ -0,0 +1,246 @@
# Face Recognition Using Pytorch
[![Downloads](https://pepy.tech/badge/facenet-pytorch)](https://pepy.tech/project/facenet-pytorch)
[![Code Coverage](https://img.shields.io/codecov/c/github/timesler/facenet-pytorch.svg)](https://codecov.io/gh/timesler/facenet-pytorch)
| Python | 3.7 | 3.6 | 3.5 |
| :---: | :---: | :---: | :---: |
| Status | [![Build Status](https://travis-ci.com/timesler/facenet-pytorch.svg?branch=master)](https://travis-ci.com/timesler/facenet-pytorch) | [![Build Status](https://travis-ci.com/timesler/facenet-pytorch.svg?branch=master)](https://travis-ci.com/timesler/facenet-pytorch) | [![Build Status](https://travis-ci.com/timesler/facenet-pytorch.svg?branch=master)](https://travis-ci.com/timesler/facenet-pytorch) |
[![xscode](https://img.shields.io/badge/Available%20on-xs%3Acode-blue?style=?style=plastic&logo=appveyor&logo=data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAEAAAABACAMAAACdt4HsAAAAGXRFWHRTb2Z0d2FyZQBBZG9iZSBJbWFnZVJlYWR5ccllPAAAAAZQTFRF////////VXz1bAAAAAJ0Uk5T/wDltzBKAAAAlUlEQVR42uzXSwqAMAwE0Mn9L+3Ggtgkk35QwcnSJo9S+yGwM9DCooCbgn4YrJ4CIPUcQF7/XSBbx2TEz4sAZ2q1RAECBAiYBlCtvwN+KiYAlG7UDGj59MViT9hOwEqAhYCtAsUZvL6I6W8c2wcbd+LIWSCHSTeSAAECngN4xxIDSK9f4B9t377Wd7H5Nt7/Xz8eAgwAvesLRjYYPuUAAAAASUVORK5CYII=)](https://xscode.com/timesler/facenet-pytorch)
This is a repository for Inception Resnet (V1) models in pytorch, pretrained on VGGFace2 and CASIA-Webface.
Pytorch model weights were initialized using parameters ported from David Sandberg's [tensorflow facenet repo](https://github.com/davidsandberg/facenet).
Also included in this repo is an efficient pytorch implementation of MTCNN for face detection prior to inference. These models are also pretrained. To our knowledge, this is the fastest MTCNN implementation available.
## Table of contents
* [Table of contents](#table-of-contents)
* [Quick start](#quick-start)
* [Pretrained models](#pretrained-models)
* [Example notebooks](#example-notebooks)
+ [*Complete detection and recognition pipeline*](#complete-detection-and-recognition-pipeline)
+ [*Face tracking in video streams*](#face-tracking-in-video-streams)
+ [*Finetuning pretrained models with new data*](#finetuning-pretrained-models-with-new-data)
+ [*Guide to MTCNN in facenet-pytorch*](#guide-to-mtcnn-in-facenet-pytorch)
+ [*Performance comparison of face detection packages*](#performance-comparison-of-face-detection-packages)
+ [*The FastMTCNN algorithm*](#the-fastmtcnn-algorithm)
* [Running with docker](#running-with-docker)
* [Use this repo in your own git project](#use-this-repo-in-your-own-git-project)
* [Conversion of parameters from Tensorflow to Pytorch](#conversion-of-parameters-from-tensorflow-to-pytorch)
* [References](#references)
## Quick start
1. Install:
```bash
# With pip:
pip install facenet-pytorch
# or clone this repo, removing the '-' to allow python imports:
git clone https://github.com/timesler/facenet-pytorch.git facenet_pytorch
# or use a docker container (see https://github.com/timesler/docker-jupyter-dl-gpu):
docker run -it --rm timesler/jupyter-dl-gpu pip install facenet-pytorch && ipython
```
1. In python, import facenet-pytorch and instantiate models:
```python
from facenet_pytorch import MTCNN, InceptionResnetV1
# If required, create a face detection pipeline using MTCNN:
mtcnn = MTCNN(image_size=<image_size>, margin=<margin>)
# Create an inception resnet (in eval mode):
resnet = InceptionResnetV1(pretrained='vggface2').eval()
```
1. Process an image:
```python
from PIL import Image
img = Image.open(<image path>)
# Get cropped and prewhitened image tensor
img_cropped = mtcnn(img, save_path=<optional save path>)
# Calculate embedding (unsqueeze to add batch dimension)
img_embedding = resnet(img_cropped.unsqueeze(0))
# Or, if using for VGGFace2 classification
resnet.classify = True
img_probs = resnet(img_cropped.unsqueeze(0))
```
See `help(MTCNN)` and `help(InceptionResnetV1)` for usage and implementation details.
## Pretrained models
See: [models/inception_resnet_v1.py](models/inception_resnet_v1.py)
The following models have been ported to pytorch (with links to download pytorch state_dict's):
|Model name|LFW accuracy (as listed [here](https://github.com/davidsandberg/facenet))|Training dataset|
| :- | :-: | -: |
|[20180408-102900](https://github.com/timesler/facenet-pytorch/releases/download/v2.2.9/20180408-102900-casia-webface.pt) (111MB)|0.9905|CASIA-Webface|
|[20180402-114759](https://github.com/timesler/facenet-pytorch/releases/download/v2.2.9/20180402-114759-vggface2.pt) (107MB)|0.9965|VGGFace2|
There is no need to manually download the pretrained state_dict's; they are downloaded automatically on model instantiation and cached for future use in the torch cache. To use an Inception Resnet (V1) model for facial recognition/identification in pytorch, use:
```python
from facenet_pytorch import InceptionResnetV1
# For a model pretrained on VGGFace2
model = InceptionResnetV1(pretrained='vggface2').eval()
# For a model pretrained on CASIA-Webface
model = InceptionResnetV1(pretrained='casia-webface').eval()
# For an untrained model with 100 classes
model = InceptionResnetV1(num_classes=100).eval()
# For an untrained 1001-class classifier
model = InceptionResnetV1(classify=True, num_classes=1001).eval()
```
Both pretrained models were trained on 160x160 px images, so will perform best if applied to images resized to this shape. For best results, images should also be cropped to the face using MTCNN (see below).
By default, the above models will return 512-dimensional embeddings of images. To enable classification instead, either pass `classify=True` to the model constructor, or you can set the object attribute afterwards with `model.classify = True`. For VGGFace2, the pretrained model will output logit vectors of length 8631, and for CASIA-Webface logit vectors of length 10575.
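As a quick check, the mode switch can be exercised with a random tensor standing in for a cropped face (the shapes shown are for the VGGFace2 model):
```python
import torch
from facenet_pytorch import InceptionResnetV1

resnet = InceptionResnetV1(pretrained='vggface2').eval()
x = torch.randn(1, 3, 160, 160)   # stand-in for a batch of one cropped face

emb = resnet(x)                   # default mode: embeddings
print(emb.shape)                  # torch.Size([1, 512])

resnet.classify = True            # switch to classification mode
logits = resnet(x)
print(logits.shape)               # torch.Size([1, 8631]) for VGGFace2
```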
## Example notebooks
### *Complete detection and recognition pipeline*
Face recognition can be easily applied to raw images by first detecting faces using MTCNN before calculating embedding or probabilities using an Inception Resnet model. The example code at [examples/infer.ipynb](examples/infer.ipynb) provides a complete example pipeline utilizing datasets, dataloaders, and optional GPU processing.
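A minimal sketch of that pipeline without datasets or dataloaders (the image paths are placeholders; note that `mtcnn(img)` returns `None` when no face is found):
```python
import torch
from PIL import Image
from facenet_pytorch import MTCNN, InceptionResnetV1

mtcnn = MTCNN(image_size=160)
resnet = InceptionResnetV1(pretrained='vggface2').eval()

# Detect and crop one face per image ('img1.jpg'/'img2.jpg' are placeholders)
faces = [mtcnn(Image.open(p)) for p in ('img1.jpg', 'img2.jpg')]
embs = resnet(torch.stack(faces)).detach()

# A small distance indicates the same identity
print(torch.dist(embs[0], embs[1]).item())
```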
### *Face tracking in video streams*
MTCNN can be used to build a face tracking system (using the `MTCNN.detect()` method). A full face tracking example can be found at [examples/face_tracking.ipynb](examples/face_tracking.ipynb).
![](examples/tracked.gif)
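A minimal per-frame detection sketch (video I/O via OpenCV is an assumption here, and 'video.mp4' is a placeholder; the linked notebook shows the full tracking example):
```python
import cv2
from PIL import Image
from facenet_pytorch import MTCNN

mtcnn = MTCNN(keep_all=True)                 # keep every detected face
cap = cv2.VideoCapture('video.mp4')          # placeholder path
while True:
    ok, frame = cap.read()
    if not ok:
        break
    img = Image.fromarray(cv2.cvtColor(frame, cv2.COLOR_BGR2RGB))
    boxes, probs = mtcnn.detect(img)         # boxes: [n_faces, 4] as [x1, y1, x2, y2]
    if boxes is not None:
        for x1, y1, x2, y2 in boxes.astype(int):
            cv2.rectangle(frame, (x1, y1), (x2, y2), (0, 255, 0), 2)
    cv2.imshow('faces', frame)
    if cv2.waitKey(1) & 0xFF == ord('q'):
        break
cap.release()
cv2.destroyAllWindows()
```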
### *Finetuning pretrained models with new data*
In most situations, the best way to implement face recognition is to use the pretrained models directly, with either a clustering algorithm or a simple distance metric to determine the identity of a face. However, if finetuning is required (i.e., if you want to select identity based on the model's output logits), an example can be found at [examples/finetune.ipynb](examples/finetune.ipynb).
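A minimal finetuning sketch under stated assumptions ('data/train' is assumed to hold one folder of MTCNN-cropped 160x160 faces per identity; the hyperparameters are illustrative only):
```python
import numpy as np
import torch
from torch import nn, optim
from torch.utils.data import DataLoader
from torchvision import datasets, transforms
from facenet_pytorch import InceptionResnetV1, fixed_image_standardization

trans = transforms.Compose([
    np.float32,                    # PIL -> float32 array, keeping the 0-255 range
    transforms.ToTensor(),
    fixed_image_standardization,   # same normalization used during pretraining
])
dataset = datasets.ImageFolder('data/train', transform=trans)  # assumed layout
loader = DataLoader(dataset, batch_size=32, shuffle=True)

model = InceptionResnetV1(pretrained='vggface2', classify=True,
                          num_classes=len(dataset.classes)).train()
optimizer = optim.Adam(model.parameters(), lr=1e-4)
criterion = nn.CrossEntropyLoss()

for epoch in range(5):
    for x, y in loader:
        optimizer.zero_grad()
        loss = criterion(model(x), y)   # logits vs. integer class labels
        loss.backward()
        optimizer.step()
```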
### *Guide to MTCNN in facenet-pytorch*
This guide demonstrates the functionality of the MTCNN module (a short sketch follows the list below). Topics covered are:
* Basic usage
* Image normalization
* Face margins
* Multiple faces in a single image
* Batched detection
* Bounding boxes and facial landmarks
* Saving face datasets
See the [notebook on kaggle](https://www.kaggle.com/timesler/guide-to-mtcnn-in-facenet-pytorch).
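A short sketch touching a few of the topics above (the image path is a placeholder):
```python
from PIL import Image
from facenet_pytorch import MTCNN

# keep_all=True returns every detected face; margin adds context around crops
mtcnn = MTCNN(image_size=160, margin=20, keep_all=True)
img = Image.open('group_photo.jpg')   # placeholder path

# Detection only: bounding boxes, probabilities, and 5-point facial landmarks
boxes, probs, landmarks = mtcnn.detect(img, landmarks=True)

# Detection plus cropping; save_path also writes the crops to disk
faces = mtcnn(img, save_path='faces.jpg')   # tensor of shape [n_faces, 3, 160, 160]
```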
### *Performance comparison of face detection packages*
This notebook demonstrates the use of three face detection packages:
1. facenet-pytorch
1. mtcnn
1. dlib
Each package is tested for its speed in detecting the faces in a set of 300 images (all frames from one video), with GPU support enabled. Performance is based on Kaggle's P100 notebook kernel. Results are summarized below.
|Package|FPS (1080x1920)|FPS (720x1280)|FPS (540x960)|
|---|---|---|---|
|facenet-pytorch|12.97|20.32|25.50|
|facenet-pytorch (non-batched)|9.75|14.81|19.68|
|dlib|3.80|8.39|14.53|
|mtcnn|3.04|5.70|8.23|
![](examples/performance-comparison.png)
See the [notebook on kaggle](https://www.kaggle.com/timesler/comparison-of-face-detection-packages).
### *The FastMTCNN algorithm*
This algorithm demonstrates how to achieve extremely efficient face detection specifically in videos, by taking advantage of similarities between adjacent frames.
See the [notebook on kaggle](https://www.kaggle.com/timesler/fast-mtcnn-detector-55-fps-at-full-resolution).
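The linked notebook has the full implementation; the core idea reduces to running the detector only on every Nth frame and reusing its boxes in between. A rough sketch, with the class shape and names assumed here rather than taken from the notebook:
```python
from facenet_pytorch import MTCNN

class FastMTCNN:
    """Run full MTCNN detection only every `stride` frames; reuse boxes between."""
    def __init__(self, stride, *args, **kwargs):
        self.stride = stride
        self.mtcnn = MTCNN(*args, **kwargs)
        self.boxes = None

    def __call__(self, frames):          # frames: a list of PIL images
        faces = []
        for i, frame in enumerate(frames):
            if i % self.stride == 0:     # refresh detections periodically
                self.boxes, _ = self.mtcnn.detect(frame)
            if self.boxes is None:       # no face found in the last detection
                continue
            for box in self.boxes:       # crop using the most recent boxes
                faces.append(frame.crop(tuple(int(b) for b in box)))
        return faces
```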
## Running with docker
The package and any of the example notebooks can be run with docker (or nvidia-docker) using:
```bash
docker run --rm -p 8888:8888 \
    -v ./facenet-pytorch:/home/jovyan \
    -v <path to data>:/home/jovyan/data \
    timesler/jupyter-dl-gpu \
    sh -c 'pip install facenet-pytorch && jupyter lab'
```
Navigate to the examples/ directory and run any of the ipython notebooks.
See [timesler/jupyter-dl-gpu](https://github.com/timesler/docker-jupyter-dl-gpu) for docker container details.
## Use this repo in your own git project
To use this code in your own git repo, I recommend first adding this repo as a submodule. Note that the dash ('-') in the repo name should be removed when cloning as a submodule as it will break python when importing:
`git submodule add https://github.com/timesler/facenet-pytorch.git facenet_pytorch`
Alternatively, the code can be installed as a package using pip:
`pip install facenet-pytorch`
## Conversion of parameters from Tensorflow to Pytorch
See: [models/utils/tensorflow2pytorch.py](models/utils/tensorflow2pytorch.py)
Note that this functionality is not needed to use the models in this repo, which depend only on the saved pytorch `state_dict`'s.
Following instantiation of the pytorch model, each layer's weights were loaded from equivalent layers in the pretrained tensorflow models from [davidsandberg/facenet](https://github.com/davidsandberg/facenet).
The equivalence of the outputs from the original tensorflow models and the pytorch-ported models has been tested, and they are identical:
---
`>>> compare_model_outputs(mdl, sess, torch.randn(5, 160, 160, 3).detach())`
```
Passing test data through TF model
tensor([[-0.0142,  0.0615,  0.0057,  ...,  0.0497,  0.0375, -0.0838],
        [-0.0139,  0.0611,  0.0054,  ...,  0.0472,  0.0343, -0.0850],
        [-0.0238,  0.0619,  0.0124,  ...,  0.0598,  0.0334, -0.0852],
        [-0.0089,  0.0548,  0.0032,  ...,  0.0506,  0.0337, -0.0881],
        [-0.0173,  0.0630, -0.0042,  ...,  0.0487,  0.0295, -0.0791]])
Passing test data through PT model
tensor([[-0.0142,  0.0615,  0.0057,  ...,  0.0497,  0.0375, -0.0838],
        [-0.0139,  0.0611,  0.0054,  ...,  0.0472,  0.0343, -0.0850],
        [-0.0238,  0.0619,  0.0124,  ...,  0.0598,  0.0334, -0.0852],
        [-0.0089,  0.0548,  0.0032,  ...,  0.0506,  0.0337, -0.0881],
        [-0.0173,  0.0630, -0.0042,  ...,  0.0487,  0.0295, -0.0791]],
       grad_fn=<DivBackward0>)
Distance 1.2874517096861382e-06
```
---
In order to re-run the conversion of tensorflow parameters into the pytorch model, ensure you clone this repo _with submodules_, as the davidsandberg/facenet repo is included as a submodule and parts of it are required for the conversion.
## References
1. David Sandberg's facenet repo: [https://github.com/davidsandberg/facenet](https://github.com/davidsandberg/facenet)
1. F. Schroff, D. Kalenichenko, J. Philbin. _FaceNet: A Unified Embedding for Face Recognition and Clustering_, arXiv:1503.03832, 2015. [PDF](https://arxiv.org/pdf/1503.03832)
1. Q. Cao, L. Shen, W. Xie, O. M. Parkhi, A. Zisserman. _VGGFace2: A dataset for recognising face across pose and age_, International Conference on Automatic Face and Gesture Recognition, 2018. [PDF](http://www.robots.ox.ac.uk/~vgg/publications/2018/Cao18/cao18.pdf)
1. D. Yi, Z. Lei, S. Liao and S. Z. Li. _CASIAWebface: Learning Face Representation from Scratch_, arXiv:1411.7923, 2014. [PDF](https://arxiv.org/pdf/1411.7923)
1. K. Zhang, Z. Zhang, Z. Li and Y. Qiao. _Joint Face Detection and Alignment Using Multitask Cascaded Convolutional Networks_, IEEE Signal Processing Letters, 2016. [PDF](https://kpzhang93.github.io/MTCNN_face_detection_alignment/paper/spl.pdf)


@@ -0,0 +1,11 @@
from .models.inception_resnet_v1 import InceptionResnetV1
from .models.mtcnn import MTCNN, PNet, RNet, ONet, prewhiten, fixed_image_standardization
from .models.utils.detect_face import extract_face
from .models.utils import training
import warnings
warnings.filterwarnings(
    action="ignore",
    message="This overload of nonzero is deprecated:\n\tnonzero()",
    category=UserWarning
)


@@ -0,0 +1,4 @@
coverage:
  status:
    project: off
    patch: off

Some files were not shown because too many files have changed in this diff.