Difference between revisions of "Facial Recognition"
m (→Emotion Recognition) |
|||
(18 intermediate revisions by the same user not shown) | |||
Line 2: | Line 2: | ||
|title=PRIMO.ai | |title=PRIMO.ai | ||
|titlemode=append | |titlemode=append | ||
− | |keywords=artificial, intelligence, machine, learning, models | + | |keywords=ChatGPT, artificial, intelligence, machine, learning, GPT-4, GPT-5, NLP, NLG, NLC, NLU, models, data, singularity, moonshot, Sentience, AGI, Emergence, Moonshot, Explainable, TensorFlow, Google, Nvidia, Microsoft, Azure, Amazon, AWS, Hugging Face, OpenAI, Tensorflow, OpenAI, Google, Nvidia, Microsoft, Azure, Amazon, AWS, Meta, LLM, metaverse, assistants, agents, digital twin, IoT, Transhumanism, Immersive Reality, Generative AI, Conversational AI, Perplexity, Bing, You, Bard, Ernie, prompt Engineering LangChain, Video/Image, Vision, End-to-End Speech, Synthesize Speech, Speech Recognition, Stanford, MIT |description=Helpful resources for your journey with artificial intelligence; videos, articles, techniques, courses, profiles, and tools |
− | |description=Helpful resources for your journey with artificial intelligence; videos, articles, techniques, courses, profiles, and tools | + | |
+ | <!-- Google tag (gtag.js) --> | ||
+ | <script async src="https://www.googletagmanager.com/gtag/js?id=G-4GCWLBVJ7T"></script> | ||
+ | <script> | ||
+ | window.dataLayer = window.dataLayer || []; | ||
+ | function gtag(){dataLayer.push(arguments);} | ||
+ | gtag('js', new Date()); | ||
+ | |||
+ | gtag('config', 'G-4GCWLBVJ7T'); | ||
+ | </script> | ||
}} | }} | ||
− | [ | + | [https://www.youtube.com/results?search_query=facial+face+recognition+deep+machine+learning+ML Youtube search...] |
− | [ | + | [https://www.google.com/search?q=facial+face+recognition+deep+machine+learning+ML ...Google search] |
− | * [[Screening; Passenger, Luggage, & Cargo]] | + | * [[Cybersecurity]] ... [[Open-Source Intelligence - OSINT |OSINT]] ... [[Cybersecurity Frameworks, Architectures & Roadmaps | Frameworks]] ... [[Cybersecurity References|References]] ... [[Offense - Adversarial Threats/Attacks| Offense]] ... [[National Institute of Standards and Technology (NIST)|NIST]] ... [[U.S. Department of Homeland Security (DHS)| DHS]] ... [[Screening; Passenger, Luggage, & Cargo|Screening]] ... [[Law Enforcement]] ... [[Government Services|Government]] ... [[Defense]] ... [[Joint Capabilities Integration and Development System (JCIDS)#Cybersecurity & Acquisition Lifecycle Integration| Lifecycle Integration]] ... [[Cybersecurity Companies/Products|Products]] ... [[Cybersecurity: Evaluating & Selling|Evaluating]] |
− | * [ | + | * [https://www.nist.gov/programs-projects/face-projects Face | National Institute of Standards and Technology (NIST)] |
− | * [ | + | * [https://arxiv.org/ftp/arxiv/papers/1907/1907.12739.pdf Deep Learning For Face Recognition: A Critical Analysis | Andrew Jason Shepley] |
− | * [ | + | * [https://becominghuman.ai/facial-recognition-and-ai-latest-developments-and-future-directions-39d22201d88b Facial Recognition And AI: Latest Developments And Future Directions | Oleksii Kharkovyna - Being Human - Medium] |
* [[Rekognition]] | * [[Rekognition]] | ||
− | * [ | + | * [https://arxiv.org/pdf/1904.09523.pdf Neural Architecture Search for Deep Face Recognition | Ning Zhu] |
− | * [ | + | * [https://arxiv.org/pdf/1804.06655.pdf Deep Face Recognition: A Survey | Mei Wang, Weihong Deng - School of Information and Communication Engineering, Beijing University of Posts and Telecommunications, Beijing,] [[Government Services#China|China]] |
− | * [ | + | * [https://www.aclu.org/blog/privacy-technology/surveillance-technologies/amazons-face-recognition-falsely-matched-28 Amazon’s Face Recognition Falsely Matched 28 Members of Congress With Mugshots | Jacob Snow - ACLU] |
* [[Python#OpenCV| OpenCV]] Open Computer Vision | * [[Python#OpenCV| OpenCV]] Open Computer Vision | ||
− | * [ | + | * [https://www.banfacialrecognition.com/map/ Interactive map shows where facial recognition surveillance is happening | Fightforthefuture.org] |
− | * [ | + | * [https://en.wikipedia.org/wiki/Facial_recognition_system Facial recognition system | Wikipedia] |
− | * [ | + | * [https://www.nextgov.com/emerging-tech/2020/06/ice-outlines-how-investigators-rely-third-party-facial-recognition-services/165846/ ICE Outlines How Investigators Rely on Third-Party Facial Recognition Services | Aaron Boyd - Nextgov] |
− | Facial recognition is a biometric software application capable of uniquely identifying or verifying a person by comparing and analyzing patterns based on the person’s facial contours. There are different facial recognition techniques in use, such as the generalized matching face detection method and the adaptive regional blend matching method. Most facial recognition systems function based on the different nodal points on a human face. The values measured against the variable associated with points of a person’s face help in uniquely identifying or verifying the person. With this technique, applications can use data captured from faces and can accurately and quickly identify target individuals. Facial recognition techniques are quickly evolving with new approaches such as 3-D modeling, helping to overcome issues with existing techniques. There are many advantages associated with facial recognition. Compared to other biometric techniques, facial recognition is of a non-contact nature. Face images can be captured from a distance and can be analyzed without ever requiring any interaction with the user/person. As a result, no user can successfully imitate another person. Facial recognition can serve as an excellent security measure for time tracking and attendance. Facial recognition is also cheap technology as there is less processing involved, like in other biometric techniques. [ | + | Facial recognition is a biometric software application capable of uniquely identifying or verifying a person by comparing and analyzing patterns based on the person’s facial contours. There are different facial recognition techniques in use, such as the generalized matching face detection method and the adaptive regional blend matching method. Most facial recognition systems function based on the different nodal points on a human face. The values measured against the variable associated with points of a person’s face help in uniquely identifying or verifying the person. With this technique, applications can use data captured from faces and can accurately and quickly identify target individuals. Facial recognition techniques are quickly evolving with new approaches such as 3-D modeling, helping to overcome issues with existing techniques. There are many advantages associated with facial recognition. Compared to other biometric techniques, facial recognition is of a non-contact nature. Face images can be captured from a distance and can be analyzed without ever requiring any interaction with the user/person. As a result, no user can successfully imitate another person. Facial recognition can serve as an excellent security measure for time tracking and attendance. Facial recognition is also cheap technology as there is less processing involved, like in other biometric techniques. [https://medium.com/datadriveninvestor/machine-learning-on-facial-recognition-b3dfba5625a7 Machine Learning on Facial Recognition - Damilola Omoyiwola - Medium] |
− | Face detection is one of the important tasks of object detection. Typically detection is the first stage of pattern recognition and identity authentication. In recent years, deep learning-based algorithms in object detection have grown rapidly. These algorithms can be generally divided into two categories, i.e., two-stage detector like Faster R-CNN and one-stage detector like You Only Look Once (YOLO). Although YOLO and its varieties are not so good as two-stage detectors in terms of accuracy, they outperform the counterparts by a large margin in speed. YOLO performs well when facing normal size objects, but is incapable of detecting small objects. The accuracy decreases notably when dealing with objects that have large-scale changing like faces. Aimed to solve the detection problem of varying face scales, we propose a face detector named YOLO-face based on YOLOv3 to improve the performance for face detection. The present approach includes using anchor boxes more appropriate for face detection and a more precise regression loss function. The improved detector significantly increased accuracy while remaining fast detection speed. Experiments on the WIDER FACE and the FDDB datasets show that our improved algorithm outperforms YOLO and its varieties. [ | + | Face detection is one of the important tasks of object detection. Typically detection is the first stage of pattern recognition and identity authentication. In recent years, deep learning-based algorithms in object detection have grown rapidly. These algorithms can be generally divided into two categories, i.e., two-stage detector like Faster R-CNN and one-stage detector like You Only Look Once (YOLO). Although YOLO and its varieties are not so good as two-stage detectors in terms of accuracy, they outperform the counterparts by a large margin in speed. YOLO performs well when facing normal size objects, but is incapable of detecting small objects. The accuracy decreases notably when dealing with objects that have large-scale changing like faces. Aimed to solve the detection problem of varying face scales, we propose a face detector named YOLO-face based on YOLOv3 to improve the performance for face detection. The present approach includes using anchor boxes more appropriate for face detection and a more precise regression loss function. The improved detector significantly increased accuracy while remaining fast detection speed. Experiments on the WIDER FACE and the FDDB datasets show that our improved algorithm outperforms YOLO and its varieties. [https://link.springer.com/article/10.1007/s00371-020-01831-7#citeas YOLO-face: a real-time face detector | W. Chen, H. Huang, S. Peng, C. Zhou & C. Zhang - The Visual Computer] [https://github.com/sthanhng/yoloface Deep learning based Face detection using the YOLOv3 algorithm | Ayoosh Kathuria - GitHub] ... [https://pjreddie.com/media/files/papers/YOLOv3.pdf YOLOv3: An Incremental Improvement | Joseph Redmon, Ali Farhadi - University of Washington] |
<youtube>535acCxjHCI</youtube> | <youtube>535acCxjHCI</youtube> | ||
Line 39: | Line 48: | ||
<b>Face Recognition @ Scale</b> | <b>Face Recognition @ Scale</b> | ||
− | [ | + | [https://www.youtube.com/results?search_query=facial+recognition+architecture+deep+machine+learning+ML Youtube search...] |
− | [ | + | [https://www.google.com/search?q=facial+recognition+architecture+deep+machine+learning+ML ...Google search] |
− | * [ | + | * [https://openaccess.thecvf.com/content_ICCV_2017_workshops/papers/w27/Xu_High_Performance_Large_ICCV_2017_paper.pdf High Performance Large Scale Face Recognition with Multi-Cognition Softmax and Feature Retrieval | Y. Xu, Y. Cheng, J. Zhao, Z. Wang, L. Xiong, K. Jayashree, H. Tamura, T. Kagaya, S. Pranata, S. Shen, J. Feng, and J. Xing] |
References: | References: | ||
− | * [ | + | * [https://dl.acm.org/ ACM Digital Library | Association for Computing Machinery] |
− | * [ | + | * [https://arxiv.org/ arXiv.org | Cornell University] |
Scaling Training: | Scaling Training: | ||
− | * [[Federated | + | * [[FDecentralized: Federated & Distributed]] Learning |
− | |||
Scaling Evaluation: | Scaling Evaluation: | ||
Line 60: | Line 68: | ||
<hr> | <hr> | ||
− | [ | + | [https://www.scmp.com/news/china/society/article/2115094/china-build-giant-facial-recognition-database-identify-any [[Government Services#China|China]] to build giant facial recognition database to identify any citizen within seconds | Stephen Chen - South [[Government Services#China|China]] Morning Post] |
* High demands for speed and accuracy: | * High demands for speed and accuracy: | ||
Line 70: | Line 78: | ||
* launched by the Ministry of Public Security in 2015 | * launched by the Ministry of Public Security in 2015 | ||
* cloud facilities to connect with data storage and processing centres distributed across the country | * cloud facilities to connect with data storage and processing centres distributed across the country | ||
− | * portrait information of each Chinese citizen (1.3 billion), amounts to 13 terabytes | + | * portrait information of each [[Government Services#China|Chinese]] citizen (1.3 billion), amounts to 13 terabytes |
* the size of the full database with detailed personal information does not exceed 90 terabytes | * the size of the full database with detailed personal information does not exceed 90 terabytes | ||
− | * [ | + | * [https://www.itworldcanada.com/ai/listing/isvision/ Isvision] will use an algorithm developed by [https://en.seetatech.com/ Seetatech Technology Co.], a start-up established by several researchers from the [https://english.ict.cas.cn/ Institute of Computing Technology at the [[Government Services#China|Chinese]] Academy of Sciences] in Beijing |
− | * [ | + | * [https://en.wikipedia.org/wiki/University_of_Electronic_Science_and_Technology_of_China University of Electronic Science and Technology of [[Government Services#China|China]] (UESTC) - Wikipedia] |
− | * [ | + | * [https://www.sciencedirect.com/journal/journal-of-electronic-science-and-technology Journal of Electronic Science and Technology - ScienceDirect] |
<hr> | <hr> | ||
− | faced is a proof of concept that you don’t always need to rely on general purpose trained models in scenarios were these models are an overkill to your problem and performance issues are involved. Don’t overestimate the power of spending time designing custom neural network architectures that are specific to your problem. These specific networks will be a much better solution than the general ones. [ | + | faced is a proof of concept that you don’t always need to rely on general purpose trained models in scenarios were these models are an overkill to your problem and performance issues are involved. Don’t overestimate the power of spending time designing custom neural network architectures that are specific to your problem. These specific networks will be a much better solution than the general ones. [https://towardsdatascience.com/faced-cpu-real-time-face-detection-using-deep-learning-1488681c1602 faced: CPU Real Time face detection using Deep Learning | Ivan Itzcovich - Towards Data Science] |
Haar Cascade [left] vs faced [right] | Haar Cascade [left] vs faced [right] | ||
− | <img src=" | + | <img src="https://miro.medium.com/max/1000/1*bQVE1o-cSwi4EZiFiYBd5A.gif" width="500" height="250"> |
− | <img src=" | + | <img src="https://miro.medium.com/max/1000/1*_K2pmLW2rgQMe8YYd3-SWw.gif" width="500" height="250"> |
<youtube>TTbH0fgsqp4</youtube> | <youtube>TTbH0fgsqp4</youtube> | ||
Line 89: | Line 97: | ||
<b>Liveness Detection</b> | <b>Liveness Detection</b> | ||
− | [ | + | [https://www.youtube.com/results?search_query=Liveness+facial+recognition Youtube search...] |
− | [ | + | [https://www.google.com/search?q=Liveness+facial+recognition ...Google search] |
− | * [ | + | * [https://www.liveness.com/ Biometric Liveness Detection Explained | Liveness.com] |
an AI computer system’s ability to determine that it is interfacing with a physically present human being and not an inanimate spoof artifact. Liveness Detection has become a necessary component of any authentication system that is based on face biometric technology where a trusted human is not supervising the authentication attempt. | an AI computer system’s ability to determine that it is interfacing with a physically present human being and not an inanimate spoof artifact. Liveness Detection has become a necessary component of any authentication system that is based on face biometric technology where a trusted human is not supervising the authentication attempt. | ||
Line 99: | Line 107: | ||
* Face Authentication (1:1 Matching+Liveness), on the other hand, takes User-initiated data collected from a device they do control and confirms that User's identity for their own direct benefit, like, for example, secure account access. | * Face Authentication (1:1 Matching+Liveness), on the other hand, takes User-initiated data collected from a device they do control and confirms that User's identity for their own direct benefit, like, for example, secure account access. | ||
+ | |||
+ | = Emotion Recognition = | ||
+ | |||
+ | [https://www.youtube.com/results?search_query=Emotion+facial+Recognition Youtube search...] | ||
+ | [https://www.google.com/search?q=Emotion+facial+Recognition ...Google search] | ||
+ | |||
+ | * [https://emojify.info/menu Emotion Recognition Sandbox | Nesta] | ||
+ | * [https://emojify.info/resources#resources Resources | Nesta] | ||
+ | * [https://hume.ai/ Hume] ... Capture nuances in expression—subtle facial movements of love or admiration, laughter tinged with awkwardness, sighs of relief—and build custom expression-language models | ||
+ | |||
+ | This project was created by a group of social scientists, citizen scientists, and designers. We want to open up conversations about emotion recognition systems from the science behind the technology to their social impacts--and everything else in between. Our aim is to promote public understanding of these technologies and citizen involvement in their [[development]] and use. We believe that through collective intelligence and sharing perspectives on such important issues, we can empower communities to promote a just and equitable society. | ||
+ | |||
+ | |||
+ | AI is used for emotion recognition through the application of various techniques and technologies. Emotion recognition aims to identify and understand human emotions based on facial expressions, speech patterns, gestures, and other physiological signals. Here are some ways in which AI is employed for emotion recognition: | ||
+ | |||
+ | * <b>Facial Recognition</b>: | ||
+ | ** Facial Expression Analysis: AI algorithms analyze facial features and expressions to detect emotions. Deep learning techniques, such as [[(Deep) Convolutional Neural Network (DCNN/CNN)]], are commonly used for this purpose. | ||
+ | ** Facial Landmark Detection: Algorithms identify key points on the face, such as the position of the eyes, nose, and mouth, to understand facial expressions and infer emotions. | ||
+ | * <b>[[Speech Recognition|Speech Recognition]]</b>: [[Speech Recognition|Voice Analysis]]: AI systems analyze speech patterns, tone, pitch, and other acoustic features to detect emotions in spoken language. [[Natural Language Processing (NLP)]] techniques are often employed for this task. | ||
+ | * <b>Gesture Recognition</b>: Body Language Analysis: AI can be trained to recognize specific gestures, body movements, and postures that are associated with different emotions. | ||
+ | * <b>Biometric Sensors</b>:: Physiological Signals: Some systems use biometric sensors to measure physiological signals such as heart rate, skin conductance, and EEG signals. Changes in these signals can be indicative of emotional states. | ||
+ | * <b>Multimodal Approaches</b>: Combining Multiple Modalities: Emotion recognition systems often integrate information from multiple sources, such as facial expressions, speech, and gestures, to improve accuracy and reliability. | ||
+ | * <b>Machine Learning and Deep Learning</b>: Training Models: AI models, particularly machine learning and deep learning models, are trained on large datasets containing labeled examples of different emotional states. These models learn patterns and features associated with specific emotions. | ||
+ | * <b>Real-Time Applications</b>: Live Interaction Analysis: Some applications use AI to analyze emotions in real-time, enabling adaptive responses in human-computer interaction scenarios, such as virtual assistants responding to user emotions. | ||
+ | * <b>Applications in Various Fields</b>: | ||
+ | ** Customer Service: Emotion recognition is applied in customer service to gauge customer satisfaction and provide personalized responses. | ||
+ | ** Education: AI-based emotion recognition is used in educational technology to adapt teaching methods based on students' emotional states. | ||
+ | ** Healthcare: Emotion recognition has applications in mental health monitoring and assisting individuals with conditions like autism or depression. | ||
+ | |||
+ | |||
+ | <i>Emotion dimensions</i>: | ||
+ | |||
+ | * <b>Valence</b>: This slider captures the positive or negative aspect of emotions. Users can slide towards positive for emotions like happiness and contentment or towards negative for emotions like sadness and frustration. | ||
+ | |||
+ | * <b>Arousal</b>: Arousal reflects the intensity or activation level of emotions. Users can adjust the slider to express how calm or excited they feel. Low arousal might represent calmness, while high arousal could signify excitement or agitation. | ||
+ | |||
+ | * <b>Intensity</b>: This slider measures the overall strength or magnitude of the emotion. Users can adjust it to convey whether their emotional experience is more subtle and mild or strong and intense. | ||
+ | |||
+ | * <b>Dominance</b>: Dominance indicates the perceived level of control or influence in the emotional state. A slider can help users express whether they feel more submissive or dominant in a given emotional moment. | ||
+ | |||
+ | * <b>Surprise</b>: The surprise slider allows users to convey the unexpectedness of their emotional experience. It ranges from no surprise to extreme surprise, capturing the element of novelty in the emotional response. | ||
+ | |||
+ | * <b>Sadness</b>: This slider gauges the depth of sadness. Users can slide towards "None" for emotions with minimal sadness and towards "Overwhelming" for emotions associated with profound sadness. | ||
+ | |||
+ | * <b>Anger</b>: The anger slider enables users to express the intensity of their anger. It spans from no anger to extreme fury, providing a spectrum for users to convey the strength of their emotional response. | ||
+ | |||
+ | * <b>Joy/Happiness</b>: Users can use this slider to indicate the level of happiness or joy they are experiencing. It ranges from no joy to an ecstatic level of joy, helping to capture the range of positive emotional experiences. | ||
+ | |||
+ | * <b>Fear</b>: Fear intensity is expressed through this slider. Users can convey whether they feel a slight unease or are experiencing a state of extreme terror or fear. | ||
+ | |||
+ | * <b>Disgust</b>: This slider measures the intensity of disgust. Users can slide towards "None" for minimal disgust or towards "Repulsed" for emotions associated with strong feelings of disgust. | ||
+ | |||
+ | * <b>Interest/Engagement</b>: Users can adjust this slider to express their level of interest or engagement. It ranges from being uninterested or disengaged to fully engaged or absorbed in the experience. | ||
+ | |||
+ | * <b>Confusion</b>: The confusion slider reflects the clarity of the emotional experience. Users can slide towards "Clear" if they feel no confusion or towards "Confused" to indicate a high level of confusion associated with the emotion. | ||
+ | |||
+ | |||
+ | <youtube>5ALNG6fm4XQ</youtube> | ||
+ | <youtube>QZQwKa1sBlY</youtube> | ||
= Pets = | = Pets = | ||
− | * [ | + | * [https://delawarestatenews.net/news/spca-adds-facial-recognition-technology-to-reunite-lost-pets SPCA adds facial recognition technology to reunite lost pets - Delaware State News] |
− | * [ | + | * [https://findingrover.com/ Finding Rover | Finding Rover, Inc.] |
− | + | https://alloveralbany.com/assets_c/2015/11/finding_rover_app_screenshots-thumb-525x302-21174.jpg | |
<youtube>IrIrT_y4PlE</youtube> | <youtube>IrIrT_y4PlE</youtube> | ||
<youtube>PRbJGWuH_1o</youtube> | <youtube>PRbJGWuH_1o</youtube> | ||
<youtube>cHui0xSos-k</youtube> | <youtube>cHui0xSos-k</youtube> | ||
− | <youtube> | + | <youtube>89Aq03rjg7U</youtube> |
== Pet Detection == | == Pet Detection == |
Latest revision as of 22:00, 14 December 2023
Youtube search... ...Google search
- Cybersecurity ... OSINT ... Frameworks ... References ... Offense ... NIST ... DHS ... Screening ... Law Enforcement ... Government ... Defense ... Lifecycle Integration ... Products ... Evaluating
- Face | National Institute of Standards and Technology (NIST)
- Deep Learning For Face Recognition: A Critical Analysis | Andrew Jason Shepley
- Facial Recognition And AI: Latest Developments And Future Directions | Oleksii Kharkovyna - Being Human - Medium
- Rekognition
- Neural Architecture Search for Deep Face Recognition | Ning Zhu
- Deep Face Recognition: A Survey | Mei Wang, Weihong Deng - School of Information and Communication Engineering, Beijing University of Posts and Telecommunications, Beijing, China
- Amazon’s Face Recognition Falsely Matched 28 Members of Congress With Mugshots | Jacob Snow - ACLU
- OpenCV Open Computer Vision
- Interactive map shows where facial recognition surveillance is happening | Fightforthefuture.org
- Facial recognition system | Wikipedia
- ICE Outlines How Investigators Rely on Third-Party Facial Recognition Services | Aaron Boyd - Nextgov
Facial recognition is a biometric software application capable of uniquely identifying or verifying a person by comparing and analyzing patterns based on the person’s facial contours. There are different facial recognition techniques in use, such as the generalized matching face detection method and the adaptive regional blend matching method. Most facial recognition systems function based on the different nodal points on a human face. The values measured against the variable associated with points of a person’s face help in uniquely identifying or verifying the person. With this technique, applications can use data captured from faces and can accurately and quickly identify target individuals. Facial recognition techniques are quickly evolving with new approaches such as 3-D modeling, helping to overcome issues with existing techniques. There are many advantages associated with facial recognition. Compared to other biometric techniques, facial recognition is of a non-contact nature. Face images can be captured from a distance and can be analyzed without ever requiring any interaction with the user/person. As a result, no user can successfully imitate another person. Facial recognition can serve as an excellent security measure for time tracking and attendance. Facial recognition is also cheap technology as there is less processing involved, like in other biometric techniques. Machine Learning on Facial Recognition - Damilola Omoyiwola - Medium
Face detection is one of the important tasks of object detection. Typically detection is the first stage of pattern recognition and identity authentication. In recent years, deep learning-based algorithms in object detection have grown rapidly. These algorithms can be generally divided into two categories, i.e., two-stage detector like Faster R-CNN and one-stage detector like You Only Look Once (YOLO). Although YOLO and its varieties are not so good as two-stage detectors in terms of accuracy, they outperform the counterparts by a large margin in speed. YOLO performs well when facing normal size objects, but is incapable of detecting small objects. The accuracy decreases notably when dealing with objects that have large-scale changing like faces. Aimed to solve the detection problem of varying face scales, we propose a face detector named YOLO-face based on YOLOv3 to improve the performance for face detection. The present approach includes using anchor boxes more appropriate for face detection and a more precise regression loss function. The improved detector significantly increased accuracy while remaining fast detection speed. Experiments on the WIDER FACE and the FDDB datasets show that our improved algorithm outperforms YOLO and its varieties. YOLO-face: a real-time face detector | W. Chen, H. Huang, S. Peng, C. Zhou & C. Zhang - The Visual Computer Deep learning based Face detection using the YOLOv3 algorithm | Ayoosh Kathuria - GitHub ... YOLOv3: An Incremental Improvement | Joseph Redmon, Ali Farhadi - University of Washington
Ethnus Codemithra Masterclass
Face Recognition @ Scale
Youtube search... ...Google search
References:
Scaling Training:
Scaling Evaluation:
- Shared nothing architecture
- Neural network/classifier rarely change
- Load balancing pattern
- Partitioning data if needed
China to build giant facial recognition database to identify any citizen within seconds | Stephen Chen - South China Morning Post
- High demands for speed and accuracy:
- identify any one of its 1.3 billion citizens within three seconds
- someone’s face to their ID photo with about 90 per cent accuracy
- accuracy of the photo that most closely matched the face being searched for was below 60 per cent.
- with the top 20 matches the accuracy rate remained below 70 per cent
- when a photo, gender and age range are inputted accuracy level higher than 88 per cent
- launched by the Ministry of Public Security in 2015
- cloud facilities to connect with data storage and processing centres distributed across the country
- portrait information of each Chinese citizen (1.3 billion), amounts to 13 terabytes
- the size of the full database with detailed personal information does not exceed 90 terabytes
- Isvision will use an algorithm developed by Seetatech Technology Co., a start-up established by several researchers from the Institute of Computing Technology at the Chinese Academy of Sciences in Beijing
- University of Electronic Science and Technology of China (UESTC) - Wikipedia
- Journal of Electronic Science and Technology - ScienceDirect
faced is a proof of concept that you don’t always need to rely on general purpose trained models in scenarios were these models are an overkill to your problem and performance issues are involved. Don’t overestimate the power of spending time designing custom neural network architectures that are specific to your problem. These specific networks will be a much better solution than the general ones. faced: CPU Real Time face detection using Deep Learning | Ivan Itzcovich - Towards Data Science
Haar Cascade [left] vs faced [right]
Liveness Detection
Youtube search... ...Google search
an AI computer system’s ability to determine that it is interfacing with a physically present human being and not an inanimate spoof artifact. Liveness Detection has become a necessary component of any authentication system that is based on face biometric technology where a trusted human is not supervising the authentication attempt.
- Facial Recognition is for surveillance; it's the 1-to-N matching of images captured with cameras the user doesn't control, like those in a casino or an airport. And it only provides "possible" matches for the surveilled person from face photos stored in an existing database.
- Face Authentication (1:1 Matching+Liveness), on the other hand, takes User-initiated data collected from a device they do control and confirms that User's identity for their own direct benefit, like, for example, secure account access.
Emotion Recognition
Youtube search... ...Google search
- Emotion Recognition Sandbox | Nesta
- Resources | Nesta
- Hume ... Capture nuances in expression—subtle facial movements of love or admiration, laughter tinged with awkwardness, sighs of relief—and build custom expression-language models
This project was created by a group of social scientists, citizen scientists, and designers. We want to open up conversations about emotion recognition systems from the science behind the technology to their social impacts--and everything else in between. Our aim is to promote public understanding of these technologies and citizen involvement in their development and use. We believe that through collective intelligence and sharing perspectives on such important issues, we can empower communities to promote a just and equitable society.
AI is used for emotion recognition through the application of various techniques and technologies. Emotion recognition aims to identify and understand human emotions based on facial expressions, speech patterns, gestures, and other physiological signals. Here are some ways in which AI is employed for emotion recognition:
- Facial Recognition:
- Facial Expression Analysis: AI algorithms analyze facial features and expressions to detect emotions. Deep learning techniques, such as (Deep) Convolutional Neural Network (DCNN/CNN), are commonly used for this purpose.
- Facial Landmark Detection: Algorithms identify key points on the face, such as the position of the eyes, nose, and mouth, to understand facial expressions and infer emotions.
- Speech Recognition: Voice Analysis: AI systems analyze speech patterns, tone, pitch, and other acoustic features to detect emotions in spoken language. Natural Language Processing (NLP) techniques are often employed for this task.
- Gesture Recognition: Body Language Analysis: AI can be trained to recognize specific gestures, body movements, and postures that are associated with different emotions.
- Biometric Sensors:: Physiological Signals: Some systems use biometric sensors to measure physiological signals such as heart rate, skin conductance, and EEG signals. Changes in these signals can be indicative of emotional states.
- Multimodal Approaches: Combining Multiple Modalities: Emotion recognition systems often integrate information from multiple sources, such as facial expressions, speech, and gestures, to improve accuracy and reliability.
- Machine Learning and Deep Learning: Training Models: AI models, particularly machine learning and deep learning models, are trained on large datasets containing labeled examples of different emotional states. These models learn patterns and features associated with specific emotions.
- Real-Time Applications: Live Interaction Analysis: Some applications use AI to analyze emotions in real-time, enabling adaptive responses in human-computer interaction scenarios, such as virtual assistants responding to user emotions.
- Applications in Various Fields:
- Customer Service: Emotion recognition is applied in customer service to gauge customer satisfaction and provide personalized responses.
- Education: AI-based emotion recognition is used in educational technology to adapt teaching methods based on students' emotional states.
- Healthcare: Emotion recognition has applications in mental health monitoring and assisting individuals with conditions like autism or depression.
Emotion dimensions:
- Valence: This slider captures the positive or negative aspect of emotions. Users can slide towards positive for emotions like happiness and contentment or towards negative for emotions like sadness and frustration.
- Arousal: Arousal reflects the intensity or activation level of emotions. Users can adjust the slider to express how calm or excited they feel. Low arousal might represent calmness, while high arousal could signify excitement or agitation.
- Intensity: This slider measures the overall strength or magnitude of the emotion. Users can adjust it to convey whether their emotional experience is more subtle and mild or strong and intense.
- Dominance: Dominance indicates the perceived level of control or influence in the emotional state. A slider can help users express whether they feel more submissive or dominant in a given emotional moment.
- Surprise: The surprise slider allows users to convey the unexpectedness of their emotional experience. It ranges from no surprise to extreme surprise, capturing the element of novelty in the emotional response.
- Sadness: This slider gauges the depth of sadness. Users can slide towards "None" for emotions with minimal sadness and towards "Overwhelming" for emotions associated with profound sadness.
- Anger: The anger slider enables users to express the intensity of their anger. It spans from no anger to extreme fury, providing a spectrum for users to convey the strength of their emotional response.
- Joy/Happiness: Users can use this slider to indicate the level of happiness or joy they are experiencing. It ranges from no joy to an ecstatic level of joy, helping to capture the range of positive emotional experiences.
- Fear: Fear intensity is expressed through this slider. Users can convey whether they feel a slight unease or are experiencing a state of extreme terror or fear.
- Disgust: This slider measures the intensity of disgust. Users can slide towards "None" for minimal disgust or towards "Repulsed" for emotions associated with strong feelings of disgust.
- Interest/Engagement: Users can adjust this slider to express their level of interest or engagement. It ranges from being uninterested or disengaged to fully engaged or absorbed in the experience.
- Confusion: The confusion slider reflects the clarity of the emotional experience. Users can slide towards "Clear" if they feel no confusion or towards "Confused" to indicate a high level of confusion associated with the emotion.
Pets
- SPCA adds facial recognition technology to reunite lost pets - Delaware State News
- Finding Rover | Finding Rover, Inc.
Pet Detection