Difference between revisions of "Evaluation"

From
Jump to: navigation, search
m
m
Line 24: Line 24:
 
||
 
||
 
<youtube>xoBWBNsjWoM</youtube>
 
<youtube>xoBWBNsjWoM</youtube>
<b>HH1
+
<b>Evaluating AI- and ML-Based Security Products
</b><br>BB1
+
</b><br>Anup Ghosh, Founder and CEO, Invincea Liam Randall, President, Critical Stack, A Division of Capital One  Chad Skipper, VP Competitive Intelligence and Product Testing, Cylance
 +
Mike Spanbauer, Vice President of Research and Strategy, NSS Labs With endless AI or machine learning product claims, buyers are left bewildered with how to test these claims. It falls to independent third-party test organizations to develop and update traditional test protocols to test and validate AI and ML product capability claims. This panel will tackle the key issues that third-party testing must address to validate AI and ML security products.
 
|}
 
|}
 
|<!-- M -->
 
|<!-- M -->
Line 32: Line 33:
 
||
 
||
 
<youtube>7CcSm0PAr-Y</youtube>
 
<youtube>7CcSm0PAr-Y</youtube>
<b>HH2
+
<b>How Should We Evaluate Machine Learning for AI?: Percy Liang
</b><br>BB2
+
</b><br>Machine learning has undoubtedly been hugely successful in driving progress in AI, but it implicitly brings with it the train-test evaluation paradigm. This standard evaluation only encourages behavior that is good on average; it does not ensure robustness as demonstrated by adversarial examples, and it breaks down for tasks such as dialogue that are interactive or do not have a correct answer. In this talk, I will describe alternative evaluation paradigms with a focus on natural language understanding tasks, and discuss ramifications for guiding progress in AI in meaningful directions. Percy Liang is an Assistant Professor of Computer Science at Stanford University (B.S. from MIT, 2004; Ph.D. from UC Berkeley, 2011).  His research spans machine learning and natural language processing, with the goal of developing trustworthy agents that can communicate effectively with people and improve over time through interaction.  Specific topics include question answering, dialogue, program induction, interactive learning, and reliable machine learning.  His awards include the IJCAI Computers and Thought Award (2016), an NSF CAREER Award (2016), a Sloan Research Fellowship (2015), and a Microsoft Research Faculty Fellowship (2014).
 
|}
 
|}
 
|}<!-- B -->
 
|}<!-- B -->
Line 44: Line 45:
 
||
 
||
 
<youtube>cRpuJcALfkA</youtube>
 
<youtube>cRpuJcALfkA</youtube>
<b>HH1
+
<b>Build or buy AI? You're asking the wrong question
</b><br>BB1
+
</b><br>Evan Kohn, chief business officer and head of marketing at Pypestream, talks with Tonya Hall about why companies need to turn to staffing for AI and building data sets.
 
|}
 
|}
 
|<!-- M -->
 
|<!-- M -->
Line 52: Line 53:
 
||
 
||
 
<youtube>b2Yvf7poKbM</youtube>
 
<youtube>b2Yvf7poKbM</youtube>
<b>HH2
+
<b>Why you should Buy Open-Source AI
</b><br>BB2
+
</b><br>Considering an AI assistant in your home? Before you auto-buy that pretty picture in front of you, be sure to check out the open-source offerings as well.
 
|}
 
|}
 
|}<!-- B -->
 
|}<!-- B -->

Revision as of 08:29, 4 September 2020

YouTube search... ...Google search

Many products today leverage artificial intelligence for a wide range of industries, from healthcare to marketing. However, most business leaders who need to make strategic and procurement decisions about these technologies have no formal AI background or academic training in data science. The purpose of this article is to give business people with no AI expertise a general guideline on how to assess an AI-related product to help decide whether it is potentially relevant to their business. How to Assess an Artificial Intelligence Product or Solution (Even if You’re Not an AI Expert) | Daniel Faggella - Emerj

Evaluating AI- and ML-Based Security Products
Anup Ghosh, Founder and CEO, Invincea Liam Randall, President, Critical Stack, A Division of Capital One Chad Skipper, VP Competitive Intelligence and Product Testing, Cylance Mike Spanbauer, Vice President of Research and Strategy, NSS Labs With endless AI or machine learning product claims, buyers are left bewildered with how to test these claims. It falls to independent third-party test organizations to develop and update traditional test protocols to test and validate AI and ML product capability claims. This panel will tackle the key issues that third-party testing must address to validate AI and ML security products.

How Should We Evaluate Machine Learning for AI?: Percy Liang
Machine learning has undoubtedly been hugely successful in driving progress in AI, but it implicitly brings with it the train-test evaluation paradigm. This standard evaluation only encourages behavior that is good on average; it does not ensure robustness as demonstrated by adversarial examples, and it breaks down for tasks such as dialogue that are interactive or do not have a correct answer. In this talk, I will describe alternative evaluation paradigms with a focus on natural language understanding tasks, and discuss ramifications for guiding progress in AI in meaningful directions. Percy Liang is an Assistant Professor of Computer Science at Stanford University (B.S. from MIT, 2004; Ph.D. from UC Berkeley, 2011). His research spans machine learning and natural language processing, with the goal of developing trustworthy agents that can communicate effectively with people and improve over time through interaction. Specific topics include question answering, dialogue, program induction, interactive learning, and reliable machine learning. His awards include the IJCAI Computers and Thought Award (2016), an NSF CAREER Award (2016), a Sloan Research Fellowship (2015), and a Microsoft Research Faculty Fellowship (2014).

Buying

Build or buy AI? You're asking the wrong question
Evan Kohn, chief business officer and head of marketing at Pypestream, talks with Tonya Hall about why companies need to turn to staffing for AI and building data sets.

Why you should Buy Open-Source AI
Considering an AI assistant in your home? Before you auto-buy that pretty picture in front of you, be sure to check out the open-source offerings as well.