Difference between revisions of "Benchmarks"
m (→Large Language Model (LLM)s) |
m |
||
| Line 26: | Line 26: | ||
** [[Gaming]] | ** [[Gaming]] | ||
** [[Algorithm Administration#Model Monitoring|Model Monitoring]] | ** [[Algorithm Administration#Model Monitoring|Model Monitoring]] | ||
| − | |||
* [[ALFRED]] ... Action Learning From Realistic Environments and Directives | * [[ALFRED]] ... Action Learning From Realistic Environments and Directives | ||
* [[Algorithm Administration]] | * [[Algorithm Administration]] | ||
| Line 56: | Line 55: | ||
[https://news.google.com/search?q=ai+~Consciousness+test ...Google News] | [https://news.google.com/search?q=ai+~Consciousness+test ...Google News] | ||
[https://www.bing.com/news/search?q=ai+~Consciousness+test&qft=interval%3d%228%22 ...Bing News] | [https://www.bing.com/news/search?q=ai+~Consciousness+test&qft=interval%3d%228%22 ...Bing News] | ||
| + | |||
| + | * [[Singularity]] ... [[Artificial Consciousness / Sentience|Sentience]] ... [[Artificial General Intelligence (AGI)| AGI]] ... [[Inside Out - Curious Optimistic Reasoning| Curious Reasoning]] ... [[Emergence]] ... [[Moonshots]] ... [[Explainable / Interpretable AI|Explainable AI]] ... [[Algorithm Administration#Automated Learning|Automated Learning]] | ||
== <span id="Turing Test"></span>Turing Test == | == <span id="Turing Test"></span>Turing Test == | ||
Revision as of 19:30, 13 July 2023
YouTube ... Quora ...Google search ...Google News ...Bing News
- Data Science ... Governance ... Preprocessing ... Exploration ... Interoperability ... Master Data Management (MDM) ... Bias and Variances ... Benchmarks ... Datasets
- Large Language Model (LLM) ... Natural Language Processing (NLP) ... Generation ... Classification ... Understanding ... Translation ... Tools & Services
- Risk, Compliance and Regulation ... Ethics ... Privacy ... Law ... AI Governance ... AI Verification and Validation
- Case Studies
- ALFRED ... Action Learning From Realistic Environments and Directives
- Algorithm Administration
- Data Quality ...validity, accuracy, cleaning, completeness, consistency, encoding, padding, augmentation, labeling, auto-tagging, normalization, standardization, and imbalanced data
- Managed Vocabularies
- Excel ... Documents ... Database ... Graph ... LlamaIndex
- Analytics ... Visualization ... Graphical Tools ... Diagrams & Business Analysis ... Requirements ... Loop ... Bayes ... Network Pattern
- Development ... Notebooks ... AI Pair Programming ... Codeless, Generators, Drag n' Drop ... AIOps/MLOps ... AIaaS/MLaaS
- Backpropagation ... FFNN ... Forward-Forward ... Activation Functions ...Softmax ... Loss ... Boosting ... Gradient Descent ... Hyperparameter ... Manifold Hypothesis ... PCA
- Strategy & Tactics ... Project Management ... Best Practices ... Checklists ... Project Check-in ... Evaluation ... Measures
- AI Solver ... Algorithms ... Administration ... Model Search ... Discriminative vs. Generative ... Optimizer ... Train, Validate, and Test
- Machine Learning Benchmarks and AI Self-Driving Cars | Lance Eliot - AItrends
- Benchmarking simple models with feature extraction against modern black-box methods | Martin Dittgen - Towards Data Science
- DAWNBench | Stanford - an End-to-End Deep Learning Benchmark and Competition
- Benchmarking 20 Machine Learning Models Accuracy and Speed | Marc Borowczak - Data Science Central
- Benchmarking deep learning models on large healthcare datasets | S. Purushotham, C. Meng, Z. Chea, and Y. Liu
- Supercomputers Flex Their AI Muscles New benchmarks reveal science-task speedups | Sammuel K. Moore - IEEE Spectrum
Contents
AI Consciousness Testing
YouTube ... Quora ...Google search ...Google News ...Bing News
- Singularity ... Sentience ... AGI ... Curious Reasoning ... Emergence ... Moonshots ... Explainable AI ... Automated Learning
Turing Test
The Turing test, originally called the imitation game by Alan Turing in 1950, is a test of a machine's ability to exhibit intelligent behavior equivalent to, or indistinguishable from, that of a human. Turing proposed that a human evaluator would judge natural language conversations between a human and a machine designed to generate human-like responses. The evaluator would be aware that one of the two partners in conversation was a machine, and all participants would be separated from one another. The conversation would be limited to a text-only channel, such as a computer keyboard and screen, so the result would not depend on the machine's ability to render words as speech. If the evaluator could not reliably tell the machine from the human, the machine would be said to have passed the test. The test results would not depend on the machine's ability to give correct answers to questions, only on how closely its answers resembled those a human would give. - Turing Test | Wikipedia
Today an AI has to dumb down to pass the Turing Test - Ray Kurzweil
|
|
Theory of Mind
"Theory of Mind" means that people have thoughts, feelings and emotions that affect their behavior. Future AI systems must learn to understand that everyone (both people and AI objects) have thoughts and feelings. Future AI systems must know how to adjust their behavior to be able to walk among us.
|