Agents

From
Jump to: navigation, search

Youtube ... Quora ...Google search ...Google News ...Bing News



There are 3 stages; Beside, Inside, & Outside - Steven Bathiche


  1. AI will be beside what we are doing: AI is a valuable tool; a helper, a sidebar, an assistant that can aid us in many aspects of our lives. As a conversational AI, it can help us with research, writing, and presentations, breaking our mental blacks, managing our personal lives by scheduling appointments and managing our finances. By automating tasks such as data entry and email processing, AI frees up our time so that we can focus on more important tasks.
  2. AI will inside what we are doing: AI will become even more integrated into our daily lives, understanding our needs and preferences and anticipating our actions. It will be everywhere, acting as the scaffolding for our interactions with technology. Instead of waiting for us to make a keystroke or select an application, AI will take the lead, choosing the best app for us and coaching us through a vast array of possibilities.
  3. AI will be outside what we are doing: AI agents will act as intermediaries, pulling information from disparate data sources and interpreting the world for us. They will help us navigate and understand the context, acting as facilitators and eliminating the challenge of starting with a blank sheet.

An intelligent agent is anything which perceives its environment, takes actions autonomously in order to achieve goals, and may improve its performance with learning or acquiring knowledge. An agent has an "objective function" that encapsulates all the IA's goals. Such an agent is designed to create and execute whatever plan will, upon completion, maximize the expected value of the objective function.[2] For example, a reinforcement learning agent has a "reward function" that allows the programmers to shape the IA's desired behavior,[3] and an evolutionary algorithm's behavior is shaped by a "fitness function". - Wikipedia

What is an AI Agent?

Jarvis

YouTube ... Quora ...Google search ...Google News ...Bing News

Jarvis is a project from Microsoft that uses ChatGPT as the controller for a system where it can employ a variety of other models as needed to respond to your prompt. Microsoft Jarvis connects LLMs with ML community. Language serves as an interface for Large Language Model (LLM)s to connect numerous AI models for solving complicated AI tasks. Solving complicated AI tasks with different domains and modalities is a key step toward advanced artificial intelligence. While there are abundant AI models available for different domains and modalities, they cannot handle complicated AI tasks. Considering Large Language Model (LLM) have exhibited exceptional ability in language understanding, generation, interaction, and reasoning, we advocate that LLMs could act as a controller to manage existing AI models to solve complicated AI tasks and language could be a generic interface to empower this.



HuggingGPT is one instance of Jarvis, a web-based chatbot at Hugging Face, an online AI community which hosts thousands of open-source models.



When a user makes a request to the bot, Jarvis plans the task, chooses which models it needs, has those models perform the task and then generates and issues a response. The workflow of this system consists of four stages:

  1. Task Planning: Using ChatGPT to analyze the requests of users to understand their intention, and disassemble them into possible solvable tasks via prompts.
  2. Model Selection: To solve the planned tasks, ChatGPT selects expert models that are hosted on Hugging Face based on model descriptions.
  3. Task Execution: Invoke and execute each selected model, and return the results to ChatGPT.
  4. Response Generation: Finally, using ChatGPT to integrate the prediction of all models, and generate answers for users.



HuggingGPT

YouTube ... Quora ...Google search ...Google News ...Bing News

HuggingGPT that the Microsoft researchers have set up that leverages Large Language Model (LLM) such as ChatGPT to connect various AI models in machine learning communities to solve AI tasks. Specifically, HuggingGPT uses ChatGPT to conduct task planning when receiving a user request, select models according to their function descriptions available in Hugging Face, execute each subtask with the selected AI model, and summarize the response according to the execution results. To use HuggingGPT, you'll need to obtain an OpenAPI API Key if you don't already have one and sign up for a free account at Hugging Face. Once you've logged in to the site, navigate to Settings -> Access Tokens by clicking the links in the left rail.

Auto-GPT

YouTube ... Quora ...Google search ...Google News ...Bing News



Auto-GPT (AutoGPT) is an open-source Python AI Agent application based on GPT-4 that can self-prompt. This means that if the user states an end goal, the system can work out the steps needed to get there and carry them out. Auto-GPT works by setting a goal; the AI will then generate and complete tasks. Basically, it does all the follow-up work for you, asking and answering its own prompts in a loop. It utilizes the GPT-4 API and can perform a task with little human intervention.

Auto-GPT manages short-term and long-term memory by writing to and reading from databases and files; manages context window length requirements with summarization; can perform internet-based actions such as web searching, web form, and API interactions unattended; and includes text-to-speech for voice output. However, it has limitations in understanding and retaining extensive contextual information because the GPT model it leverages has a token limit. One way to address this contextual issue is to access a window of historical messages, such as the last ten messages or a fixed number of tokens, without exceeding the token limit of a single conversation. However, this method restricts Auto-GPT from accessing earlier contextual information, which might lead to the failure of Auto-GPT to accomplish its goal

To install Auto-GPT, you will need to have Python and Pip installed on your computer. You can download the latest version of Python from the official website and install it on your computer. You will also need to add API keys to use Auto-GPT. You can go to the GitHub release page of Auto-GPT and download the ZIP file by clicking on “Source code (zip)”

If you want Auto-GPT to speak using ElevenLabs, you will need to have an ElevenLabs API key. You can obtain your ElevenLabs API key from their website. Once you have your API key, you can add it to the .env file in the Auto-GPT directory.



Auto-GPT is not yet capable of achieving the AGI (Artificial General Intelligence) due to data quality, generalization, and explainability issues. - Kanwal Mehreen



Breaking Down AutoGPT | Kanwal Mehreen - KDnuggets ... here are the steps

  1. Input from the User
  2. Task Creation Agent
  3. Task Prioritization Agent
  4. Communication Between Agents
  5. Final Result - It also uses external memory to keep track of history and learn from its past experiences to generate more precise results.

The actions of these agents are visible on the user end in the following form:

  • Thoughts: AI agent share their thoughts after completing the action
  • Reasoning: It explains its choices of why is it choosing a particular course of action
  • Plan: The plan includes the new set of tasks
  • Criticism: Critically review the choices by identifying the limitations or concerns



Reflexion

YouTube ... Quora ...Google search ...Google News ...Bing News

Reflexion is a meta-technique approach that endows an agent with dynamic memory and self-reflection capabilities to enhance its existing reasoning trace and task-specific action choice abilities1 It builds on recent research and allows agents to learn from their mistakes and solve novel problems efficiently through a process of trial and error. Reflexion’s success has been demonstrated through evaluations in AlfWorld and HotPotQA environments, achieving success rates of 97% and 51%, respectively. To achieve full automation, Reflexion introduces a straightforward yet effective heuristic that enables the agent to pinpoint hallucination instances, avoid repetition in action sequences, and, in some environments, construct an internal memory map of the given environment.


Self-reflection allows humans to efficiently solve novel problems through a process of trial and error.




Autonomous GPT

YouTube ... Quora ...Google search ...Google News ...Bing News



AgentGPT

YouTube ... Quora ...Google search ...Google News ...Bing News

AgentGPT allows you to configure and deploy Autonomous AI agents. Name your custom AI and have it embark on any goal imaginable. It will attempt to reach the goal by thinking of tasks to do, executing them, and learning from the results

BabyAGI

YouTube ... Quora ...Google search ...Google News ...Bing News


MULTI·ON

YouTube ... Quora ...Google search ...Google News ...Bing News

Personal AI agent and life copilot that uses the browser to execute complex tasks.

  • MULTI·ON Browser is a web browser that uses ChatGPT and OpenAI plugins to interact with anything on the internet on your behalf
  • MULTI·ON is also a ChatGPT plugin that can automate tasks for you, such as posting on social media or ordering products online, or find any content on the web

Agents with Reinforcement Learning (RL)

Reinforcement Learning (RL) aims to make an agent (our “model”) learn through the interaction with an environment (this can be either virtual or real). RL was firstly developed to adhere to Markov Decision Process (MDP)es. In this ambit, an agent is placed in a stochastic stationary environment and tries to learn a policy through a reward/punishment mechanism. In this scenario, it is proved the agent will converge to a satisfactory policy. However, if multiple agents are placed in the same environment, this condition is no longer true. In fact, before the learning of the agent was only dependent on the interaction between the agent and the environment, now it is also dependent on the interaction between agents

AI-Powered Search

YouTube ... Quora ...Google search ...Google News ...Bing News


Multi-Agents Reinforcement Learning (MARL)

In a similar vein, multi-agent RL also addresses sequential decision-making problems, but with more than one agent involved. In particular, both the evolution of the system state and the reward received by each agent are influenced by the joint actions of all agents. More intriguingly, each agent has its own long-term reward to optimize, which now becomes a function of the policies of all other agents.

  • Markov/Stochastic Games
    • Cooperative Setting
    • Competitive Setting
    • Mixed Setting
  • Extensive-Form Games

Challenges

Despite a general model with broad applications, MARL suffers from several challenges in theoretical analysis, in addition to those that arise in single-agent RL.

  1. Non-Unique Learning Goals - Unlike single-agent RL, where the goal of the agent is to maximize the long-term return efficiently, the learning goals of MARL can be vague at times. ...Indeed, the goals that need to be considered in the analysis of MARL algorithms can be multi-dimensional ... is undoubtedly a reasonable solution concept in game theory, under the assumption that the agents are all rational, and are capable of perfectly reasoning and infinite mutual modeling of agents. However, with bounded rationality, the agents may only be able to perform finite mutual modeling
  2. Non-Stationarity - multiple agents usually learn concurrently, causing the environment faced by each individual agent to be non-stationary. In particular, the action taken by one agent affects the reward of other opponent agents, and the evolution of the state. As a result, the learning agent is required to account for how the other agents behave and adapt to the joint behavior accordingly. This invalidates the stationarity assumption for establishing the convergence of single-agent RL algorithms
  3. Scalability Issue - To handle non-stationarity, each individual agent may need to account for the joint action space, whose dimension increases exponentially with the number of agents. This is also referred to as the combinatorial nature of MARL
  4. Various Information Structures - Compared to the single-agent case, the information structure of MARL, namely, who knows what at the training and execution, is more involved. For example, in the framework of Markov games, it suffices to observe the instantaneous state st , in order for each agent to make decisions

Communication

Youtube search... ...Google search

Robotic Process Automation (RPA)

Robotic process automation (RPA) is a software technology that allows businesses to automate repetitive, rule-based tasks. RPA software robots, or "bots," can mimic human actions by interacting with computer systems and software applications. This can free up human employees to focus on more strategic and value-added tasks. RPA is a powerful tool that can be used to automate a wide variety of tasks, including:

  • Data entry
  • Data extraction
  • Data validation
  • Report generation
  • Email management
  • Workflow automation
  • Compliance checks
  • Customer service
  • IT support

AI is being used with RPA in a number of ways, including:

  • Automating tasks that require human judgment: AI can be used to automate tasks that require human judgment, such as identifying fraud or making predictions. This can free up human employees to focus on more strategic and value-added tasks.
  • Making RPA more efficient: AI can be used to make RPA more efficient by automating tasks that are currently done manually, such as data extraction and processing. This can save time and resources, and it can also help to ensure that tasks are done accurately and consistently.
  • Improving the accuracy of RPA: AI can be used to improve the accuracy of RPA by identifying and correcting errors. This can help to ensure that RPA is producing accurate and reliable results.
  • Expanding the capabilities of RPA: AI can be used to expand the capabilities of RPA by enabling it to perform tasks that were previously not possible, such as understanding natural language and making decisions based on incomplete or uncertain information. This can make RPA a more powerful tool for automating a wider range of tasks.