3

Imagine a system that controls dampers in a complex vent system that has an objective to perfectly equalize the output from each vent. The system has sensors for damper position, flow at various locations and at each vent. The system is initially implemented using a rather small data set or even a formulaic algorithm to control the dampers. What if that algorithm were programmed to "try" different configurations of dampers to optimize the air flows, guided broadly by either the initial (weak) training or the formula? The system would try different configurations and learn what improved results, and what worsened results, in an effort to reduce error (differential outflow).

What is that kind of AI system called? What is that system of learning called? Are there systems that do that currently?

nbro
  • 39,006
  • 12
  • 98
  • 176
  • This question is very similar to [this one](https://ai.stackexchange.com/q/8241/2444), if not an exact duplicate. – nbro Nov 07 '20 at 17:23

3 Answers3

2

Near solution to your problem definition is reinforcement learning. You can define some reward using the objective function and define some possible state space for the machine and finally solve the problem by reinforcement learning techniques (near to trial and error by learning the preferences).

OmG
  • 1,731
  • 10
  • 19
1

I think any learning algorithm probably uses trial and error and analysis of the results with the ultimate goal of maximizing utility.

It seems that the recent milestones in AI fall under the general umbrella of machine learning, which includes all forms of reinforcement learning. Essentially, any learning algorithm is using some form of statistical analysis.

  • For an umbrella term, I've been using "learning algorithm"

However, there is also a venerable history of less capable adaptive systems such as self-organizing networks. (See also optimal control.)

DukeZhou
  • 6,237
  • 5
  • 25
  • 53
1

I believe "Reinforcement Learning" is the term you are looking for (as mentioned by others as well) but keep in mind that the scope of your problem falls under the section of AI that is called Search.

Search algorithms are based upon experimenting with different actions (decisions) and selecting the one that minimizes an arbitrary cost function (reward), given the current and past problem states.

pcko1
  • 241
  • 2
  • 7