9 years ago · 5197f4aa5a
--- a/Intelligence/lesson_06.md
+++ b/Intelligence/lesson_06.md
@@ -1,100 +1,74 @@
 
															 # AI - lesson 06
														
 
															 #### Francesco Arrigoni
														
 
															-###### 4 November 2015
														
 
															-## Adversarial search
														
 
															-Is usually employed in situations called *games*, in which there are multiple agents that interact together in a __strategic way__  
														
 
															-For example in the game of chess we play against another player, and so our moves can be represented by a tree
														
 
															+###### 28 October 2015
														
 
															+## Informed search strategies
														
 
															-There are different kind of games:
														
 
															-- __Perfect information__ games: we completely know the status of the game
														
 
															-- __Imperfect information__ games: 
														
 
															+### Evaluation function $f(n)$ types
														
 
															-Another distinction is:
														
 
															-- __Deterministic__ games: There are no randomness factors
														
 
															-- __Chance__ games: there are elements of chance for example rolling dices or drawing cards.
														
 
															+### Greedy best first
														
 
															+$$f(n)=h(n)$$
														
 
															+$h(n)$ is called *heuristic function* and is an estimate of __how far__ is a node __from the goal__
														
 
															-#### examples
														
 
															-- An example of perfect information and deterministic game is chess,
														
 
															-- An imperfect and deterministic game is battleship
														
 
															-- A perfect information based on chance is backgammon, or gioco dell'oca, or Monopoly
														
 
															-- An imperfect information and chance game is poker or cards games.
														
 
															+#### example
														
 
															+- The nodes of the graph are __cities__ and the heuristic function can be the cartesian distance of the cities
														
 
															-For this time will focus of __Perfect information__ and __deterministic__ games like chess.
														
 
															+- In the game of 15 an heuristic can be an estimate of the number of moves needed to complete the game
														
 
															-- We have two players: Max and Min,  
														
 
															-- They are playing in turns,
														
 
															-- Max will play first
														
 
															+By definition the heuristic function is defined over __nodes__, but commonly are defined over a __state__, in fact
														
 
															+the heuristic function of two nodes referring to the same node is the same.
														
 
															-An __initial state__ is an initial configuration of the game.
														
 
															+According to how heuristic functions are defines, there can be loops in __greedy best first__
														
 
															-### Tic Tac Toe
														
 
															+#### Optimality
														
 
															-Max: X
														
 
															-Min: O
														
 
															+This strategy is __not optimal__ in general
														
 
															-Initial state:
														
 
															+#### Complexity
														
 
															-||
														
 
															----|---|---
														
 
															-||
														
 
															-||
														
 
															+Setting a not clever heuristic function, this strategy is equivalent to the depht first  
														
 
															+But with a good heuristic function, we can exploit this to achieve better results.
														
 
															+__time__:$O(b^m)$
														
 
															+__space__:$O(b^n)$
														
 
															-We can define a __function "player(s)" that given a state will return which is the next player
														
 
															-- $\text{PLAYER}(s)\in \{max,min\}$
														
 
															-- $\text{ACTIONS}(s)= \{a_1,a_2,...\}$
														
 
															-- $\text{RESULT}(s,a)=s'$
														
 
															-- $\text{TERMINAL-TEST}(s)=\{yes,no\}$
														
 
															+### $A^*$
														
 
															-Next we define a utility function
														
 
															-- $\text{UTLITY}(s,p)\in R$
														
 
															+In this case the __evaluation function__ is defined as $f(n)=g(n)+h(n)$
														
 
															+$g(n)$ is the cost for going from the root to node n
														
 
															+$h(n)$ is an estimation of the costs for going from node n to the goal g
														
 
															-We consider __zero-based__ games, in which the utility of the two players in a final state are always summing up to zero
														
 
															-$\text{UTILITY}(s,\text{MAX})+\text{UTILITY}(s,\text{MIN})=0$
														
 
															+Doing so $f(n)$ is the estimated cost of the solution passing through n
														
 
															-For example
														
 
															-X|O|O
														
 
															----|---|---
														
 
															-|X|
														
 
															-||
														
 
															-$\text{RESULT}$Will give __no__ as a result
														
 
															+#### Optimality
														
 
															-While
														
 
															-X|O|O
														
 
															----|---|---
														
 
															-|X|
														
 
															-||X
														
 
															+$A^*$ is optimal when using __tree search__, that is when the heuristic function is *admissible*
														
 
															+$h^*()$ is admissible when $\forall n\;h(n)\le h^*(n)$ The heuristic function never overestimates the cost of reaching a node.
														
 
															-$\text{RESULT}$Will give __yes__ as a result
														
 
															-While $\text{UTILITY(s,MAX}$Will give -1 as a result
														
 
															-And  $\text{UTILITY(s,MIN}$Will give +1 as a result
														
 
															+Returning to the road example:  
														
 
															+The straight distance (line of sight) between two cities is a valid heuristic by definition, in fact the real distance
														
 
															+can not be smaller than this.
														
 
															-### minimax
														
 
															+##### example
														
 
															-Is an algorithm for solving __game trees__
														
 
															-- The root is called "MAX" node because t corresponds to MAX's turn  
														
 
															-- All children of MAX are called MIN nodes for the same reason  
														
 
															-- The third row contains the __terminal nodes__, and the number associated to every node is the utility for max to be in that node.
														
 
															+$$f(g_2)=g(g_2)+h(g_2)>c^*$$
														
 
															+$$f(n)=g(n)+h(n)\le c^*$$
														
 
															+$$f(n)\le c^*<f(g_2)$$
														
 
															-The minimax algorithm is based on the idea of a __minimax value__ for each node, that represents:
														
 
															-> The utility for max of being in (reaching) that node assuming that the other players will play optimally from that node to the end of the game
														
 
															+An Heuristic function is __consistent__ if for every node that is a successor of n
														
 
															+$$\foralln,n' h(n)\le c(n,n')+h(n')$$
														
 
															-The general rule for calculating the minimax value is the following:  
														
 
															-- The minimax value of a __terminal node__ is the utility of MAX
														
 
															-- The minimax value of a __MIN node__ is the minimum of the minimax value of the children
														
 
															-- The minimax value of a __MAX node__ is the maximum of the minimax value of the children
														
 
															+__Consistency__ of heuristic function implies __admissibilitu__
														
 
															-#### The minimax algorithm
														
 
															-- Build all the game tree
														
 
															-- Starting from the bottom we have to back-up the minimax value to the upper nodes.
														
 
															-- Knowing the minimax value of all the children of a node, we can calculate the value of a node.
														
 
															+- $f(n) is not decreasing along every path
														
 
															-The minimax value can be __minimized__ building a tree in a *depth first* fashion.
														
 
															+$A^*$ will choose nodes from the frontier with value that in a non decreasing order.
														
 
															-The problem of a possible *cutoff strategy* is that not completing the tree, we don't have terminal nodes, and i need an __evaluation function__
														
 
															+#### Complexity
														
 
															+- Time complexity $O(\square^{|h-h^*|})
														
 
															-#### Cutoff strategy
														
 
															+If the heuristic function is always zero, $A^*$ degerates in uniform cost.
														
 
															+
														
 
															+If we have a perfect heuristic function, the time complexity is constant
														
 
															+$A^*$ is called __optimally efficient__, this means that given a fixed heuristic function. $A^*$ is guaranteed to expand the
														
 
															+minimum number of nodes
														
 
															-Usually in chess or checkers i can cut-off the tree at a given level, but i can not do so at a random level
														
 
															-An option is implementing a __quiescence evaluation function__ that tells us if a certain level is stable, 
														
 
															-otherwise we continue 
														
--- a/Intelligence/lesson_07.md
+++ b/Intelligence/lesson_07.md
@@ -1,56 +1,100 @@
 
															 # AI - lesson 07
														
 
															 #### Francesco Arrigoni
														
 
															-###### 6 November 2015
														
 
															-## Adversarial games
														
 
															+###### 4 November 2015
														
 
															+## Adversarial search
														
 
															+Is usually employed in situations called *games*, in which there are multiple agents that interact together in a __strategic way__  
														
 
															+For example in the game of chess we play against another player, and so our moves can be represented by a tree
														
 
															-### $\alpha - \beta pruning$ 
														
 
															-#### Iteration
														
 
															-- We start from the root, that will be labeled after the starting player.
														
 
															-- From the root the first set of arcs representing actions is called a1,a2,a3...
														
 
															-- The arcs starting from a 2nd level node are named b1,b2,b3...
														
 
															-- For a MIN node (e.g 2nd level) we not necessarily know the minimax value, 
														
 
															-but we can tell it is lower than the values of the known children (minimum value)
														
 
															-- We can have an hint about the minimax value of the root only when we have at least one branch completely built
														
 
															-- After we have found the minimax value of a node, we can remove from memory his children.
														
 
															-I do not need to generate the complete tree of a node if MAX has already found a node of higher value
														
 
															+There are different kind of games:
														
 
															+- __Perfect information__ games: we completely know the status of the game
														
 
															+- __Imperfect information__ games:
														
 
															-The order of the nodes determines which nodes are discovered or not
														
 
															+Another distinction is:
														
 
															+- __Deterministic__ games: There are no randomness factors
														
 
															+- __Chance__ games: there are elements of chance for example rolling dices or drawing cards.
														
 
															-#### Complexity
														
 
															+#### examples
														
 
															+- An example of perfect information and deterministic game is chess,
														
 
															+- An imperfect and deterministic game is battleship
														
 
															+- A perfect information based on chance is backgammon, or gioco dell'oca, or Monopoly
														
 
															+- An imperfect information and chance game is poker or cards games.
														
 
															-Using the most efficient $\alpha - \beta pruning$
														
 
															-We have a __time complexity__ of $O(b^{m/2})$
														
 
															-With respect of classic minimax $O(b^n)$
														
 
															+For this time will focus of __Perfect information__ and __deterministic__ games like chess.
														
 
															-This means that with $\alpha - \beta pruning$ we can obtain a tree with __double the depth__ comparing to minimax
														
 
															+- We have two players: Max and Min,  
														
 
															+- They are playing in turns,
														
 
															+- Max will play first
														
 
															-#### Meaning of the name
														
 
															+An __initial state__ is an initial configuration of the game.
														
 
															-In the first public version of this algorithm, the current best option for MAX was called $\alpha$
														
 
															+### Tic Tac Toe
														
 
															-In a dual way $\beta$ was the value of the current best option for MIN.
														
 
															+Max: X
														
 
															+Min: O
														
 
															-#### General case
														
 
															+Initial state:
														
 
															-In a general case, known a MAX node value $\alpha$, while discovering MAX nodes of lower levels, all children with value higher than $\alpha$ are pruned.
														
 
															+||
														
 
															+---|---|---
														
 
															+||
														
 
															+||
														
 
															-We can repeat the same reasoning for MIN and $\beta$
														
 
															+We can define a __function "player(s)" that given a state will return which is the next player
														
 
															+- $\text{PLAYER}(s)\in \{max,min\}$
														
 
															+- $\text{ACTIONS}(s)= \{a_1,a_2,...\}$
														
 
															+- $\text{RESULT}(s,a)=s'$
														
 
															+- $\text{TERMINAL-TEST}(s)=\{yes,no\}$
														
 
															-$\alpha$ and $\beta$ are not the extremens of a node interval.
														
 
															+Next we define a utility function
														
 
															+- $\text{UTLITY}(s,p)\in R$
														
 
															-## Games with chance
														
 
															+We consider __zero-based__ games, in which the utility of the two players in a final state are always summing up to zero
														
 
															+$\text{UTILITY}(s,\text{MAX})+\text{UTILITY}(s,\text{MIN})=0$
														
 
															-For this games certain authors say that there are three players: MAX,MIN and Nature, but this is misleading.
														
 
															+For example
														
 
															+X|O|O
														
 
															+---|---|---
														
 
															+|X|
														
 
															+||
														
 
															+$\text{RESULT}$Will give __no__ as a result
														
 
															-### Expectiminimax algorithm
														
 
															-We have a similar tree to the minimax one, With chance nodes with  probability on the descending arcs.
														
 
															+While
														
 
															+X|O|O
														
 
															+---|---|---
														
 
															+|X|
														
 
															+||X
														
 
															-The procedure of calculating and backing up the minimax values for the normal nodes is the same as minimax.
														
 
															+$\text{RESULT}$Will give __yes__ as a result
														
 
															+While $\text{UTILITY(s,MAX}$Will give -1 as a result
														
 
															+And  $\text{UTILITY(s,MIN}$Will give +1 as a result
														
 
															-#### Alternative strategy
														
 
															+### minimax
														
 
															-We know that our utility function from design will return values in a given interval e.g $[-2,2]$
														
 
															+Is an algorithm for solving __game trees__
														
 
															+- The root is called "MAX" node because t corresponds to MAX's turn  
														
 
															+- All children of MAX are called MIN nodes for the same reason  
														
 
															+- The third row contains the __terminal nodes__, and the number associated to every node is the utility for max to be in that node.
														
 
															-From this we know that for every node, the minimax value will be between $[-2,2]$
														
 
															+The minimax algorithm is based on the idea of a __minimax value__ for each node, that represents:
														
 
															+> The utility for max of being in (reaching) that node assuming that the other players will play optimally from that node to the end of the game
														
 
															-Expectiminimax is dependent on the __actual values__ of the utility function, while in standard minimax it does not mattes.
														
 
															+The general rule for calculating the minimax value is the following:  
														
 
															+- The minimax value of a __terminal node__ is the utility of MAX
														
 
															+- The minimax value of a __MIN node__ is the minimum of the minimax value of the children
														
 
															+- The minimax value of a __MAX node__ is the maximum of the minimax value of the children
														
 
															+
														
 
															+#### The minimax algorithm
														
 
															+- Build all the game tree
														
 
															+- Starting from the bottom we have to back-up the minimax value to the upper nodes.
														
 
															+- Knowing the minimax value of all the children of a node, we can calculate the value of a node.
														
 
															+
														
 
															+The minimax value can be __minimized__ building a tree in a *depth first* fashion.
														
 
															+
														
 
															+The problem of a possible *cutoff strategy* is that not completing the tree, we don't have terminal nodes, and i need an __evaluation function__
														
 
															+
														
 
															+#### Cutoff strategy
														
 
															+
														
 
															+Usually in chess or checkers i can cut-off the tree at a given level, but i can not do so at a random level
														
 
															+
														
 
															+An option is implementing a __quiescence evaluation function__ that tells us if a certain level is stable,
														
 
															+otherwise we continue
														
--- a/Intelligence/lesson_08.md
+++ b/Intelligence/lesson_08.md
@@ -0,0 +1,56 @@
 
															+# AI - lesson 08
														
 
															+#### Francesco Arrigoni
														
 
															+###### 6 November 2015
														
 
															+## Adversarial games
														
 
															+
														
 
															+### $\alpha - \beta pruning$
														
 
															+#### Iteration
														
 
															+- We start from the root, that will be labeled after the starting player.
														
 
															+- From the root the first set of arcs representing actions is called a1,a2,a3...
														
 
															+- The arcs starting from a 2nd level node are named b1,b2,b3...
														
 
															+- For a MIN node (e.g 2nd level) we not necessarily know the minimax value,
														
 
															+but we can tell it is lower than the values of the known children (minimum value)
														
 
															+- We can have an hint about the minimax value of the root only when we have at least one branch completely built
														
 
															+- After we have found the minimax value of a node, we can remove from memory his children.
														
 
															+I do not need to generate the complete tree of a node if MAX has already found a node of higher value
														
 
															+
														
 
															+The order of the nodes determines which nodes are discovered or not
														
 
															+
														
 
															+#### Complexity
														
 
															+
														
 
															+Using the most efficient $\alpha - \beta pruning$
														
 
															+We have a __time complexity__ of $O(b^{m/2})$
														
 
															+With respect of classic minimax $O(b^n)$
														
 
															+
														
 
															+This means that with $\alpha - \beta pruning$ we can obtain a tree with __double the depth__ comparing to minimax
														
 
															+
														
 
															+#### Meaning of the name
														
 
															+
														
 
															+In the first public version of this algorithm, the current best option for MAX was called $\alpha$
														
 
															+
														
 
															+In a dual way $\beta$ was the value of the current best option for MIN.
														
 
															+
														
 
															+#### General case
														
 
															+
														
 
															+In a general case, known a MAX node value $\alpha$, while discovering MAX nodes of lower levels, all children with value higher than $\alpha$ are pruned.
														
 
															+
														
 
															+We can repeat the same reasoning for MIN and $\beta$
														
 
															+
														
 
															+$\alpha$ and $\beta$ are not the extremens of a node interval.
														
 
															+
														
 
															+## Games with chance
														
 
															+
														
 
															+For this games certain authors say that there are three players: MAX,MIN and Nature, but this is misleading.
														
 
															+
														
 
															+### Expectiminimax algorithm
														
 
															+We have a similar tree to the minimax one, With chance nodes with  probability on the descending arcs.
														
 
															+
														
 
															+The procedure of calculating and backing up the minimax values for the normal nodes is the same as minimax.
														
 
															+
														
 
															+#### Alternative strategy
														
 
															+
														
 
															+We know that our utility function from design will return values in a given interval e.g $[-2,2]$
														
 
															+
														
 
															+From this we know that for every node, the minimax value will be between $[-2,2]$
														
 
															+
														
 
															+Expectiminimax is dependent on the __actual values__ of the utility function, while in standard minimax it does not mattes.
														
--- a/Intelligence/lesson_0x.md
+++ b/Intelligence/lesson_0x.md
@@ -1,74 +0,0 @@
 
															-# AI - lesson 06
														
 
															-#### Francesco Arrigoni
														
 
															-###### 28 October 2015
														
 
															-## Informed search strategies
														
 
															-
														
 
															-### Evaluation function $f(n)$ types
														
 
															-
														
 
															-### Greedy best first
														
 
															-$$f(n)=h(n)$$
														
 
															-$h(n)$ is called *heuristic function* and is an estimate of __how far__ is a node __from the goal__
														
 
															-
														
 
															-#### example
														
 
															-- The nodes of the graph are __cities__ and the heuristic function can be the cartesian distance of the cities
														
 
															-
														
 
															-- In the game of 15 an heuristic can be an estimate of the number of moves needed to complete the game
														
 
															-
														
 
															-By definition the heuristic function is defined over __nodes__, but commonly are defined over a __state__, in fact
														
 
															-the heuristic function of two nodes referring to the same node is the same.
														
 
															-
														
 
															-According to how heuristic functions are defines, there can be loops in __greedy best first__
														
 
															-
														
 
															-#### Optimality
														
 
															-
														
 
															-This strategy is __not optimal__ in general
														
 
															-
														
 
															-#### Complexity
														
 
															-
														
 
															-Setting a not clever heuristic function, this strategy is equivalent to the depht first  
														
 
															-But with a good heuristic function, we can exploit this to achieve better results.
														
 
															-__time__:$O(b^m)$
														
 
															-__space__:$O(b^n)$
														
 
															-
														
 
															-### $A^*$
														
 
															-
														
 
															-In this case the __evaluation function__ is defined as $f(n)=g(n)+h(n)$
														
 
															-$g(n)$ is the cost for going from the root to node n
														
 
															-$h(n)$ is an estimation of the costs for going from node n to the goal g
														
 
															-
														
 
															-Doing so $f(n)$ is the estimated cost of the solution passing through n
														
 
															-
														
 
															-#### Optimality
														
 
															-
														
 
															-$A^*$ is optimal when using __tree search__, that is when the heuristic function is *admissible*
														
 
															-$h^*()$ is admissible when $\forall n\;h(n)\le h^*(n)$ The heuristic function never overestimates the cost of reaching a node.
														
 
															-
														
 
															-Returning to the road example:  
														
 
															-The straight distance (line of sight) between two cities is a valid heuristic by definition, in fact the real distance
														
 
															-can not be smaller than this.
														
 
															-
														
 
															-##### example
														
 
															-
														
 
															-$$f(g_2)=g(g_2)+h(g_2)>c^*$$
														
 
															-$$f(n)=g(n)+h(n)\le c^*$$
														
 
															-$$f(n)\le c^*<f(g_2)$$
														
 
															-
														
 
															-An Heuristic function is __consistent__ if for every node that is a successor of n
														
 
															-$$\foralln,n' h(n)\le c(n,n')+h(n')$$
														
 
															-
														
 
															-__Consistency__ of heuristic function implies __admissibilitu__
														
 
															-
														
 
															-- $f(n) is not decreasing along every path
														
 
															-
														
 
															-$A^*$ will choose nodes from the frontier with value that in a non decreasing order.
														
 
															-
														
 
															-#### Complexity
														
 
															-- Time complexity $O(\square^{|h-h^*|})
														
 
															-
														
 
															-If the heuristic function is always zero, $A^*$ degerates in uniform cost.
														
 
															-
														
 
															-If we have a perfect heuristic function, the time complexity is constant
														
 
															-$A^*$ is called __optimally efficient__, this means that given a fixed heuristic function. $A^*$ is guaranteed to expand the
														
 
															-minimum number of nodes
														
 
															-
														
 
															-