Download Lecture Title

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Nonsynaptic plasticity wikipedia , lookup

Neurophilosophy wikipedia , lookup

Artificial neural network wikipedia , lookup

Neural engineering wikipedia , lookup

Holonomic brain theory wikipedia , lookup

Catastrophic interference wikipedia , lookup

Central pattern generator wikipedia , lookup

Neuropsychopharmacology wikipedia , lookup

Expert system wikipedia , lookup

Convolutional neural network wikipedia , lookup

AI winter wikipedia , lookup

Metastability in the brain wikipedia , lookup

Incomplete Nature wikipedia , lookup

Synaptic gating wikipedia , lookup

Neural modeling fields wikipedia , lookup

Recurrent neural network wikipedia , lookup

Type-2 fuzzy sets and systems wikipedia , lookup

Biological neuron model wikipedia , lookup

Nervous system network models wikipedia , lookup

Types of artificial neural networks wikipedia , lookup

Fuzzy logic wikipedia , lookup

Transcript
Intro. ANN & Fuzzy Systems
Lecture 3
Basic Definitions of ANN
Intro. ANN & Fuzzy Systems
Outline
• What is an Artificial Neural Network (ANN)?
• Basic Definitions:
– Neuron
• Net functions
• Activation functions
– ANN Configurations
(C) 2001 by Yu Hen Hu
2
Intro. ANN & Fuzzy Systems
What is an Artificial Neural Network?
An artificial neural network (ANN) is a massively parallel
distributed computing system (algorithm, device, or other) that
has a natural propensity for storing experiential knowledge and
making it available for use. It resembles the brain in two
aspects:
1). Knowledge is acquired by the network through a learning
process.
2). Inter–neuron connection strengths known as synaptic
weights are used to store the knowledge.
– Aleksander & Morton (1990), Haykin (1994)
(C) 2001 by Yu Hen Hu
3
Intro. ANN & Fuzzy Systems
Schematic of a Biological Neuron
In a neuron cell, many dendrites accept information
from other neurons, and a single axon output signals to
other neurons. This is an extremely simplified model.
However, it is still a popular and useful one.
(C) 2001 by Yu Hen Hu
4
Intro. ANN & Fuzzy Systems
WHY ANN?
• Has a potential to solve difficult problems current
methods can not solve well (realistic reasons):
– Pattern classification: hand-written characters, facial
expression, engine diagnosis, etc.
– Non-linear time series modeling, forecasting: Stock price,
utility forecasting, ecg/eeg/emg, speech, etc.
– Adaptive control, machine learning: robot arm, autonomous
vehicle
• Requires massive parallel implementation with optical
devices, analog ICs.
• Performance degrades gracefully when portions of
the network are faulty.
(C) 2001 by Yu Hen Hu
5
Intro. ANN & Fuzzy Systems
NEURON MODEL
• McCulloch-Pitts (Simplistic) Neuron Model
y1
yj
wi1
wij
wiN
yN

ui
ai
i
• The network function of a neuron is a
weighted sum of its input signals plus a bias
term.
(C) 2001 by Yu Hen Hu
6
Intro. ANN & Fuzzy Systems
Neuron Model
• The net function is a linear or nonlinear mapping from the
input data space to an intermediate feature space (in
terms of pattern classification).
• The most common form is a hyper-plane

N
ui   wij y j   i wTi y   i   i
j 1

1 
wTi  
 y
w
all the y such that
w•y = constant
form this plane
w•y
(C) 2001 by Yu Hen Hu
y
7
Intro. ANN & Fuzzy Systems
Side note: A Hyper Plane
Let y = [y1, y2, …, yN]T be a point (vector) in the N dimensional
space, and w = [w, w, …, wN]T be another vector. Then, the
inner product between these two vectors,
wTy = c
defines a (N1) dimensional hyperplane in the N-dimensional
space. In 2-dimensional space, a hyperplane is a straight line
that has the equation
w1y1 + w2y2 = c
In a 3D space, a hyperplane is just a plane.
A hyperplane partitions the space into two halves.
(C) 2001 by Yu Hen Hu
8
Intro. ANN & Fuzzy Systems
Other Net Functions
• Higher order net function: Net function is a linear
combination of higher order polynomial terms. for
example, a 2nd order net function has the form:
ui 
N
w
j 1, k 1
ijk
y j yk   i
• Delta (S-P) net function – instead of summation, the
product of all weighted synaptic inputs are computed:
N
ui   wij y j
j 0
(C) 2001 by Yu Hen Hu
9
Intro. ANN & Fuzzy Systems
NEURON Activation Function
• Linear – f(x) = a x + b,
df ( x )
dx
f(x)
= a : a constant
x
Linear
• Radial basis –
f(x) = exp(–a||x - xo||2);
f(x)
df ( x)
 2a ( x  xo )  f ( x)
dx
(C) 2001 by Yu Hen Hu
Radial
basis
x
10
Intro. ANN & Fuzzy Systems
More Activation Function
• Threshold –
f(x)
1 x  x1;
f(x) =  1 x  x .
1

• Sigmoid –
1
f ( x) 
1  exp(  x / T )
1
x1
x
–1
; T: temperature
f(x)
1
df ( x) 1
 f ( x)[1  f ( x)]
dx
T
(C) 2001 by Yu Hen Hu
Sigmoid
x
11
Intro. ANN & Fuzzy Systems
More Activation Function
x
• Hyperbolic tangent – f ( x)  tanh( )
T
f(x)
1
df ( x)
 T [1  f 2 ( x)]
dx
x
–1
• Inverse tangent –
 x
f ( x)  tan  

T 
2
1
df ( x)
2
1

dx
T 1  ( x / T ) 2
(C) 2001 by Yu Hen Hu
12
Intro. ANN & Fuzzy Systems
ANN CONFIGURATION
• Uni-directional communication links represented by
directed arcs. The ANN structure thus can be
described by a directed graph.
• Fully connected – a cyclic graph with feed-back.
There are N2 connections for N neurons.
(C) 2001 by Yu Hen Hu
13
Intro. ANN & Fuzzy Systems
ANN CONFIGURATION
• Feed-forward, layered connection – acyclic
directed graph, no loop or cycle.
Layer#2
Layer#1
Layer#0
(C) 2001 by Yu Hen Hu
14
Intro. ANN & Fuzzy Systems
ANN Configuration
A net with unidirection
lateral connection
(C) 2001 by Yu Hen Hu
A net with feedback
15
Intro. ANN & Fuzzy Systems
Feed-back Dynamic System
• Without Delay, feedback
cause causality problem: an
unknown variable depends
on an unknown variable!
a2 = g(a1) = g(g(a2)) = …
• To break the cycle, at least
one delay element must be
inserted into the feedback
loop.
• This effectively created a
nonlinear dynamic system
(sequential machine).
(C) 2001 by Yu Hen Hu
a2
a1
D a2
a1
16