CM10135
/ Programming II: Lecture
18
Applications
of Search II: Learning and `New AI'
-I. Housekeeping
- Those of
you who failed Programming I badly and have to rework the exam should
do it over break.
- Will help you do better in Programming II as well!
- Problems may not be just with programming, but also with exam
taking.
- Exam counts for more than all the courseworks combined.
- Coursework Marking:
- I'm glad many students have gotten programs working for the
first time here.
- Unfortunately, we can't give you extra marks for that, this
is programming II.
- Last year many students also first understood programming
second term.f
- Their marks got better & better across the three
courseworks.
- This is partly why CW1 is only worth 10 points instead of
13.333333...
- Appeal procedure on marking:
- Go to the TAs.
- If it can't be resolved, sign up for my office hours using
this link (only accessable from campus.)
- I still won't set a new mark without checking again with the
TAs so that everything is fair.
I. AI at Bath
- Courses in Logic, Agents, Vision, HCI.
- Might be dedicated
courses in AI, machine learning and/or modelling by your final year (or
for MEng.)
- Talk to your SSLC rep if you care.
- AI Seminar series (also HCI, Logic, general Department Seminars.)
- Look at Seminar link on dept. home page.
- Undergraduates can go to most of these, although we don't
organize your timetables around them.
- If you are keen, you can often work with a lecturer / help with
research.
- Look at faculty web pages (here are the AI ones)
to find out
what people are interested in.
- Only do this if you have time after your coursework -- if you
tend to finish things both well and early.
II. Searching in Advance I: Heuristic-Based AI
- Review on Hueristics and Search
- AI problems tend to be
- NP hard.
- Represented as a tree, where each node is an action, and its
children are the possible actions you could do next.
- Must use heuristics to prune
that tree and find a reasonable action.
- Design is search by the programmer!
- My PhD research area, see Designing
Intelligent Systems
- Trying to come up with good design rules is like evolution.
- This is why
sometimes
programmers can chance on the right answer to a question even sooner
than
they can understand how to solve the question.
- Languages
like java are designed to give you a heavily pruned search space.
- Using
good technique makes it likely you will find good answers!
- Using very limited search of known good strategies is the key to
several other kinds of AI...
- Production Rule Systems (see ACT-R, Soar)
- Production: perceptual context / precondition connected to an action,
- Many productions in a system
- action selection:
- go through all productions,
- find one that's precondition matches the current context
- execute / `fire' its action
- Problem: many productions can be triggered (be able to fire / have
their preconditions match the context) at the same time.
- e.g. it's day, AND it's raining
- System for choosing which production to run.
- Many systems have rules like recency, efficacy (must learn
for both of these)
- Soar does a search if more than one rule files, then saves
the results of the search as another production.
- Supposed to be like the brain.
- Often learns too much!
- ACT-R has Bayesian statistics on efficacy as well as
learning
new productions.
- Both use "problem space" to narrow number of possible
productions tested.
- Expert
Systems are production rule systems that encode knowledge
elicited from experts.
- The elicitation part is hard -- experts know things
implicitly.
- Perhaps should be called "novice systems"
- novices follow sets of rules
- experts recognize situations.
- Have got bad press (partly due to too large of sales pitch)
but still used in industry.
- Reactive
AI (that's my web page on the subject!)
- Designers replace evolution in providing the small search
area
for what to do next.
- Will hopefully show
some movies.
- Excellent
slides about this approach.
III. Searching in Advance II: Learning
- Important AI Concept: Anytime Algorithm.
- Have an algorithm that always gives you some solution.
- The longer you give the algorithm, the
better the solution gets.
- How chess programs work, some general
planning systems.
- Review on Parallel Search:
- One way to solve the problem of combinatorics is to have the
search run in parallel.
- Solutions people are currently working on:
- Quantum computing :
- Biological computing:
- Evolution / memetics:
- If a few members of a species or a culture find a good /
winning
solution, the entire species or culture may eventually adopt that
solution.
- This works much faster in memetics (ideas) then genetics
(requires waiting for children to grow up and breed.)
- Evolution doesn't necessarily discover optimal solutions.
- Can think of it as an anytime algorithm.
- Can think of it as a search for heuristics.
- Sometimes heuristics work well in some environments and
not others.
- Other organisms or ideas are evolving too, will find hacks
around what used to be a good solution.
- E.g. giant birds as predators in SA, nailed when cats got
across from Asia.
- Evolution does come up with a bunch of `good tricks' (Dennett)
- This is true of cultural evolution as well.
- Although we think we are rational agents who do things that
provably make sense, in fact much of what we do and know we get by
imitation of authority.
- Learning when you don't even know you've learned is called
Implicit Learning.
- Adopting mannerisms of people you admire or just observe.
- Language -- you pick up new words and even dialects without
noticing.
- May be most of what you learn.
- Several kinds of learning:
- Evolution: search done by species, we have a set of
good
tricks to choose from.
- E.g. how we pick something up is limited / enabled by the
number of arms we have, which way the joints work.
- Reduces search space => affects expressed behaviour
=>
part of intelligence!
- Examples
with legs.
- Storing results of previous search:
- Remember problem and solution, don't search again.
- update cost estimates (remember A*?) with real cost.
- These often are combined together
- Evolution essentially creates variables in animals which
are
easily updated with the right kind of information.
- E.g imprinting.
- Language?
- Short term memory, episodic memory, perceptual memory.
- Bottom line: learning is still search, it's just search in
advance, and search with a lot of biases / pruning already provided.
IV. Learning in AI
- Many kinds of machine learning, again, could easily be a whole
course.
- Tom
Mitchell Book
- New
Chris Bishop Book
- Don't neglect simple answers : giving variables and filling in
solutions when they become apparent.
- This approach can also be formalized: frames, case-based
reasoning.
- My work: Behaviour Oriented Design -- just use state in
objects
for learning.
- Neural Networks
- Perceptron Learning
- Backpropogation
- Applied Statistics (Chris Bishop
Book NN Book)
- Spiking models, compartment models. See Bath's group: Dan Richardson's page has
papers, he and all his students are a part of AmonI.
- Genetic Algorithms
- Evolution-like rules.
- Pick a representation of variables that affect the behavior
of an individual agent.
- Start with a population of possible values for those
variables.
- Run all the programs and evaluate how well they do.
- Select the ones that have done the job best,
- Create `children' by systematically permuting the variables
of the winners
- crossover : choose two (or more) parents, choose some of
the variable settings from each.
- mutation : just change a small number of variables
randomly
- any other operator -- doesn't have to be naturally
inspired.
- These children make a new population, now iterate! (go back
to 2)
- Good for searching around likely solutions
- Need to have a good idea what the right representation /
set
of variables is.
- Need to have a good idea of how to tell if the individuals
are doing well
- (tricky when starting from a solution too far from right
--
- what solves the first steps well may not be able to solve
the last steps.)
- Many variations / applications:
- See Bath's learning
classifier systems group.
- See demo from Karl
Sims (you'll have to chase some links...)
- Just the tip of the iceberg.
V. Summary
- Learning is a form of search.
- But it's specialized so likely to succeed.
- Still any bias to search may mean that you miss a good solution.
- Intelligence is (NP) hard!
- Come to the seminars if you like.
page author: Joanna Bryson
6 March 2007