Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
version 1.02 Advanced Malcolm E. Hays U N I V E R S I T Y O F M I S S O U R I – R O L L A Copyright © 1999 – 2003 Malcolm E. Hays and the University of Missouri – Rolla. All rights reserved. No part of this work may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopying, recording, scanning, or by any information storage or retrieval system without the prior written consent of the University of Missouri – Rolla unless such copying is permitted by federal copyright law. Preface The University of Missouri – Rolla's (UMR's) BrainTrax Algebra system is an online learning tool aimed at helping struggling students comprehend algebra. You can find it at: http://braintrax.umr.edu Click on "Algebra" at the top of the page to enter the online BrainTrax Algebra Brain. The Algebra Brain is a powerful tool designed to associate different concepts with each other in a visual web of knowledge. We have enhanced the visual web with substantial content. It is not a textbook. It is a way of learning mathematics. What you hold in your hands is the entire body of knowledge contained in the Algebra Brain. Over 1000 pages of pure algebraic goodness, distilled into printer-friendly format. You might be wondering, "If it is available online, why print it out?" First and foremost, in an effort to be more accessible to teachers and administrators, we have created this document so that everyone has a chance to see what the Algebra Brain has to offer. Oftentimes a computer terminal is not handy, or the right software has not been installed, or for some reason the hardware just does not work. In that case, we have BrainTrax Algebra in hand to provide instructors and administrators a glimpse of the many features offered in the Algebra Brain. As mentioned earlier, the online Algebra Brain contains over 1000 pages of algebra. This is very difficult to comprehend in a website since you only see a small portion of the site at any given time. This monstrously thick volume hammers home the idea of 1000 pages of algebra, each page better than the last. Unfortunately, BrainTrax Algebra lacks one important component: it is not interactive like a website. You cannot point and click your way through this book. Instructors can use BrainTrax Algebra as a reference to find information that might be of interest to their students (or themselves—there are a lot of interesting facts in BrainTrax Algebra not found in any algebra textbook). Having reviewed a given concept, the instructor can then point students to it in the Algebra Brain. Each concept in BrainTrax Algebra has a "thought-path" associated with it to guide users to that concept in the Algebra Brain (concepts in the Algebra Brain are called "thoughts"). We would like to stress that fact that BrainTrax Algebra is not a textbook! It does not contain any problem sets or exercises, it does not contain information on how to use a graphing calculator (found in many current textbooks), and it does not operate in a linear fashion like a typical algebra textbook. BrainTrax Algebra does have lengthy discussions on most topics covered in algebra. It has hundreds, if not thousands, of examples each with a detailed explanation of the solution. It has three levels of explanation for most concepts, suitable for the beginning algebra student through the more advanced students. In addition, it tries to connect algebra with the real world by illustrating how history has played an important role in the development of mathematics. Although we have tried to duplicate the many features of the Algebra Brain in this document, some features are only available online. These include the audio commentary for the Basic-level instruction, the evershifting matrix of the Brain itself, and the instantaneous transfer between levels of instruction. The BrainTrax website by itself has many features available to students and instructors alike. For students, we have sample exams, which are actual tests administered at UMR over the years. Each test also has a detailed solution provided. Students can work the tests, and then see how they did. Instructors can access a large number of documents, including all of the information here in BrainTrax Algebra, all in printerfriendly format. We feel that we have created a highly usable resource for instructors and students. Each concept is covered in-depth in language the student can understand, while training the student to think rigorously about mathematics. It is never enough to simply fill-in-the-blanks of a formula; we encourage the student to understand why he or she obtained the given answer, a critical component in problem solving. The BrainTrax Development Team has worked long and hard under the auspices of the University of Missouri – Rolla to provide superior mathematics instruction for the students of UMR. However, we feel it is in the best interests of UMR in particular and the rest of the state of Missouri in general to provide Preface – iii superior mathematical instruction to all students preparing to go on to college. If you would like more information about BrainTrax Algebra, the BrainTrax Development Team, or the Algebra Brain, please do not hesitate to contact us: Attn: BrainTrax Development UMR Computer and Information Services 104 Computer Science Building 1870 Miner Circle Rolla, MO 65409–1110 Phone: (573) 341–4841 Fax: (573) 341–4216 Email: [email protected] iv – Preface BrainTrax Algebra – Advanced Contents Features ix Multivariable Linear Systems 87 Unit 1 – Fundamentals of Algebra 1 Systems of Inequalities 93 Introduction to Fundamentals of Algebra 3 Unit 3 – Functions 97 Algebraic Expressions 5 Introduction to Functions 99 Basic Rules of Algebra 7 Function Terminology 103 Properties of Equality 9 Introduction to Polynomial Functions 105 Properties of Exponents 11 The Fundamental Theorem of Algebra 107 Properties of Fractions 15 Polynomial Operations 109 Properties of Inequalities 17 Adding / Subtracting Polynomials 111 Properties of Zero 19 Multiplying Polynomials 113 Cartesian Plane 21 Special Product Patterns 115 Radicals 27 Special Factoring Patterns 117 Properties of Radicals 29 Polynomial Division 121 Real Numbers 31 Synthetic Division 123 Absolute Value 33 Factor Theorem 125 Ordering Real Numbers 35 Remainder Theorem 127 Unit 2 – Linear Equations 39 Real Zeros 129 Introduction to Linear Equations 41 Descartes's Rule of Signs 133 Lines and Slope 45 Rational Zero Test 137 Parallel and Perpendicular Lines 51 Complex Numbers 143 Slope-Intercept Form of a Line 53 Rational Functions 149 Point-Slope Form of a Line 55 Asymptotes 151 Equations of Lines 59 Sketching Rational Functions 155 Linear Inequalities 61 Partial Fractions 161 Absolute Value Inequalities 63 Distinct Linear Factors 165 Polynomial Inequalities 65 Distinct Quadratic Factors 169 Rational Inequalities 69 Repeated Linear Factors 173 Linear Systems 71 Repeated Quadratic Factors 177 Substitution Method 73 Mixed Factors 181 Graphical Approach 77 Transcendental Functions 185 Graphical Interpretation of Solutions 79 Exponential Functions 187 Method of Elimination 83 Logarithmic Functions 191 Contents – v Properties of Logarithms 195 Unit 5 – Matrices and Determinants 297 Natural Base e 199 Matrices 299 Exponential / Logarithmic Equations 203 Matrix Operations 301 Exponential and Logarithmic Models 207 Identity Matrix 303 Exponential Growth and Decay 209 Matrix Addition 305 Gaussian Model 213 Scalar Multiplication 307 Logarithmic Model 215 Matrix Multiplication 309 Logistics Growth 219 Properties of Matrix Operations 313 Translations and Combinations 223 Linear Systems in a Matrix 317 Arithmetic Combinations 225 Elementary Row Operations 319 Composition of Functions 227 Gaussian Elimination 323 Reflections in Coordinate Axes 229 Gauss-Jordan Elimination 327 Vertical and Horizontal Shifts 231 Inverse of a Square Matrix 329 Inverse Functions 235 Finding Matrix Inverse 333 Finding the Inverse 239 Inverse of 2 x 2 Matrix 337 Horizontal Line Test 243 Determinants 341 Unit 4 – Quadratics 245 Properties of Determinants 347 Quadratics 247 Triangular Matrix 351 Quadratic Equations 249 Applications of Determinants 353 Factoring 253 Area of a Triangle 355 Extracting Square Roots 255 Cramer's Rule 359 Completing the Square 257 Test for Collinear Points 365 Quadratic Formula 259 Unit 6 – Sequences and Probability 369 Conic Sections 261 Sequences and Probability 371 Circles 263 Factorials 373 Parabolas 267 Counting Principle 375 Ellipses 271 Combinations 377 Hyperbolas 275 Permutations 379 Translations of Conics 281 Binomial Theorem 383 Circle Translation 283 Pascal's Triangle 387 Parabola Translation 285 Probability 389 Ellipse Translation 289 Independent Events 393 Hyperbola Translation 293 Union of Two Events 395 Complementary Events 397 vi – Contents Sequences 399 Arithmetic Sequence 403 Geometric Sequence 407 Summation Notation 413 Appendix 417 Appendix A – Algebra Alphabet 419 Glossary 421 Works Consulted 431 Index Contents – vii viii – Contents General Features: Three levels of instruction – In an effort to reach as wide an audience as possible, we include three levels of instruction: Basic, Intermediate and Advanced. Sometimes a student needs a little extra explanation, therefore we provide the student with the following:. Basic: Suitable for students of Algebra I and up, Basic level instruction is just that, a rudimentary explanation of the concept at its most fundamental level. We still assume that the user understands basic arithmetic, however, along with some fundamental algebraic concepts such as variables and expressions. Basic level instruction also serves as remediation for higher-level students who require a little more explanation on a given concept. Some concepts are not even covered at the Basic level in a normal algebra course, but whenever possible, we still try to include a brief, non-technical description of the concept as well as showing the student why the given concept is important in later mathematics courses. Intermediate: This level is most suitable for students of Algebra II and up. Advanced users who are struggling to understand the Advanced level of a concept may remediate to this level for a simpler, more detailed explanation of the given concept. Basic users who wish to increase their knowledge of a given concept may decide to learn at this level too, thus becoming even more proficient in algebra. Almost all concepts are covered at the Intermediate level; often with much more detail than at the Basic or even Advanced levels. Advanced: The highest level of instruction in the Algebra Brain. The Advanced users are typically freshman at UMR who require some additional instruction outside the classroom in order to fully understand a given concept. Other users include students in high school who are taking a collegiate-level algebra course. Users who are struggling with mathematical comprehension can remediate themselves down to the Intermediate level, if they so choose. Every concept in the Algebra Brain is covered at the Advanced level. Some concepts—in particular formula sheets— are covered exclusively at the Advanced level simply because no additional explanation is required for Basic and Intermediate users. Unit Organization – The information is structured into six units, roughly analogous to the algebra syllabus used on the UMR campus. Unit 1: Fundamentals of Algebra – This unit covers pre-algebra concepts such as real numbers, absolute value, the Cartesian plane, algebraic expressions, and properties of arithmetic. Unit 2: Linear Equations – Linear equations is the study of lines and slope. We introduce those concepts, along with linear inequalities and linear systems. Unit 3: Functions – Central to the study of algebra is the concept of a function. We cover numerous types of functions, including rational functions, polynomial functions, transcendental functions, and inverse functions. Unit 4: Quadratics – Although quadratics should technically fall under the category of functions, we assign the study of quadratic its own unit. In this unit we describe the various ways of solving a general quadratic equation, concluding with the Quadratic Formula. We also introduce the conic sections, a subset of quadratic equations. The conic sections include the circle, ellipse, parabola, and hyperbola. Unit 5: Matrices and Determinants – Matrices are often used in algebra to represent systems of linear equations. We demonstrate how to use a matrix to solve a system of linear equations. A determinant is a real number associated with a square matrix (and only a square matrix!). Using determinants, we can find the area of a triangle, solve a system of linear equations (via Cramer's Rule), and determine if three points are all on the same line. Unit 6: Sequences and Probability – Sequences are special functions in algebra. We cover arithmetic and geometric sequences, sums of sequences, and summation notation. Probability is the determination of the likelihood of an event given some constraints. We discuss the Binomial Theorem, permutations, combination, factorials, complementary events and independent events. Each unit is further subdivided into numerous concepts arranged in a sequential manner, although they do not appear sequentially in the Algebra Brain. Features – ix Concept Features: Each concept in BrainTrax Algebra includes the following features. Color coding – We have color coded BrainTrax Algebra to match the color codes we use in the online Algebra Brain. Each concept has a title and a level assigned to it at the top of the page. For instance, "Descartes's Rule of Signs" is green on the Intermediate level pages, and "Intermediate" (also in green) appears next to the title. The three colors used in the brain are: Basic – purple Intermediate – green Advanced – blue Intermediate Goal / Terms / MathTrax – On each page containing detailed information about a given concept, we have reproduced the information found in the online Algebra Brain in a colored box at the top of the concept. Goal – the learning objective for the concept Terms – a list of words and phrases, along with the definitions, the student is expected to learn in the given concept MathTrax – to add some historical perspective to the study of algebra, we include an interesting factoid about the development of mathematics over the course of several millennia To find the number of real (and imaginary) roots of a polynomial function. Descartes's Rule of Signs – a rule determining upper bounds to the number of positive zeros and to the number of negative zeros of a polynomial function. positive real zeros – positive real x-values for which the function equals zero. negative real zeros – negative real x-values for which the function equals zero. variation in sign – two consecutive coefficients in a polynomial function having opposite signs. In 1649, René Descartes (1590-1650) traveled to Sweden to tutor 20-year old Queen Christina. She preferred to start her days at 5 am. The combination of extremely cold winters and early rising resulted in Descartes contracting a fatal case of pneumonia. He died early spring of 1650. In 1666, his remains were exhumed and returned to France. The French ambassador received permission to remove Descartes's right forefinger from the corpse. Descartes's skull is said to have been removed by a guard and sold several times. Supposedly, Descartes's skull is currently on display at the Museé de l'Homme in the Palais de Chaillot in France. Thought Path – After the Goal / Terms/ MathTrax box, we include a thought path. This is the route the student would travel to find the page in the Algebra Brain. Following this path in the Algebra Brain will reinforce the relationships between algebraic concepts. In addition, the thoughts in the Algebra Brain contain more features than are found here, simply because the online experience offers much more in terms x – Features of interactivity. Basic users, for instance, can have the text read to them, along with descriptions of the graphics to help students understand what they see. Functions → Polynomial Functions → Real Zeros → Descartes's Rule of Signs To get to "Descartes's Rule of Signs" in the Algebra Brain, the user clicks on "Functions" followed by "Polynomial Functions" followed by "Real Zeros" and finally on "Descartes's Rule of Signs". Conceptual information – Following the thought path, the user enters the realm of conceptual information. Here we explain the concepts in detail, often with fully-worked proofs. We also provide the relevant formulas the student needs to understand the material. Much of the conceptual information is given in plain (often very plain) English with little mathematical terminology. Often the most obscure mathematical formulas are better understood if redefined in terms the user can understand. We also include historical information to help the student realize that mathematics is continually being refined over the centuries. However, we can use Descartes's Rule of Signs to find not only which roots are real and which roots are imaginary, but also the signs of those roots. Many problems in the real world only concern themselves with positive solutions. A good example is any projectile example involving time. Time is always a positive quantity (time moves forward, not backward). Thus, if we have a polynomial equation involving time that gives us negative roots, we can automatically disregard them because they do not make sense in the context of a time-based problem. We must note the qualifier that the roots given by Descartes's Rule of Signs are all real. According to the Fundamental Theorem of Algebra, the degree of the polynomial tells us the total number of roots. Therefore, if Descartes's Rule of Signs yields us three real roots (positive, negative, or both) for a 5th degree polynomial, then we must conclude that the remaining 2 roots are imaginary. Formulas / Definitions – Many ideas in algebra can be summed up in a simple formula or phrase. For instance, the Pythagorean Theorem is written as, "Given a right triangle with sides of lengths a, b, and c, where the side of length c is opposite the right angle, the square of c is the sum of the squares of a and b, or in other words, a2 + b2 = c2." Students are expected to not only memorize the formula, but also understand why it works, although sometimes that is well beyond the scope of algebra. Descartes's Rule of Signs Let f(x) = anxn + an – 1xn – 1 + . . . + a2x2 + a1x + a0 be a polynomial with real coefficients and a0 ≠ 0. 1. The number of positive real zeros of f is either equal to the number of variations in signs of f(x) or less than that number by an even integer. This means that for every change of sign (from +an to –an – 1 to + an – 2, etc), you will have a zero for each change in sign or two. If you do not have a zero for each change in sign, then you will have two less than the maximum number of zeros. If you do not have that many, then you will have four less than the maximum, etc… These are all for positive real zeros. 2. The number of negative real zeros of f is either equal to number of variations of sign of f(–x) or less than that number by an even integer. The same rule that applies to the positive real zeros applies to the negative zeros as well. Features – xi Notes – Many formulas and definitions are followed by notes. These are additional pieces of information the student needs to keep in mind when applying the formula or definition. Notes: Variation in sign means that two consecutive coefficients have opposite signs. Where there is only one variation in sign, Descartes’s Rule of Signs guarantees the existence of exactly one positive (or negative) real root. Descartes's Rule of Signs does not tell us what the zeros are. It simply states how many zeros we will find, as well as whether they will be positive or negative. We need to use other methods to actually find the zeros. Examples – Most pages contain at least one example problem after a formula or definition is given. This is so the student can see for him or herself how the formula or definition is applied. Each and every example problem is accompanied by a step-by-step explanation of how the problem is solved. Many example problems are real-world situations, sometimes with disastrous consequences if the problem is solved incorrectly! Besides simply showing the student how to work a problem, we also strive to teach the student how to answer the question that is asked. Oftentimes, the mathematics is only a small part of the problem. If we obtain two answers, one positive and one negative, which one do we use? Use Descartes's Rule of Signs to determine how many real solutions the following function has as well as if they are positive or negative. x3 – 2x2 – 5 = 0 given equation solution Our given equation has coefficients that are positive, negative, and negative again. This translates to only 1 variation in sign. Descartes's Rule of Signs tells us that the number of positive real solutions is equal to the number of changes in sign or less than that by an even number. Thus, we can have either 1 positive real root or 0 positive real roots (we can't have less than 0 roots. That just wouldn't make sense). In order to determine our negative real roots, we have to replace "x" with "–x" in the function and simplify, noting any variations in sign again: (–x)3 – 2(–x)2 – 5 = –x3 – 2x – 5 replace "x" with "–x" All the coefficients are negative. Thus, we conclude from Descartes's Rule of Signs that there are no negative real roots, since there are no variations in sign. Thus we have at most, 1 positive real root. answer Graphs / Pictures – Each graph and diagram in the Algebra Brain has been hand-crafted for maximum effect. Pictures really can be worth a thousand words if designed efficiently. We have strived to provide clear, concise images that relay as much information about a given situation as possible. Most diagrams include color to enhance the effect. For instance, indicates how we determine the number of changes of sign in a polynomial expression so that we can then apply Descartes's Rule of Signs. xii – Features Connections – Finally, we try to enhance the learning experience by providing connections between concepts. Many concepts have little or no meaning to the student now, but may become more and more important if the student wishes to learn higher mathematics. For instance, exponential functions may not seem to have any importance to a student learning algebra in 10th grade, but that same student will be very glad he or she learned about exponential functions in 10th grade when he or she studies calculus. Since the Algebra Brain was originally designed for algebra students on the UMR campus, we feel it is only fitting that we inform students that the concepts they learn today will be of great benefit to them when they study calculus later in their academic careers. Property 4 is somewhat misleading. When you study calculus, you will find that it is appropriate to say that (a / 0) "approaches" infinity. To see why, consider dividing a by a really small decimal number that is close to zero, such as 0.00001. Then you have: a 0.00001 a = −5 rewrite denominator in exponential form 10 = a(105) property of exponents = a(100,000) rewrite in standard notation Dividing a by a really small decimal number is the same as multiplying a by a really large number. Thus, as the denominator gets closer and closer to zero, the evaluation of the expression becomes larger and larger, eventually "reaching" infinity when the denominator equals zero. Features found only in the Algebra Brain: All of the features described above can be found in the online Algebra Brain—located at braintrax.umr.edu—but the following features are found only in the Algebra Brain, since they are unique to the web-based learning environment and cannot be represented in a paper-based model. The Brain – The online Algebra Brain contains a navigation feature known simply as "The Brain". This navigation tool allows students to quickly find the algebra concept they seek. Students need no more than a dozen clicks to go from any point in the Algebra Brain to any other point in the Algebra Brain. Interactive Example and Testing System (IETS) – Authorized students can access our database of interactive example problems for each concept. Instructors who contact us can request userids and passwords for their students. Students are able to test their knowledge of the various concepts and applications, obtaining instantaneous feedback, as well as detailed explanations when they fail to solve a problem correctly. Transfer between levels – Each concept in the Algebra Brain contains a menu with three buttons indicating "Basic", "Intermediate", or "Advanced" levels of knowledge. Students can click on one of the three buttons to be instantly transferred to the level of instruction they selected. Furthermore, from the BrainTrax homepage, they can choose a default level of instruction. Audio – Basic level pages in the Algebra Brain contain an audio icon that—when clicked—reads the information on the page to the student. Graphs and diagrams are explained to the student so they understand what the graph or diagram represents. "i" and "e" buttons – The "i" button on a page indicates that if a student clicks on the button, he or she will be transferred to another page on the same concept with more information about the concept. The "e" button is placed next to an example on a page and will transfer the student to another example of the same concept, though at a different instruction level. Mouseover definitions – Bolded terms on a web page in the Algebra Brain can be moused over to reveal the definition of the word or phrase. This provides the student with instantaneous Features – xiii information about the term he or she is unfamiliar with instead of having to refer to a glossary. In addition, all terms are located at the top of a given web page for easy access. Access 24/7 – The Algebra Brain is available to all Missouri K–12 students 24-hours a day, 7 days a week. To access the online Algebra Brain, simply enter "braintrax.umr.edu" (NO "www") into the address window of Internet Explorer 5.0+. Due to complex technical issues, the Algebra Brain can only be viewed on a PC-compatible machine running Internet Explorer 5.0 or higher. Once on our homepage, click on "Algebra" at the top of the page to enter the Algebra Brain. xiv – Features Unit 1 Fundamentals of Algebra Fundamentals of Algebra – 1 2 – Fundamentals of Algebra Before we can really get into algebra, we first need to guarantee that we actually understand the very foundations of algebra. Algebra is really only the manipulation of numbers, variable, or expressions using the basic operations of addition, subtraction, multiplication, and division. Of course, this is a very oversimplified explanation. We will see that this can become quite complicated at times. Algebraic Expressions An algebraic expression is a number, variable or combination of numbers and variables joined together by addition, subtraction, multiplication or division symbols. Do not mistake an expression for an equality. They are quite different. An equality is composed of two or more expressions separated by an equals sign "=". An expression does not contain an equals sign. Basic Rules of Algebra We also need to understand some of the most common properties that are shared by real numbers and variables. These include the Commutative Properties of Addition and Multiplication as well as the infamous Distributive Property. It is necessary to memorize these properties as they will make life much easier during later portions of the course. Radicals Radicals, or roots of numbers, are very common in algebra. They crop up in a wide variety of applications and sometimes the solution to a particular problem can only be expressed by a radical. The followers of the Greek mathematician Pythagoras originally discovered radicals, but they did not fully understand their implication, so they ignored them. Real Numbers Finally, we have real numbers. A real number is any number that can be expressed in decimal form. This includes all rational and irrational numbers. For the most part, the only numbers we will be dealing with in algebra are real numbers. It is still necessary to state when we are dealing with real numbers to avoid any ambiguity. There are other numbers called complex, or imaginary, numbers that we will deal with at a certain point in the Brain. Fundamentals of Algebra – 3 4 – Fundamentals of Algebra Advanced To learn the basic terms associated with algebraic expressions. variable – a symbol which can represent any one of a set of numbers or other objects (including other algebraic expressions). algebraic expression – a combining of numbers, variables and operations in a way that stands for a number. constant – a numerical symbol that never changes its values. term – a constant number of a variable or the product of numbers and variables. constant term – term in an expression that does not contain any variables. variable term – term in an expression that contains variables. coefficient – the numerical factor of a monomial expression. evaluate – to find the value of an expression for a given value. The Arabic word al-jabr, from which we get the word "algebra", is translated into English as "restoration" or "completion". It refers to the operation of moving known or unknown quantities of the same power to one side or the other of an equation so that neither side has a negative quantity. For example, x = 7 – 2x would be "restored" or "completed" to 3x = 7. Fundamentals of Algebra → Algebraic Expressions One of the most basic (and famous) characteristics of algebra is the use of letters to represent numbers. A letter is a variable because it can represent a variety of numbers. A combination of letters and numbers is called an algebraic expression. Here are a few examples: 3x + 2 4x2 2−x 7 5x Definition of an Algebraic Expression A collection of letters (variables) and real numbers (constants) combined using the operations of addition, subtraction, multiplication, division and exponentiation is called an algebraic expression. Note: An algebraic expression does not contain an equals sign (=). In algebra we use the equals sign (=) to equate two expressions together. An algebraic term is a part of an algebraic expression that is separated from the other parts by addition or subtraction. For example, in the expression: 3x2 + 4x – 1 we have three terms: 3x2 and 4x are variable terms and 1 is a constant term. The numerical factor in front of the variable is the coefficient of the variable term. The coefficient of 3x2 is 3. If there is no number in front of the variable, then it has an implied coefficient of 1. Thus, the coefficient of x3 is simply 1. Fundamentals of Algebra Algebraic Expressions – 5 Oftentimes, we are asked to evaluate a particular algebraic expression. This means substituting in numerical values into each of the variables and simplifying the resulting arithmetic expression. For example, 2x – 4 Value of Variable x=3 3x2 – x + 5 x = –1 Expression 6 – Algebraic Expressions Substitute Value of Expression 2(3) – 4 6–4=2 3(–1)2 – (–1) + 5 3+1+5=9 Fundamentals of Algebra Advanced To review the most elementary properties obeyed by real numbers, variables, and expressions. additive inverse – the opposite of a number. If the number a is negative, then the additive inverse of a is a positive number. multiplicative inverse – the reciprocal of a number. numerator – signifies the number of parts of the denominator that is taken. In a fraction, the numerator lies above the line representing division. denominator – lies below the line of division in a fraction. This is the divisor. If the numerator equals 1, then the denominator determines in how many parts the unit is to be divided. The French mathematician Nicolas Chuquet (ca. 1500) wrote Triparty en la science des nombres, in which he uses a form of exponent notation. Our expressions 3x2 and 4x3 were written as .3.2 and .4.3, respectively. Zero and negative exponents were also represented using a similar notation, so x0 would be written as .1.0 and 5x-2 as .5.2.m. Chuquet even used this notation to perform calculations such as .72.1 divided by .8.3 is .9.2.m. In modern terms, this is equivalent to 72x ÷ 8x3 = 9x-2. Fundamentals of Algebra → Basic Rules of Algebra The four fundamental operations of algebra are: • Addition + • Subtraction – • Multiplication x () · • Division ÷ / Of these four operations, the primary ones are addition and multiplication. Subtraction and division are inverses of addition and multiplication, respectively. We use subtraction to "undo" addition and division to "undo" multiplication. Consider that we can write subtraction as: Subtraction a – b = a + (–b) and division as: Division If b ≠ 0, then a÷b = a ⎛⎜ 1⎞ ⎟ ⎝b⎠ = a b Using these definitions, –b is the additive inverse (or opposite) of b. Similarly, 1 / b is the multiplicative inverse (or reciprocal) of b. When we write division in its fractional form, a is the numerator and b is the denominator. Note that we cannot let b equal 0 because then we would have division by zero, which is undefined. Fundamentals of Algebra Basic Rules of Algebra – 7 The following list of rules of algebra is true not only for real numbers, but for algebraic expressions and variables. Basic Rules of Algebra Let a, b, and c be real numbers, variables, or algebraic expressions. Property Commutative Property of Addition a+b=b+a Example 3x + 2x2 = 2x2 + 3x Commutative Property of Multiplication ab = ba (1 – x)(2x) = (2x)(1 – x) Associative Property of Addition (a + b) + c = a + (b + c) (x + 5) + 2 = x + (5 + 2) Associative Property of Multiplication (ab)c = a(bc) (2x • 4x2)(7) = (2x)(4x2 · 7) Distributive Properties: a(b + c) = ab + ac 3x(5 + x) = 3x · 5 + 3x · x (a + b)c = ac + bc (y + 2)y = y · y + 2 · y Additive Identity Property a+0=a 2y2 + 0 = 2y2 Multiplicative Identity Property a·1=a 3x · 1 = 3x Additive Inverse Property a + (–a) = 0 7x3 + (–7x3) = 0 Multiplicative Inverse Property a≠0 In addition to the above properties of operators, we also have the following properties of negation: Properties of Negation Let a and b be real numbers, variables, or algebraic expressions: Note: Property 1.) (–1)a = –a Example (–1)6 = –6 2.) –(–a) = a –(–5) = 5 3.) (–a)b = –(ab) = a(–b) (–4)2 = –(4 · 2) = 4(–2) 4.) (–a)(–b) = ab (–2)(–x) = 2x 5.) –(a + b) = (–a) + (–b) –(3 + x) = (–3) + (–x) = –3 – x Be sure that you understand the difference between the opposite of a number and a negative number. If a is already negative, then its opposite, –a, is positive. For example, if a = –3, then –a = –(–3) = 3. 8 – Basic Rules of Algebra Fundamentals of Algebra Advanced To review the basic properties of equality used in algebra. No definitions on this page. Robert Recorde (1510-1558) was the first to introduce the equals sign, "=", because nothing could be more equal than two parallel straight lines. The symbol is found in his book The Whetstone of Witte. Fundamentals of Algebra → Basic Rules of Algebra → Properties of Equality The following properties are used extensively throughout algebra to help us solve problems. In general, for both sides of an equation to be equal, the sum total of both sides must be equivalent. Furthermore, whenever we do something to one side of an equation (add, subtract, multiply, or divide by a quantity), then we have to perform the exact same procedure on the other side of the equation in order to maintain balance. Properties of Equality Let a, b, and c be real numbers, variables, or algebraic expressions. 1.) If a = b, then a + c = b + c add c to both sides 2.) If a = b, then ac = bc multiply both sides by c 3.) If a + c = b + c, then a = b subtract c from both sides 4.) If ac = bc and c ≠ 0, then a = b divide both sides by c Fundamentals of Algebra Properties of Equality – 9 10 – Properties of Equality Fundamentals of Algebra Advanced To review properties of exponents and their use in scientific notation. exponent – in a power, the number of times the base occurs as a factor. In the expression 23, 3 is the exponent and tells us that 2 occurs as a factor three times. base – the repeated factor in a power. In a2, a is the base. power – a product of equal factors. The repeated factor is the base. A positive exponent tells the number of times the base occurs as a factor. Nicolas Chuquet (c.1455-c.1500) anticipated Napier's logarithms by almost 100 years. In his analysis of the powers of 2, he makes it very clear that if x = 32 = 25, then, in hindsight, 5 = log2 x. Chuquet unfortunately failed to realize the connection so we had to wait a century until Napier could complete Chuquet's thought. Fundamentals of Algebra → Basic Rules of Algebra → Properties of Exponents An exponent is nothing more than a shorthand form of multiplication. repeated multiplication x·x·x·x exponential form x4 (2)(2)(2)(2)(2) (2)5 (3x)(3x)(3x) (3x)3 In more general terms, if a is a real number and n is a positive integer than an = a·a·a·a·...·a n factors where n is the exponent and a is the base. The expression an is read as "a raised to the nth power." Exponents obey a number of useful properties that we can exploit when working with algebraic equations. They also have a number of applications when working with exponential and logarithmic equations. Fundamentals of Algebra Properties of Exponents – 11 Properties of Exponents Let a and b be real numbers, variables, or algebraic expressions. Let m and n be integers (positive or negative, it makes no difference). All denominators and bases are nonzero. 1.) Property a ma n = a m + n 2.) 3.) = a–n am – n = x5 – 2 = t–3 = = = a0 = 1, a ≠ 0 (x + 5z)0 = 1 5.) (ab)m = ambm (5y)2 = 52y2 = 25y2 6.) (am)n = amn 8.) (y3)–4 y(3)(–4) = = = | a2 | = | a |2 = a2 x3 = 4.) 7.) Note: Example 23 · 24 = 23 + 4 = 27 = 128 = y–12 = = |(–4)|2 = | –4 |2 = (4)2 = 16 It is vitally important to understand the difference between (–2)4 and –24. In (–2)4, the parentheses indicate the exponent applies to everything inside the parentheses, not just the 2. However, in –24, the exponent only applies to the 2. Hence, (–2)4 = 16, whereas –24 = –16. Scientific Notation Oftentimes it is extremely inefficient and time consuming to write down very large numbers. For instance, one famous number is Avogadro's number, which is: 602,200,000,000,000,000,000,000 Avogadro's number This would become extremely tedious to write down every time we needed to use it, so scientists and mathematicians rely on a short hand method of writing really large and really small numbers. This shortcut is called scientific notation. Using scientific notation, Avogadro's number is reduced to: 602,200,000,000,000,000,000,000 = 6.022 x 1023 Avogadro's number in scientific notation Now, which number is more compact? I think the choice is obvious. We can also use scientific notation for really small numbers. For instance, the mass (in grams) of one proton is approximately: 1. 67 x 10–24 = 0.000000000000000000000000167 mass of proton in grams Again, it would be really quite tedious to write out the mass of a proton in standard decimal form. In general, scientific notation takes the form: ±c x 10n 1 ≤ c < 10, n is an integer scientific notation A positive exponent indicates that the absolute value of the number is large (sometimes very, very large). A negative exponent indicates that the absolute value of the number is less than 1. 12 – Properties of Exponents Fundamentals of Algebra Some more useful large and small numbers, both in normal and scientific notation: speed of light gravitational constant Planck constant mass of the sun distance to the nearest star distance to the Andromeda galaxy scientific notation 3 x 108 m/s –11 6.67 x 10 Nm2/kg2 6.63 x 10–34 Js 1.99 x 1030 kg 4.04 x 1016 m 2.1 x 1022 m normal decimal notation 300,000,000 m/s 0.0000000000667 Nm2/kg2 0.000000000000000000000000000000000663 Js 1,990,000,000,000,000,000,000,000,000,000 kg 40,400,000,000,000,000 m 21,000,000,000,000,000,000,000 m After comparing the two notations, which would you rather use? Fundamentals of Algebra Properties of Exponents – 13 14 – Properties of Fractions Fundamentals of Algebra Advanced To review the basic properties of fractions. No definitions on this page. Simon Stevin (1548-1620), a Dutch mathematician, physicist, and engineer, is credited with explaining the concept of decimal fractions. At the time, it was unfamiliar to most mathematicians. He didn't actually view decimal fractions as fractions and wrote the numerators without the denominators. The number π was written as 301 4 1 6 . . . The circled digit of each pair after the zero (which stands for the decimal point) represents the power of 10 in the denominator: 1 = 1/10, 4 = 4 / 100 = 4 / 102. Fundamentals of Algebra → Basic Rules of Algebra → Properties of Fractions Fractions serve a number of uses in algebra. For one thing, they indicate a small portion of the whole, such as 1 /2. They can also indicate a ratio such as 3:2 = 3/2. Finally, they can be used to represent division of two numbers as in 7 ÷ 5 = 7/5. The following properties apply to all fractions, everywhere, regardless of the purpose they are trying to convey: Fundamentals of Algebra Properties of Fractions – 15 Properties of Fractions Let a, b, c, and d be real numbers, variables, or algebraic expressions such that b ≠ 0 and d ≠ 0. 1.) Equivalent Fractions = 2.) if and only if ad = bc Rules of Signs = 3.) = = Generate Equivalent Fractions = 4.) and c≠0 Add or Subtract with Like Denominators = 5.) Add or Subtract with Unlike Denominators = 6.) Multiply Fractions = 7.) Divide Fractions = 16 – Properties of Fractions = c≠0 Fundamentals of Algebra Advanced To review the properties of inequalities. No definitions on this page. Charles Babbage (1792-1871) is considered to be the father of modern computers. He is most famous for his design of the Difference Engine, which would have greatly speeded arithmetic calculations. Unfortunately, Babbage was never able to obtain the funding to build a full-scale production model. He attempted to get funding for a smaller-scale, more efficient version and was again turned down. Finally, nearly a century and a half later, the Science Museum of London constructed Babbage's second Difference Engine. It weighed three tons and worked flawlessly. Fundamentals of Algebra → Basic Rules of Algebra → Properties of Inequality When we solve linear inequalities in one variable, we use procedures remarkably similar to those used to solve liner equations. That is, we try to isolate the variable on one side of the inequality symbol (<, >, ≤, ≥). However, there are two very important considerations we have to keep in mind when we work with inequalities. When we multiply or divide both sides of an inequality by a negative number, we reverse the direction of the inequality. Here is an example: –2 < 4 original inequality (–3)(–2) > 4(–3) multiply both sides by –3 reverse inequality symbol 6 > –12 simplify both sides As we can see, when we multiplied both sides of the inequality by –3, the left hand side suddenly became a positive number and the right hand side suddenly became a negative number. Since positive numbers are always greater than negative numbers, we had to reverse the inequality symbol. If two inequalities have the same solution set, then they are equivalent. For instance, x+3 < 6 and x < 3 are equivalent because values of x less than 3 will satisfy both inequalities. Here is a list of the operations that we can use to create equivalent inequalities: Fundamentals of Algebra Properties of Inequalities – 17 Properties of Inequalities Let a, b, c, and d be real numbers. 1.) Transitive Property a<b 2.) Note: implies a<c and c<d implies a+c<b+d Addition of a Constant a<b 4.) b<c Addition of Inequalities a<b 3.) and implies a+c<b+c Multiplication by a Constant for c > 0, a < b implies ac < bc for c < 0, a < b implies ac > bc Each of the properties given above is true if we replace the symbol "<" with "≤" and if we replace the symbol ">" with "≥". For instance, the multiplication property above could just as easily be written as: for c > 0, a < b implies ac ≤ bc for c < 0, a < b implies ac ≥ bc 18 – Properties of Inequalities Fundamentals of Algebra Advanced To examine why the number zero plays such an important part in mathematics and to learn about some of its basic properties. inclusive or – in the statement a or b, the answer may be a, it may be b, and the answer could also be both a and b. While we in the United States are used to using a period (.) as a decimal indicator, other countries, such as those in Continental Europe, use a comma (,) instead. Fundamentals of Algebra → Basic Rules of Algebra → Properties of Zero Zero is by far one of the strangest numbers in all of existence. It is both nothing and everything. Since it is so unique, it should come as no surprise that it exhibits a number of unique characteristics when combined with other real numbers in algebraic expressions. Properties of Zero Let a and b be real numbers, variables, or algebraic expressions. 1.) a + 0 = a and a – 0 = a 2.) a·0=0 3.) Note: a≠0 4.) (a / 0) is undefined 5.) Zero Factor Property: If ab = 0, then a = 0 or b = 0 The "or" in the Zero Factor Property includes the possibility that either or both factors may be zero. This is an inclusive or and it is the way "or" is generally used in mathematics. Property 4 is somewhat misleading. When you study calculus, you will find that it is appropriate to say that (a / 0) "approaches" infinity. To see why, consider dividing a by a really small decimal number that is close to zero, such as 0.00001. Then you have: a 0.00001 = a −5 10 rewrite denominator in exponential form = a(105) property of exponents = a(100,000) rewrite in standard notation Dividing a by a really small decimal number close to zero is the same as multiplying a by a really large number. Thus, as the denominator gets closer and closer to zero, the evaluation of the expression becomes larger and larger, eventually "reaching" infinity when the denominator equals zero. Fundamentals of Algebra Properties of Zero – 19 20 – Properties of Zero Fundamentals of Algebra Advanced To plot points in the Cartesian plane. To find the distance between two points in the Cartesian plane. To find the midpoint of a line segment connecting two points in the Cartesian plane. rectangular coordinate system - a means of plotting points using two perpendicular axes, one designated as x and the other designated as y. Each point is assigned an (x, y) coordinate that determines its location relative to a central point in the coordinate system. Cartesian plane - the more common term for the rectangular coordinate system. It is named after Rene Descartes (1596-1650). x-axis - the horizontal real number line in the Cartesian plane. y-axis - the vertical real number line in the Cartesian plane. origin - the point of intersection of the two axes, designated with the ordered pair (0, 0) and labeled as O. quadrant - one of four equal pieces of the Cartesian plane, formed by the perpendicular intersection of the axes. coordinate - the general term for either of the two numbers in the ordered pair (x, y). abscissa - the specific term for the x-coordinate in the ordered pair (x, y). ordinate - the specific term for the y-coordinate in the ordered pair (x, y). Descartes (1596-1650), like many mathematicians of his day, also delved extensively into the realm of natural philosophy (what we call science today). He postulated that there are ten natural laws to the universe, the sun is in the middle of the solar system and the planets create small whirlpools or vortices in the medium of the universe. These vortices affect all other nearby objects. His theories have long since been disproved by Isaac Newton (1642-1727) and others, but he did lay some of the groundwork for others who followed him. Interestingly, the first two of his natural laws are nearly identical to Newton's first two laws of motion. Fundamentals of Algebra → Cartesian Plane We often use a real number line to indicate how two real numbers relate to each other. For instance, the numbers x = 4 and x = –1 can be indicated on the following number line: This is fine if we are only dealing with one variable. But oftentimes we are forced by circumstances to deal with two variables, one of which is dependent on the other. The most familiar two variables are x and y. The physical representation of an ordered pair of x and y variables is a two-dimensional model often called the rectangular coordinate system or the Cartesian plane. The Cartesian Plane is named after René Descartes (1596–1650), who is responsible for a large portion of analytic geometry and is also credited with the existential phrase, "I think, therefore I am" (cogito ergo sum in Latin). Fundamentals of Algebra Cartesian Plane – 21 The Cartesian plane The horizontal real line is called the x-axis. The vertical real line is called the y-axis. The point of intersection of the two axes is called the origin. The two axes divide the plane into four equal parts called quadrants. The first quadrant is always the top right portion of the graph above the x-axis and to the right of the y-axis. The order of the remaining quadrants goes around in a counterclockwise direction. That is, the second quadrant is above the x-axis and to the left of the y-axis; the third quadrant is below the x-axis and to the left of the y-axis; the fourth quadrant is below the x-axis and to the right of the y-axis. Each point in the Cartesian plane can be identified by a unique ordered pair (x, y) of real numbers x and y. These are called the coordinates of the point. The number x represents the directed distance from the y-axis to the point. The number y represents the directed distance from the x-axis to the point. We call the first coordinate of the ordered pair the x-coordinate, or abscissa. The second coordinate of the ordered pair is called the y-coordinate, or ordinate. In practice, we usually just refer to them as x- and y-coordinates, seldom calling them the abscissa and ordinate. Note: Although we use an ordered pair (a, b) to represent an open interval on the real number line, this should not be confused with the ordered pair (a, b) representing a point in the Cartesian plane. The context of the given problem should give us enough clues to determine if we are discussing an open interval or a point in the plane. Two formulas are extremely useful when discussing points in the Cartesian plane. They are the Distance Formula and the Midpoint Formula: The Distance Formula The distance d between the point (x1, y1) and (x2, y2) in the plane is given by d = The Distance Formula is derived from the Pythagorean Theorem. Recall that the Pythagorean Theorem states that for any right triangle, the square of the length of the hypotenuse is equal to the sum of the squares of the 22 – Cartesian Plane Fundamentals of Algebra lengths of the two sides. That is, if the right triangle has hypotenuse of length c and sides of length a and b, then c2 = a2 + b2. Suppose we want to find the distance d between two points (x1, y1) and (x2, y2) in the plane. With these two points, we can construct a right triangle, as shown below: The length of the vertical side is the absolute value of the difference between the y-coordinates: | y2 – y1 |. Similarly, the length of the horizontal side is the absolute value of the difference of the x-coordinates: | x2 – x1 |. When we apply the Pythagorean Theorem, we obtain: d2 = d = | x2 – x1 |2 + | y2 – y1 |2 ( x2 − x1 )2 + ( y2 − y1 Pythagorean Theorem )2 take square roots of both sides We can remove the absolute value operators because we are squaring numbers and that always leads to a positive result. Thus, we can simplify our equation to: d = ( x 2 − x 1) 2 + ( y 2 − y 1) 2 remove absolute value operators Distance Formula We choose the positive square root for d because the distance between two points in the plane is not a directed distance. We have just established the Distance Formula. Given the following ordered pairs (2, 1) and (4, 5) a.) Plot the points represented by these ordered pairs. b.) Find the distance between the two points. solution a.) The first ordered pair can be represented by plotting a point on the Cartesian plane where x = 2 and y = 1. The second ordered pair can be represented by a point at x = 4 and y = 5, as shown below: Fundamentals of Algebra Cartesian Plane – 23 answer b.) The distance between the two points is found by applying the Distance Formula, with (x1, y1) = (2, 1) and (x2, y2) = (4, 5): d d = ( x 2 − x 1) 2 + ( y 2 − y 1) 2 = ( 4 − 2) + ( 5 − 1) = ( 2) + ( 4) = 4 + 16 simplify = 20 simplify under radical operator = 2 5 2 2 2 2 Distance Formula substitute in x2 = 4, x1 = 2, y2 = 5 and y1 = 1 simplify inside parentheses simplify radical answer The other useful formula is the Midpoint Formula and is useful for finding the midpoint of a line segment in the plane. The Midpoint Formula The midpoint of the line segment joining the points (x1, y1) and (x2, y2) is given by Find the midpoint of the line segment connecting the points (2, 1) and (4, 5). Plot this point on the same graph we used in the previous example. solution The Midpoint Formula is pretty straightforwa rd. We simply substitute in (x1, y1) = (2, 1) and (x2, y2) = (4, 5). 24 – Cartesian Plane Fundamentals of Algebra Midpoint = ⎛ x1 + x2 , y1 + y2 ⎞ ⎜ ⎟ 2 ⎝ 2 ⎠ Midpoint Formula = ⎛ 2 + 4, 1 + 5⎞ ⎜ 2 ⎟ 2 ⎠ ⎝ substitute in x1 = 2, x2 = 4, y1 = 1 and y2 = 5 = ⎛ 6, 6⎞ ⎜ 2 2⎟ ⎝ ⎠ simplify numerators = (3, 3) simplify fractions answer This point can be represented as follows: answer Fundamentals of Algebra Cartesian Plane – 25 26 – Cartesian Plane Fundamentals of Algebra Advanced To recognize that we can write radicals in exponential notation and to review normal radical notation. square root – a square root of a number b is the solution of the equation x2 = b. cube root – a cube root of a number c is the solution of the equation x3 = c. principle nth root of a – the nth root that has the same sign as a. radical symbol – an operator symbol that indicates we are taking the nth root of a real number, variable, or algebraic expression. index (radical) – a real number placed above and to the left of a radical symbol to indicate what root is sought. radicand – the quantity under a radical symbol. A fragment of a table of square roots was unearthed in Nippur, approximately 100 miles southeast of Baghdad, Iraq. It was written by the ancient Mesopotamians and dates back to 1800 B.C.E. It is currently located in the Babylonian section of the University of Pennsylvania. Fundamentals of Algebra → Radicals Every once in a while, we are asked to find the "root" of a number, whether it’s the square root or cube root or some other root. These kinds of numbers are called radicals and are denoted with the symbol "√". In general, the root of a number a is another number, which, when multiplied by itself n times, will produce our original number a. For example, the third root of 8 is 2 because if we multiply 2 by itself 3 times, we end up with 8, i.e. 2 x 2 x 2 = 8 so 3√8 = 2. Definition of the nth Root of a Number Let a and b be real numbers and let n ≥ 2 be a positive integer. If a = bn then b is an nth root of a. If n = 2, the root is a square root. If n = 3, the root is a cube root. Many numbers have more than one nth root. For example, both –4 and 4 are square roots of 16. The principle nth root of a number is defined as follows: Definition of Principle nth Root of a Number Let a be a real number that has at least one nth root. The principle nth root of a is the nth root that has the same sign as a. It is denoted by a radical symbol: principle nth root of a The positive integer n is the index of the radical, and the number a is the radicand. If n = 2, we omit the index n and simply write √a rather than 2√a. Fundamentals of Algebra Radicals – 27 In looking at the nth root of various real numbers, we can arrive at the following conclusions: 1. If a is a positive real number and n is a positive even integer, then a has exactly two real nth roots denoted by n√a and –n√a. 2. If a is any real number and n is an odd integer, then a has only one real nth root denoted by n√a. 3. If a is a negative real number and n is an even integer, then a has no real nth root. (It has a complex root instead.) 4. n 28 – Radicals √0 = 0 Fundamentals of Algebra Advanced To review basic properties of radical numbers. No definitions on this page. In the 6th century B.C.E., the Greek mathematicians, following Pythagoras, considered irrational numbers an inexplicable error on the part of the Supreme Architect. Thus, they had to be kept secret, otherwise one might incur divine wrath. Fundamentals of Algebra → Radicals → Properties of Radicals Radicals are essentially exponents in disguise. Thus, they obey many of the same properties obeyed by conventional integer exponents. Properties of Radicals Let a and b be real numbers, variables, or algebraic expressions such that the indicated roots are real numbers. Let m and n be positive integers. Property 1.) Example = 2.) = 4.) = = = 5 For n even = |a| = | –6 | For n odd = a = –6 Fundamentals of Algebra 4 = = a = = = = 5.) (2)2 = = = 3.) 6.) = = 6 Properties of Radicals – 29 30 – Properties of Radicals Fundamentals of Algebra Advanced To review the concept of real numbers and how they include other sets of numbers such as the rational and irrational numbers. real number – a rational or an irrational number, which can be represented by a finite or infinite decimal. rational number – a number that is the ratio, or quotient, of two integers. irrational number – any number that cannot be expressed as a terminating or repeating decimal. integers – the set of whole numbers and their negative counterparts. whole numbers – all positive integers and zero. natural numbers – all positive integers. real number line – a line with numbers evenly spaced throughout the line. origin – the point 0 on the real number line. one-to-one correspondence – every point on the number line corresponds to exactly one real number. In the first half of the fifteenth century (ca. 1450 B.C.E), the Persian mathematician Ghiyat ad din Ghamshid ibn Mas'ud al-Kashi did important work on algebra, especially concerning the binomial theorem, decimal fractions, exponential powers of whole numbers, and the set of irrational numbers. Fundamentals of Algebra → Real Numbers For most applications in the real world, we use a concept known as real numbers. When you count your pocket change, wonder how many more minutes until lunch, or decide to invest in a lucrative stock, you are using real numbers. Real numbers come in a wide variety of symbols, such as: 9, –3, 5/2, 0.66666666…, 34.32, √5, π, and e. We can separate out different groups of the set of real numbers into smaller subsets. The two largest subsets of real numbers are the rational and irrational numbers. A rational number is any number that can be expressed as the ratio of two integers p/q where q ≠ 0. For example, the numbers: 0.25 = 0.34343434…. = 1 4 rational numbers 34 99 are both rational numbers. The decimal representation of a rational number either repeats (as in 2.734734734….) or terminates (as in 0.4) If a real number cannot be written as the ratio of two integers, then it is an irrational number. Irrational numbers have infinite non-repeating decimal representations. Some of the most famous irrational numbers include: Fundamentals of Algebra Real Numbers – 31 2 ≈ 1.4142135623731 .π ≈ 3.14159265358979 e ≈ 2.71828182845905 irrational numbers The symbol "≈" means "approximately to". We can never actually write down the exact value of π because as far as we have been able to determine, π has no end. Besides rational and irrational numbers, we can further subdivide the rational numbers into integers and nonintegers. An integer is basically a rational number whose denominator is "1". This include both positive and negative numbers, such as {…–2, –1, 0, 1, 2…}. The set of integers can also be subdivided into whole numbers (which includes both positive integers and 0) and natural numbers (which includes positive integers, but does not include zero!). If we want to actually "look" at a real number, we plot the number on a real number line. The point 0 on the number line is called the origin, as all other numbers originate from this central point. By convention, numbers to the right of 0 are positive numbers and numbers to the left of zero are negative, as shown below: Every real number corresponds to exactly one point on the real number line. Every point on the number line corresponds to exactly one real number. This relationship is called one-to-one correspondence. 32 – Real Numbers Fundamentals of Algebra Advanced To review the concept of absolute value and a few of the properties of the absolute value function absolute value – the absolute value of a number is its distance from 0 on a number line. distance – a measure of how far apart two things are from each other. The ancient Mayans used more than one number system. The "ordinary" system was a base 20 system and contained a true "0". The "learned" system was used exclusively by the priests in astronomical calculations as a means of reckoning time. Fundamentals of Algebra → Real Numbers → Absolute Value One way to define the absolute value of a real number is its distance from the origin. For example, the number –4 is located 4 units to the left of the origin, so its absolute value is 4. We disregard the negative sign when we are talking about absolute value, since a magnitude (distance) is always positive. The negative (or positive) sign only indicates the number's direction with respect to the origin. A formal definition of absolute value is as follows: Definition of Absolute Value If a is a real number, then the absolute value of a is: |a| = a –a if a ≥ 0 if a < 0 From this definition, we can see that the absolute value of a real number is never negative. For example, if a = –3, then | –3 | = –(–3) = 3 > 0. Like many operations, the absolute value operator has a number of properties associated with it that follow from the definition. Properties of Absolute Value If a and b are any two real numbers, then the following properties are true: 1. |a| ≥ 0 2. | –a | = |a| 3. | ab | = |a||b| 4. Fundamentals of Algebra = b≠0 Absolute Value – 33 Finally, we can use the absolute value operator to define the distance between two points on the real number line. For example, the distance between –3 and 5 is: | –3 – 5 | = | –8 | = 8 as shown below: Distance Between Two Points on the Real Number Line Let a and b be any two real numbers. The distance between a and b is given by: d(a, b) 34 – Absolute Value = |b–a| = |a–b| Fundamentals of Algebra Advanced To review how real numbers relate to one another. less than - one value is smaller than a compared value. greater than - one value is larger than a compared value. greater than or equal to - one value is larger than or the same as a compared value. less than or equal to - one value is smaller than or the same as a compared value. interval - any subset of real numbers. bounded - any interval which has a finite upper and lower boundary. unbounded - an interval that has infinity as one or both of its endpoints. Although we call our numerals "Arabic numerals", the Arabs actually borrowed them from the Hindu number system. The original Arabic numerals were much more similar in nature to the Greek and Roman systems, which used letters to represent numerals and had no concept of place-value (where the position of a number indicates its value). Fundamentals of Algebra → Real Numbers → Ordering Real Numbers When we talk about the relationships between real numbers, we usually like to talk about one number being "larger" or "smaller" than another number. In a sense, we determine the order of the numbers. 3, for example, is a larger quantity than 2, so we say that 3 is "greater than" 2 and denote this using a mathematical symbol: 3 > 2. This can be extended to other types of relationships, such as "less than", "greater than or equal to", and "less than or equal to". Definition of Order on the Real Number Line If a and b are real numbers, a is less than b if b – a is positive. This order is denoted by the inequality: a < b a is less than b This can also be described by saying that b is greater than a and is denoted by the inequality b > a b is greater than a a b is greater than or equal to a The inequality b ≥ means that b is greater than or equal to a, while the inequality a ≤ b a is less than or equal to b means that a is less than or equal to b. The symbols <, >, ≤, ≥ are inequality symbols. If we want to look at this definition geometrically, this implies that a < b only if a is to the left of b on the real number line. Then the next statement is obviously that b is greater than a if b is to the right of a on the real number line, as shown below: Fundamentals of Algebra Ordering Real Numbers – 35 Interval Notation We can use the power of inequalities to describe subsets of real numbers called intervals. For example, if we wanted to look at all the numbers between 4 and 5 (but not including 4 and 5), then we would write this as the interval (4, 5). The parentheses tell us that the points 4 and 5 are not included in our interval. If, on the other hand, we really did want to have 4 and 5 in our interval, then we would write this with straight brackets instead of parentheses [4, 5]. Of course, we could also include 4 and exclude 5 or vice versa. If we include one point but not the other, we write one side with a parentheses and the other side with a straight bracket: (4, 5] or [4, 5). This can easily be generalized to any two real numbers (including negative real numbers). All of these intervals are called bounded intervals because they involve an upper and lower boundary. This is summarized below: Bounded Intervals on the Real Number Line Notation Interval Type Inequality [a, b] Closed a≤x≤b (a, b) Open a<x<b (a, b] Half-open a<x≤b [a, b) Half-open a≤x<b Graph Besides bounded intervals, we can talk about the notion of unbounded intervals. This is an interval that essentially has infinity as one of its endpoints. For example, the interval described by "all real numbers greater than 7" means that 7 is the lower bound of the interval and there is no real upper bound of the interval. This can be written as (7, ), where the symbol for infinity, ∞, indicates that this interval is unbounded. Note that we always close of the unbounded side of an interval with a parenthesis instead of with a square bracket. You cannot write (7, ], because infinity is not bounded. 36 – Ordering Real Numbers Fundamentals of Algebra This is summarized below: Unbounded Intervals on the Real Number Line Notation Interval Type Inequality [a, ) Half-open x≥a (a, ) Open x>a (– , b] Half-open x≥b (– , b) Open x<b (– , ) Graph Entire real line The Law of Trichotomy states that for any two real numbers a and b, precisely one of the following relationships is possible: a a a = or < or > Fundamentals of Algebra b b Law of Trichotomy b Ordering Real Numbers – 37 38 – Ordering Real Numbers Fundamentals of Algebra ` Unit 2 Linear Equations Linear Equations – 39 40 – Linear Equations Advanced To learn the basics of linear equations and how to solve them solve – to find the result by the use of certain given data, previously known facts or methods, and newly observed relations. solution – a replacement of the variable(s) in an algebraic sentence that makes the sentence true. identity – an equation whose sides are equivalent expressions. conditional equation – an equation that is true for only a few members in the domain. linear equation – an equation in which each term is either a constant or a monomial of degree 1. equivalent equations – equations having the same solution set over a given domain. Aryabhata the Elder (c.476-550) was a Hindu astronomer and mathematician. He was often correct about the solution to one particular problem, but incorrect about the solution to a related problem. For instance, in his poem Garite (Hindu mathematics was expressed in poetic form), he correctly states the formulae for the areas of a triangle and a circle. However, he fails to correctly extrapolate the formulae to find the volume of threedimensional shapes. An Arab commentator, al-Biruni, described all of Hindu mathematics as a mixture of "common pebbles and costly crystals." Linear Equations An equation is a statement that two algebraic expressions are equal to each other. Examples include: 3x – 4 = 5, x2 – 4 = 0, and 3 – 2x = x2. If we wish to solve an equation, we want to find all values of x that will satisfy the equation or in other words make it true. If we let x = 3, then 3x – 4 = 5 can also be written as 3(3) – 4 = 5. If we add 4 to both sides of the equation, we end up with 3(3) = 9, which is a true statement. Therefore we say that x = 3 is the solution to the equation 3x – 4 = 5. An equation that is true for every value of x in the domain of the variable is called an identity. For example, x2 – 4 = (x – 2)(x + 2) identity is an identity because it is true for any real value of x. We are restricting the domain of x to real numbers only, excluding complex numbers. An equation that is true for only a few of the numbers in the domain of x is called a conditional equation . For example, 0 = (x + 3)(x – 1) conditional equation is a conditional equation because it is only true for x = –3 and x = 1. Learning how to solve conditional equations is one of the primary purposes of algebra. Linear Equations in One Variable The focus of this section of the Algebra Brain is in solving Linear Equations. A linear equation has exactly one solution. Period. In general, a linear equation in one variable is defined as: Linear Equations Introduction to Linear Equations – 41 Definition of Linear Equation A linear equation in one variable x is an equation that can be written in the standard form ax + b = 0 linear equation where a and b are real numbers and a ≠ 0. To solve a conditional equation in x (such as a linear equation), the idea is to isolate x on one side of the equation and group all the real numbers on the other side. Then we simplify the equation by adding, subtracting, multiplying or dividing the real numbers. When all we have is an x on one side and a single real number on the other side, the equation has been solved for x. This process takes the form of a number of steps. We transform the original linear equation that we are given into a simpler, equivalent equation, which has the same solution as our original equation. Generating Equivalent Equations An equation can be transformed into an equivalent equation by one or more of the following steps: Guideline Given Equation Equivalent Equation 1. Remove symbols of grouping, combine like terms, or reduce fractions on one or both sides of the equation 3x – 2x = 5 x=5 2. Add (or subtract) the same quantity to (or from) both sides of the equation 2x + 7 = 4 2x = –3 3. Multiply (or divide) both sides of the equation by the same nonzero quantity. 4x = 8 x=2 4. Interchange the two sides of the equation 3=x x=3 Sometimes we find out that a linear equation has no solution because we end up with a contradictory statement, such as 0 = 4 or 3 = 12. An example of a contradictory equation is x = x + 1. If we solve it for x, we end up with 0 = 1, which cannot possibly be true. Solve the equation and check the solution. 7 – 2x = 15 solution In order to isolate the x-term, we need to get rid of the 7. We subtract 7 from both sides. Then, in order to get rid of the –2 in front of the x, we divide both sides by –2. It is important that we do this in the stated order; otherwise we will get the wrong answer. 7 – 2x = 15 given equation 7 – 2x – 7 = 15 – 7 subtract 7 from both sides –2x = 8 simplify −2x = −2 x = 8 −2 –4 42 – Introduction to Linear Equations divide both sides by –2 simplify answer Linear Equations Now we need to check our answer. We substitute in our answer into the original equation: 7 – 2x = 15 original equation 7 – 2(–4) = 15 substitute in x = –4 7+8 = 15 simplify 15 = 15 check Since 15 = 15 is a true statement, we conclude that x = –4 is indeed the solution to this equation. Let's look at another, more complicated example. Solve the equation (if possible) and check the solution. 8(x + 2) – 3(2x + 1) = 2(x + 5) solution The first thing we need to do is apply the Distributive Property to get rid of all the parentheses. They only get in the way right now. 8(x + 2) – 3(2x + 1) = 2(x + 5) given equation 8x + 16 – 6x – 3 = 2x + 10 Distributive Property (note that the negative sign gets carried inside the parentheses) Next, we simplify the equation, attempting to get all the real numbers on one side, and the x-term isolated by itself on the other side: 8x + 16 – 6x – 3 = 2x + 10 result from above 2x + 13 = 2x + 10 simplify by combining like terms 13 = 10 subtract 2x from both sides Whoops! What happened here? We ended up with a statement that is clearly not true. How do we interpret this? Graphically, we can interpret the left and right hand sides of the equation as parallel lines. On the left hand side, we have y1 = 2x + 13. On the right hand side, we have y2 = 2x + 10. This means that both equations have the same slope, but different y-intercepts. They will never intersect, thus they have no common points. In other words, our given equation has no real solution. answer Equations Involving Fractional Expressions Another common type of equation is when a linear expression is in the denominator of a fraction. An example of this would be: 1 x + 2 x−5 = 0 equation with a linear expression as a denominator We solve these by finding the Least Common Denominator (LCD). We then multiply every term by the LCD. Then we solve the resulting equation just as we did before. Linear Equations Introduction to Linear Equations – 43 Continuing with our example above, the LCD is x(x – 5). We multiply both terms by the LCD: ⎛ 1 + 2 ⎞ ( x) ( x − 5) ⎜ x x − 5⎟ ⎝ ⎠ 1 x ( x) ( x − 5) + = 0(x)(x – 5) multiply both sides by LCD ( x) ( x − 5) = 0 Distributive Property (x – 5) + 2(x) = 0 cancel like terms in numerators and denominators x – 5 + 2x = 0 remove parentheses 3x – 5 = 0 combine like terms 3x – 5 + 5 = 0+5 add 5 to both sides 3x = 5 simplify 2 x−5 3x = 3 = x 5 3 5 3 divide both sides by 3 simplify In order to check this, we let x = 5/3 in our original equation and get: 1 5 3 + 2 5 3 −5 3 = 5 + 2 5 − 3 3 = 5 + 15 3 dividing by a fraction is same as multiplying by reciprocal convert whole number in denominator in second term to a fraction 2 simplify secondary fraction in second term − 10 3 = = = = 3 5 3 5 3 5 + 2⎛⎜ − ⎝ − − 0 44 – Introduction to Linear Equations 6 10 3 5 3 ⎞ ⎟ 10 ⎠ dividing by fraction is same as multiplying by its reciprocal multiply 2 inside parentheses reduce second fraction to lowest terms simplify check Linear Equations Advanced To learn about the relationship between lines and slope. No definitions on this page. Thales of Miletus (c.640-546 B.C.) is not only famous for introducing geometry to the Greeks, but also for establishing the need for deductive reasoning and proofs to demonstrate the validity of a theorem or statement. The Greek philosopher Proclus credits Thales with five basic theorems of geometry: (1) The circle is bisected by its diameter. (2) The base angles of an isosceles triangle are equal. (3) Pairs of vertical angles formed by two intersecting straight lines are equal. (4) An angle inscribed in a semi-circle is a right angle. (5) Two triangles are congruent if they have two angles and one side that are equal. Thales reportedly used this last theorem to find the distance from a ship to the shore using a tower of known height, a plum line, and a carpenter's square (called a gnomon). Linear Equations → Lines and Slope In real life problems the word slope can mean several different things. The slope of a line can refer to a ratio or a rate. The slope is a ratio if the x-axis and y-axis have the same unit of measure. For example, the steepness of Mt. St. Helens is a ratio of height to length and both are measured in feet. The slope is said to be a rate, or rate of change if the x-axis and y-axis have different units. Economists calculate marginal cost, the rate of cost per unit, which is measured in dollars and the number of units. The slope of a line is also an important tool in sketching graphs. We can easily draw a graph given any two points on the line or by using one point and the slope of the line. Making a graph is important because you can use it to determine other points on the line. When looking at the graph, you can determine the behavior of the line at a glance. Definition of the Slope of a Line In a coordinate plane, the slope of a line is the ratio of its vertical rise to the horizontal run. slope = rise run definition of slope The rise is simply the change in the y direction of the line and the run is the change in the x direction. The slope is usually denoted by m and is determined by any two points on a particular line. Using the points (x1, y1) and (x2 , y2), we can calculate the slope of the line. slope = = rise run y2 − y1 x2 − x1 Let's consider a line passing through the points (0, 3) and (4, 5). Linear Equations Lines and Slope – 45 If we let (x1, y1) = (0, 3) and (x2, y2) = (4, 5), we can find the slope of the line using the definition of the slope: m = = = = y2 − y1 x2 − x1 5−3 4−0 2 4 1 2 two point form of slope this is read as "y sub 2 minus y sub 1 divided by x sub 2 minus x sub 1" substitute in x1 = 0, x2 = 4, y1 = 3 and y2 = 5 simplify numerator and denominator reduce fraction to lowest terms The change in the y direction (or vertical change) is the rise and the change in the x direction (horizontal change) is the run. The slope, or rate of change, is the quotient of these changes. A slope of –3/4 would mean for every change of 4 units to the right there is a change of 3 units down. Also, this means that for every change of 1 horizontal unit there is a vertical change of –3/4 of a unit. Note: When calculating the slope of a line, it is important to remember to subtract the coordinates in the same order. Therefore, when practicing you should always label your first point as (x1, y1) and your second point as (x2, y2). Then make sure that you form the numerator and denominator using the same order of subtraction: Correct y2 − y1 Incorrect y2 − y1 x2 − x1 x1 − x2 When finding the slope of the line, 4 different possibilities can occur. The line can rise, fall, have no slope, or be undefined. Let's take a look at the 4 cases and their graphs. Case 1: A Line with a Positive Slope A line with a positive slope, as in the previous example, rises from left to right. That is, the right end of the line is higher on the graph than the left end of the line, as indicated below: 46 – Lines and Slope Linear Equations Case 2: A Line with a Negative Slope Find the slope of the line passing through (0, 3) and (4, 1). m = = = = y2 − y1 x2 − x1 1−3 4−0 −2 4 − 1 2 two point slope formula substitute in y2 = 1, y1 = 3, x2 = 4, and y1 = 0 simplify reduce fraction to simplest form slope is negative A negative slope indicates that the end of the line is lower on the right than it is on the left, as shown below: Case 3: A Line with a Zero Slope Find the slope of the line passing through (5, 4) and (10, 4). m = = = = Linear Equations y2 − y1 x2 − x1 4−4 10 − 5 0 5 0 two point slope formula substitute in y2 = 4, y1 = 4, x2 = 10 and x1 = 5 simplify slope is 0 Lines and Slope – 47 A zero slope means that the line is completely horizontal. It neither rises nor falls as we move along the line from left to right. A graph of a flat line is given below: The equation for this horizontal line is y = 5. In general, horizontal lines have equations in the form y = constant general form of a horizontal linear equation Case 4: A Line with an Undefined Slope Find the slope of the line passing through (3, 2) and (3, 7). m = = = = y2 − y1 x2 − x1 7−2 3−3 5 0 undefined two point slope formula substitute in y2 = 7, y1 = 2, x2 = 3, and x1 = 3 simplify division by zero cannot be defined Because division by zero is not possible, this expression has no meaning. We say that the slope of a vertical line is undefined. A graph of a vertical line is shown below: 48 – Lines and Slope Linear Equations The equation for this vertical line is x = 3. In general, vertical lines have equations of the form x = constant general form of a vertical linear equation Given the point (10, 2) and slope of 1/5, sketch the graph. solution Plot the point (10, 2) on a coordinate plane. Then, move up 1 unit in the y direction and to the right 5 units in the x direction. You should be at the point (15, 3). Keep in mind a slope of 1/5 is the same as a slope of –1/–5. So, if you move down 1 unit in the y direction, then you also move left 5 units in the x direction. Also, the calculated slope for a given line will always be the same no matter which pair of points is chosen. answer The church of St. Paul is a mortuary chapel, which stands 32 feet wide. The design is similar to that of a miniCathedral. The main roof and the bell tower both rise 5 feet for every 4 feet of horizontal distance. The bell tower is 4 feet wide and 15 feet tall. If the walls of the chapel are 50 feet high and the walls of the bell tower are 12 feet high, find the distance from the ground to the peak of the bell tower. solution First, we find the slope of the roof. It rises 5 feet for every 4 feet of horizontal distance it travels, so we can determine the slope as follows: m = = rise run 5 4 definition of slope substitute in rise = 5 and run = 4 Next, we need to find the height of the main roof. The distance from one side of the chapel to the peak of its roof is one-half the total width of the chapel. The chapel is 32 feet wide, so half that distance is 16 feet. Now to find the height of the roof, we set up ratio relating the slope to the actual height of the chapel roof, which we will call c. Linear Equations Lines and Slope – 49 5 4 = c 16 ratio of slope to actual height of chapel 80 = 4c cross multiply 20 = c divide both sides by 4 Thus, the height of the chapel roof is 20 feet. Next, determine the height of the roof on the bell tower and call it b. The distance from one side of the chapel to the peak of its roof is 4 divided by 2, which is 2 ft. Again, set up a ratio. 5 b = 4 2 ratio of slope to actual height of bell tower 10 = 4b cross multiply 2.5 = b divide both sides by 4 Thus, the roof of the bell tower is only 2.5 feet high. To find the total height of the chapel from the stone floor to the very top of the bell tower, we add c = 20 and b = 2.5 to height of the walls w = 50 and the height of the bell tower a = 12: h = w+c+a+b total height is sum of four heights = 50 + 20 + 12 + 2.5 substitute in w = 50, c = 20, a = 12, and b = 2.5 = 84.5 ft simplify answer 50 – Lines and Slope Linear Equations Advanced To define parallel and perpendicular lines. parallel lines – two lines are parallel if there is a plane in which they both lie and they do not cross at any point within the plane. perpendicular lines – two straight lines in a plane, the intersection of which forms right angles. Euclid's (c.300 B.C.) Fifth Postulate is one of the most important contributions to geometry. This postulate lays the foundation for the theory of parallel lines, including the idea that only one parallel to a given line can be drawn through a given point external to that line. Oddly enough, the Fifth Postulate cannot be proved using the previous four postulates. Attempts to prove the Fifth Postulate have led to the creation of non-Euclidean geometries. The alternate geometries have contributed greatly to the study of physics and relativity. Linear Equations → Lines and Slope → Parallel and Perpendicular Lines Recall from geometry that one definition of parallel lines is two lines in the same plane that never cross. The algebraic definition is a bit more technical. In algebra, we define lines in terms of slope, so it should come as no surprise that we define both parallel and perpendicular lines in terms of their slopes. Parallel and Perpendicular Lines 1.) Two distinct nonvertical lines are parallel if and only if their slopes are equal. That is, m11 = m2. 2.) Two nonvertical lines are perpendicular if and only if their slopes are negative reciprocals of each other. That is, m1 = –1 / m2. Write an equation of the line through the given point (a) parallel to the given line and (b) perpendicular to the given line. 4x – 2y = 3 (2, 1) solution First, we need to rewrite the equation of the line in some form that is easier for us to work with. Slope-intercept form is always a safe bet so let's rearrange the given equation a bit. 4x – 2y = 3 given equation –2y = –4x + 3 subtract 4x from both sides y = y = Linear Equations −4x + 3 −2 2x − 3 2 divide both sides by –2 simplify slope-intercept form Parallel and Perpendicular Lines – 51 a.) b.) Any line that is parallel to the given line above must have the same slope of 2. Thus, the line through (2, 1) that is parallel to the given line must also have a slope of 2 and its equation can be found using the point-slope form for the equation of a line: y – y1 = m(x – x1) point-slope form y–1 = 2(x – 2) substitute in m = 2, x1 = 2, and y1 = 1 y–1 = 2x – 4 Distributive Property y = 2x – 3 add 1 to both sides slope-intercept form answer Any line that is perpendicular to the given line must have a slope of –1/ 2, since this is the negative reciprocal of 2. Thus, the line through (2, 1) that is perpendicular to the given line can be found by again applying the point-slope form for the equation of a line: y – y1 = m(x – x1) point-slope form y–1 = 1 − ( x − 2) 2 substitute in m = –1/2, x1 = 2, and y1 = 1 y–1 = 1 − x+1 2 Distributive Property y = 1 − x+2 2 add 1 to both sides slope-intercept form answer The graph of all three lines is given below: 52 – Parallel and Perpendicular Lines Linear Equations Advanced To find the equation of a line given its slope and its y-intercept. No definitions on this page. Alan Turing (1912-1954) was an English algebraist, logician, and code-breaker. He is also a leading pioneer in computer theory. In one of his papers he describes a theoretical automatic machine that could solve any mathematical problem as long as the machine was given the correct problem-solving instructions or algorithm. It was called the Turing machine after its creator. Turing worked with other mathematicians to develop a system of equations called logic gates. This system relied on the use of binary numbers (1 and 0) to produce the problem solving algorithms that would be used by an automatic calculating machine. Modern-day computers still use the same logic to function; they just do it better and faster than Turing could ever dream of. He died in 1954 after eating a cyanide-laced apple. Linear Equations → Lines and Slope → Slope-Intercept Form of a Line One of the simplest equations for modeling two variables is the linear equation y = mx + b linear equation This equation is called linear because when we graph it on the Cartesian plane, it takes the shape of a line. In mathematics, the term line refers exclusively to straight lines, not curves, whirls, loops, or any other sort of non-straight lines. If we let x = 0 in the above equation, we end up with y = b let x = 0 We call the point (0, b) the y-intercept because this is the point at which the line crosses the y-axis, as shown below. positive slope, line rises negative slope, line falls If b = 0, then the line passes through the origin. If m = 0, then we have a horizontal line given by the equation y = Linear Equations b horizontal line Slope-Intercept Form of a Line – 53 The Slope-Intercept Form of the Equation of a Line The graph of the equation y = mx + b slope-intercept form is a line whose slope is m and whose y-intercept is (0, b). Once we have determined the slope of a line and its y-intercept, it is a relatively simple matter to sketch the graph of the line. If we have a vertical line, then that line has the equation x = equation of a vertical line a The slope of a vertical line cannot be written in the form y = mx + b because the slope of a vertical line is undefined. Find the slope and y-intercept of the equation of the line. Sketch a graph of this line. 2x + 3y – 9 = 0 solution First, let's transform the given equation so that it is in slope-intercept form. In other words, we solve the equation for y: 2x + 3y – 9 = 0 given equation 3y = –2x + 9 isolate the term containing y y = y = −2x + 9 3 2 − x+3 3 divide both sides by 3 simplify slope-intercept form Based on the slope-intercept form of the linear equation, we can see that the slope is m = –2/3 and the y-intercept is (0, 3), as indicated in the graph below: answer 54 – Slope-Intercept Form of a Line Linear Equations Advanced To write a linear equation using the slope of a line and a point on that line. point-slope form – one form of a linear equation derived from knowing one point on the line and its slope. Among the many secrets Thales of Miletus (c.640-546) learned from the Babylonians was the secret of predicting solar eclipses. Using these secrets, Thales was able to predict the solar eclipse of 585 B.C. He still had a tremendous amount of luck to help him, however, as the eclipse just happened to be visible from his school in Miletus. This prediction bolstered Thales's fame and reputation throughout the lands of Greece. Linear Equations → Lines and Slope → Point-Slope Form of a Line One way in which we defined the slope of a line was in terms of two coordinate points (x1, y1) and (x, y). The slope m was defined as: y − y1 = x − x1 m definition of slope Now, suppose we multiply both sides by the quantity (x – x1): y – y1 = m(x – x1) multiply both sides by (x2 – x1) This equation is known as the point-slope form of the equation of a line because it involves two points on the line and their slope. The Point-Slope Form of the Equation of a Line The equation of the line with slope m passing through the point (x1, y1) is: y – y1 = m(x – x1) point-slope form of the equation of a line One of the most useful applications of this form of the equation of a line is actually finding the equation of a line. That is, given a point and the slope of the line, we can easily find an equation for that line. Then we can convert from this form to the slope-intercept form to help us graph the line. Find an equation of the line that passes through the given point and has the indicated slope. Then graph the line. point (–3, 5) slope m = –2 Linear Equations Point-Slope Form of a Line – 55 solution We use the point-slope formula with m = –2 and (x1, y1) = (–3, 5). y – y1 = m(x – x1) point-slope form y–5 = –2(x – (–3)) substitute in m = –2, x1 = –3, and y1 = 5 y–5 = –2(x + 3) simplify inside parentheses y–5 = –2x – 6 Distributive Property y = –2x – 1 add 5 to both sides slope-intercept form answer Based on this equation, we can see that the slope is –2 and the y-intercept is –1, so we have the following graph: answer Two-Point Form of the Equation of a Line If we have two points instead of one point, we can still find the equation of a line passing through those two points. If we have the two points (x1, y1) and (x2, y2), then the slope of the line connecting these two points is given by the difference in the y-coordinates divided by the difference in the x-coordinates: m y2 − y1 = x2 − x1 x1 ≠ x2 When we plug this into the point-slope form of the equation of a line, we obtain: y – y1 = y2 − y1 x2 − x1 (x − x1) two-point form This is called the two-point form of the equation of a line. 56 – Point-Slope Form of a Line Linear Equations Find an equation of the line passing through the two points. Then sketch the graph of the line. (–1, 3) (3, 5) solution We use the two-point form of the equation of a line as given above with (x1, y1) = (–1, 3) and (x2, y2) = (3, 5): y – y1 = y–3 = y–3 = y–3 = y–3 = y = y2 − y1 x2 − x1 5−3 3+ 1 2 4 1 2 1 2 1 2 (x − x1) ( x + 1) two-point form substitute in for x1, y1, x2 and y2 ( x + 1) simplify fraction ( x + 1) simplify fraction x+ x+ 1 2 7 2 Distributive Property add 3 to both sides slope-intercept form answer Thus, we have a line with slope m = 1/2 and y-intercept (0, 7/2), as shown on the graph below: answer Linear Equations Point-Slope Form of a Line – 57 58 – Point-Slope Form of a Line Linear Equations Every line can be written in the form Ax + By + C = 0 standard form of the equation of a line where A, B, and C are all real numbers. Every other form of the equation of a line can be derived from the standard form. If we let A = 0, then we have: By + C = 0 y = − let A = 0 solve for y equation of horizontal line C B If we let B = 0, then we have: Ax + C = 0 x = − let B = 0 solve for x equation of vertical line C A Recall that a vertical line has an undefined slope. Here we just want to summarize the different equations for lines. Using them is largely a matter of preference, depending on the situation. If we know one point and the slope, then we usually use the point-slope form. If we are given the slope and the y-intercept, we use the slope-intercept form. If all we know is two points, then we use the two-point form, and so on. Equations of Lines 1.) General (standard) form: Ax + By + C 2.) = = = b horizontal line mx + b slope-intercept form = m(x – x1) point-slope form Two-point form: y – y1 Linear Equations vertical line Point-slope form: y – y1 6.) a Slope-intercept form: y 5.) general form Horizontal line: y 4.) 0 Vertical line: x 3.) = = two-point form Equations of Lines – 59 60 – Equations of Lines Linear Equations Advanced To solve linear inequalities for the independent variable. No definitions on this page. In 1854, George Boole (1815-1864) wrote An Investigation of the Laws of Thought, wherein he applied mathematics to the study of logic. He reduced logical relationships to statements about equality, inequality, exclusion and inclusion. These statements were expressed symbolically using a two-digit or binary code. The algebraic rules which governed logical relationships became known as Boolean algebra. They have since had a tremendous impact on the fields of electronics and computer science. Linear Equations → Linear Inequalities The simplest type of inequality is a linear inequality in a single variable. That is, we have an algebraic expression with only one variable raised to the first power being compared to some real number. An example of a linear inequality is: x–5 < 8 sample linear inequality Solving a linear inequality is very similar to solving a linear equation. The only thing we have to remember is that when we multiply or divide both sides of an inequality by a negative number, we have to reverse the inequality symbol. Solve the inequality and sketch the solution on the real number line. 5x – 3 < 2x + 9 solution In order to solve this inequality, we try to rearrange the terms so that the variable is on one side and everything else is on the other side, just as we do when solving linear equations. 5x – 3 < 2x + 9 original inequality 5x – 2x < 3+9 add 3 to both sides subtract 2x from both sides 3x < 12 simplify both sides x < 4 divide both sides by 3 we divide by a positive number, so the inequality symbol remains the same answer Linear Equations Linear Inequalities – 61 The graph of the solution looks like the following. answer Note: We use an open circle at x = 4 to indicate that x = 4 is not a part of the solution! Double Inequalities Sometimes it is convenient to express two inequalities as a single expression, combining them into a double inequality. For instance, we can write 2 ≤ 3x + 5 and 3x + 5 < 13 as the following: 2 ≤ 3x + 5 < 13 To solve such an inequality, we work on all three parts simultaneously. Whatever we do to one part of the inequality, we have to do to all three parts. Solve the inequality and sketch its graph on the real number line. 1 ≤ 3x – 5 < 13 original double inequality 6 ≤ 3x < 18 add 5 to all three parts 2 ≤ x < 6 divide all sides by 3 answer So x is greater than or equal to 2 and x is less than 6. We indicate this on the number line below: answer We could also have solved this problem as two inequalities: 1 ≤ 3x – 5 6 ≤ 2 ≤ and 3x – 5 < 13 solve as two inequalities 3x 3x < 18 add 5 to both sides of each inequality x x < 6 divide both sides of each inequality by 3 We still obtain the same solution set. The solution consists of all values of x that satisfy both inequalities. That is, the solution set is 2 ≤ x < 6, which is exactly what we obtained in the example above. Be very careful when combining inequalities, as the inequalities have to satisfy the Transitive Property of Inequalities in order to be combined. That is, if a < b and b < c, then a < b < c. For instance, it is not correct to combine the inequalities 4 < 3x and 3x < –2 into 4 < 3x < –2 because 4 is not less than –2. 62 – Linear Inequalities Linear Equations Advanced To examine inequalities involving the absolute value operator. No definitions on this page. George Boole (1815-1864) married Mary Everest (Boole) in 1855. She was the niece of Sir George Everest, the man for whom the tallest mountain in the world is named. Boole died of pneumonia at the age of 49. Apparently, he had walked two miles through a drenching downpour and caught pneumonia. His wife believed that the cure for the illness should resemble the cause, so she put him to bed and repeatedly doused him with buckets of cold water. Needless to say, this did not help Boole in any way and he died shortly thereafter. Linear Equations → Linear Inequalities → Absolute Value Inequalities Inequalities that involve the absolute value operator "| |" have to be treated carefully. Recall that absolute value is defined as the distance from the origin. The absolute value of a real number is always a positive quantity. When we deal with inequalities, we are actually concerned with finding a range of values for the independent variable x. An absolute value inequality, therefore, involves looking at values of x that are either within a certain distance of the origin (indicated by the < or ≤ symbols) or are located outside a certain distance from the origin (indicated by the > or ≥ symbols). All absolute value inequalities can be broken down into either a conjunction (use of the word "and") or a disjunction (use of the word "or"). For instance, |x| ≥ 5 sample absolute value inequality actually means all values of x that are more than 5 units away from the origin. Since this can include positive or negative values of x, we can break this inequality down into the following two inequalities: x ≥ 5 or x ≤ –5 We can summarize these principles with the following guidelines: Solving an Absolute Value Inequality Let x be a variable of an algebraic expression and let a be a real number such that a ≥ 0. 1.) The solutions of | x | < a are all values of x that lie between –a and a. |x|<a 2.) –a < x < a conjunction The solutions of | x | > a are all values of x that are less than –a or greater than a. |x|>a Linear Equations if and only if if and only if x < –a or x > a disjunction Absolute Value Inequalities – 63 Solve the inequality and sketch the solution on the real number line. | 4x | > 12 solution Since we have the greater than symbol (>), we have to apply the second guideline given above. We rewrite this as two inequalities linked by the term "or": 4x < –12 or 4x > 12 rewrite inequality as disjunction x < –3 or x > 3 divide both sides by 4 answer To graph the solution set, we plot all those points on the number line that are greater than 3 or less than –3, as shown below. Notice that we use open circles at x = –3 and x = 3 to indicate that they are not part of the solution set: answer 64 – Absolute Value Inequalities Linear Equations Advanced To find the solution set of a polynomial inequality. No definitions on this page. Georg Cantor (1845-1918), a Russian-born German algebraist and analyst, was instrumental in the development of set theory. His first paper dealt with algebraic numbers. An algebraic number is any real number that is a solution to an equation with integer coefficients. He proved there is a one-to-one correspondence between the set of all algebraic numbers and the set of all positive integers. Furthermore, the set of all reals cannot be put into the same one-to-one correspondence. As he explained it, the set of positive integers and the set of algebraic numbers have the same power, but the power of the set of all reals is different from either. Linear Equations → Linear Inequalities → Polynomial Inequalities In the previous thought, we only concerned ourselves with solving linear inequalities. Now we are going to take it up a notch and solve inequalities involving higher degree polynomials. Suppose we have the following polynomial inequality: x2 + 2x – 3 < 0 sample polynomial inequality A polynomial can only change its sign at its zeros (points where the graph of the polynomial crosses the x-axis). Between two consecutive zeros, a polynomial has to be entirely positive (above the x-axis) or entirely negative (below the x-axis). When we arrange the zeros of a polynomial in order from least to greatest, we are in effect dividing the number line into manageable intervals. Inside each interval, the polynomial has no sign changes. The zeros for the polynomial are called the critical numbers of the inequality. The resulting intervals are called the test intervals for the inequality. For instance, the polynomial x2 + 2x – 3 = (x + 3)(x – 1) factor the polynomial has two zeros, located at x = –3 and x = 1. These zeros divide the number line into three test intervals: (–∞, –3) first test interval (–3, 1) second test interval (1, ∞) third test interval Once we have determined our test intervals, we choose a value for x from each interval and plug that value into our given inequality. If the inequality holds true for the value of x we choose, we can conclude that the interval that contains that value of x is a valid solution set for the inequality. If the inequality does not hold true for the value of x we choose, then we conclude that the interval that contains that values of x is not a valid solution set for the inequality. This same approach can be used for any polynomial inequality, including rational or linear inequalities. Linear Equations Polynomial Inequalities – 65 Finding Test Intervals for a Polynomial To determine the intervals on which the values of a polynomial are entirely negative or entirely positive, use the following steps. 1.) Find all real zeros of the polynomial, and arrange the zeros in increasing order (from smallest to largest). The zeros are the critical numbers of the polynomial. 2.) Use the critical numbers of the polynomial to determine its test intervals. 3.) Choose one representative value of x in each test interval and evaluate the polynomial at that value. If the value of the polynomial is negative, then the polynomial will have negative values for every x-value in the test interval. If the value of the polynomial is positive, then the polynomial will have positive values for every x-value in the test interval. Solve the inequality and graph the solution on the real number line. x2 – 3 ≥ 1 solution First, we have to rearrange the terms of this inequality so that we have "0" on one side and everything else on the other side. Then we can proceed with finding the zeros of the polynomial using standard techniques. x2 – 3 ≥ 1 given inequality x2 – 4 ≥ 0 subtract 1 from both sides Now we factor the left hand side to find the zeros: x2 – 4 = (x – 2)(x + 2) factor the polynomial expression This polynomial expression only equals zero when x = –2 or x = 2. Thus, the polynomial's test intervals are: (– , –2] Note: first test interval [–2, 2] second test interval [2, third test interval ) If the degree of a polynomial is given by n, then the number of test intervals we obtain will be given by n + 1. Inside each test interval, we choose representative values of x and plug them into our polynomial: Interval (–∞, –2] x-value x = –4 Polynomial Value (–4)2 – 4 = 16 – 4 = 12 ≥ 0 Conclusion positive [–2, 2] x=0 0 – 4 = –4 ≤ 0 negative [2, ∞) x=4 42 – 4 = 16 – 4 = 12 ≥ 0 positive 66 – Polynomial Inequalities Linear Equations We want values of x such that the polynomial inequality x2 – 4 ≥ 0 holds true. This only happens when x is inside the intervals (– , –2] and [2, ). Furthermore, because we use the symbol ≥, we include the endpoints of the intervals. Therefore, we have the following solution sets for the inequality: x ≤ –2 or x ≥ 2 answer The solution sets of this inequality can be graphed as follows: answer Note: In this example, we used the symbol ≥, which means we had to include the endpoints in our intervals, thus we used square brackets "[" and "]" to indicate that –2 and 2 were included in the solution set. Had we been given the symbol > instead, we would have had to use "(" and ")" around the points –2 and 2. Infinity is always written with parentheses because it is never included as part of the solution set. Linear Equations Polynomial Inequalities – 67 68 – Polynomial Inequalities Linear Equations Advanced To solve inequalities involving rational expressions. No definitions on this page. Georg Cantor (1845-1918) held little influence in his home country of Germany, but was widely recognized in the international mathematical community. His theories of sets led to whole new branches of mathematics, such as topology, measure theory, and set theory itself. In 1884, Cantor suffered a nervous breakdown and continued to suffer from many such mental crises over the course of the rest of his life. One of Cantor's more remarkable creations is the concept of a transfinite number. Transfinite numbers describe the number of objects in a set. Linear Equations → Linear Inequalities → Polynomial Inequalities → Rational Inequalities Solving rational inequalities involves many of the principles covered in the previous thought concerning polynomial inequalities. However, since rational inequalities by definition involve dividing a polynomial by another polynomial, we have to pay very close attention to the denominator of our rational expression. As we did for polynomial inequalities, we need to use the concept of critical numbers and test intervals in order to solve a rational inequality. The values of a rational expression can change sign only at its zeros (the x-values for which the numerator is zero) and its undefined values (the x-values for which the denominator is zero). Solve 3x − 5 x−5 > 4 solution First, we need to get all the terms on one side of the inequality and the other side of the inequality is set equal to zero. Then we look for the critical numbers for this inequality that are located at points where the numerator is equal to zero and where the denominator is equal to zero. This establishes the endpoints of our test intervals. We test one number from each interval and see if the inequality is true. If it is true, then that interval forms a solution set for the inequality. Linear Equations Rational Inequalities – 69 3x − 5 x−5 3x − 5 x−5 −4 3x − 5 − 4( x − 5) x−5 3x − 5 − 4x + 20 x−5 −x + 15 x−5 > 4 original inequality > 0 subtract 4 from both sides > 0 combine left hand side into a single rational expression > 0 Distributive Property > 0 simplify numerator At this point, we need to find our critical numbers. When x = 15, the numerator equals zero, so this is one critical number. When x = 5, the denominator is 0, so this is another critical number. Thus, our critical numbers are: x = 15 one critical number x = 5 another critical number Our test intervals then become: first test interval (–∞, 5) (5, 15) second test interval (15, ∞) third test interval We test three points, one from each interval, as shown on the line below. We graph the solution set on the line below: Based on our tests, we discover that the following values of x are the only ones that make the given inequality at the top of the page true. 5 < x < 15 answer All rational inequalities can be solved using this same process, although the details will depend exactly on the inequality in question. 70 – Rational Inequalities Linear Equations In this section of the Algebra Brain, we are going to discuss systems of linear equations and inequalities. A linear system consists of two or more linear equations containing any number of variables all raised to the first power. For instance: 3x – 4y = 5 2x + 5y = –2 and is a linear system. There are numerous ways we can solve systems of linear equations. 1.) Substitution Method – Solve the system of equations for one of the variables, then start backsubstituting into the equations until all the variables have been solved for. 2.) Graphical Approach – Graph the system of linear equations (assuming that you only have the variables x and y). Then the intersection of the lines is the solution of the linear system. 3.) Method of Elimination – Eliminate one or more variables from the system of linear equations, then solve for the remaining variable. Back-substitute the solution for that variable into the linear equations and solve for the remaining variables. 4.) Multivariable Linear Systems – A method for solving systems of linear equations involving three or more variables. This method is closely related to the method of elimination. Another name for this technique is Gaussian elimination. 5.) Systems of Inequalities – Graph inequalities. The solution to the system of inequalities consists of those regions in the plane that satisfy the inequalities. Linear Equations Linear Systems – 71 72 – Linear Systems Linear Equations Advanced To solve a system of linear equations using the Method of Elimination. No definitions on this page. Among Regiomontanus's (1436-1476) accomplishments is his presentation of the first trigonometrical formula for the area of a triangle. It appeared in his 1464 work De triangulis omnimedis libri guingre. This same work also presented innovative notations for the sine and cosine. Another of his works, Tabulae Directionum, or "table of directions" introduced the tangent function. Linear Equations → Linear Systems → Substitution Method In the other sections of the Algebra Brain, we develop methods for solving a single equation involving a single unknown quantity. Now it is time to learn to solve multiple equations involving multiple unknown quantities. Many types of problems in science, engineering, and business involve two or more equations containing two or more variables. To solve these kinds of problems, we need to find solutions of systems of equations. For the time being, we are going to only focus on systems of linear equations. Suppose we have the following system of linear equations: 3x + y = 9 Equation 1 2x – 3y = –5 Equation 2 The solution of this system of linear equations is an ordered pair (x, y) that satisfies both linear equations. In this case, the solution for this system of linear equations is the ordered pair (2, 3). To check, you can substitute in x = 2 and y = 3 into each equation and see if each equation holds: 3x + y = 9 Equation 1 3(2) + 3 = 9 substitute in x = 2, y = 3 6+3 = 9 simplify 9 = 9 both sides are equal check 2x – 3y = –5 Equation 2 2(2) – 3(3) = –5 substitute in x = 2, y = 3 4–9 = –5 simplify –5 = –5 both sides are equal check Now, how did we know that the solution to this system is (2, 3)? We applied the following set of guidelines for solving a system of linear equations. We call this technique the method of substitution (or simply Substitution Method). Linear Equations Substitution Method – 73 Substitution Method Note: 1.) Solve one of the equations for one variable in terms of the other. 2.) Substitute the expression found in Step 1 into the other expression to obtain an equation of one variable. 3.) Solve the equation obtained in Step 2. 4.) Back-substitute the solution in Step 3 into the expression obtained in Step 1 to find the value of the other variable. 5.) Check that the solution satisfies each of the original equations. The term back-substitution implies that we work backwards. First, we solve for one of the variables, and then we substitute that value back into one of our original equations in the system to find the value of the other variable. Solve the system of linear equations using the Substitution Method. 2x – y + 2 = 0 4x + y – 5 = 0 solution Our first step is to solve one of the equations for one of the variables. We will solve the first equation for y. 2x – y + 2 = 0 Equation 1 2x + 2 = y add y to both sides Now that we have an expression for y, we can substitute this expression into the second equation: 4x + y – 5 = 0 Equation 2 4x + (2x + 2) – 5 = 0 substitute in y = 2x + 2 6x – 3 = 0 combine like terms 6x = 3 add 3 to both sides x = x = 3 6 1 2 divide both sides by 6 reduce fraction to lowest terms Now that we have a value for x, we can back-substitute this value into either one of our original equations. We will substitute in x = 1/2 into our first equation and solve the equation for y: 74 – Substitution Method Linear Equations 2x – y + 2 2⎛⎜ = 0 Equation 1 = 0 substitute in x = 1/2 1–y+2 = 0 simplify 3 = y add y to both sides 1⎞ ⎟− y+2 ⎝ 2⎠ So now we have found that y = 3. Therefore, the solution to this system of linear equations is the ordered pair: (x, y) = ⎛ 1 , 3⎞ ⎜2 ⎟ ⎝ ⎠ answer Now we need to double-check to make sure that this is indeed the correct solution. We substitute in (x, y) = (1/2, 3) into both equations and see if they hold true: 2x – y + 2 2⎛⎜ = 0 Equation 1 = 0 substitute in x = 1/2 and y = 3 1–3+2 = 0 simplify 0 = 0 simplify check 4x + y – 5 = 0 Equation 2 = 0 substitute in x = 1/2 and y = 3 2+3–5 = 0 simplify 0 = 0 simplify check 1⎞ ⎟ −3+ 2 ⎝ 2⎠ 4⎛⎜ 1⎞ ⎟ + 3−5 ⎝ 2⎠ Note: When solving systems of linear equations it is extremely important to double-check your solution. Arithmetic errors are extremely common. Linear Equations Substitution Method – 75 76 – Substitution Method Linear Equations Advanced To solve systems of linear equations graphically. No definitions on this page. Subrahmanyan Chandrasekhar (1910-1995), also known as Chandra, was a brilliant astrophysicist and mathematician. His first love was mathematics but he became a physicist at his father's insistence. He is most famous for his work on the origins, dynamics, and structures of stars. In 1983, he was awarded the Nobel Prize for his research into the deaths of aged stars. Linear Equations → Linear Systems → Graphical Approach On occasion, solving a system of equations can be very difficult using straight algebraic techniques. It may be that the equations just do not have anything in common for us to work with. For instance, suppose we have the system y = 1 + ex y = x2 and At first glance, this looks to be an unfriendly system of equations. In order to solve this system, we have to set the equations equal to each other and then solve for x. This is not at all easy to do with this particular system of equations. However, if we graph the equations, then we can determine the solutions to this system by looking for places where the graphs of the equations intersect, as shown below: As we can see from the graph, the given system of equations only has 1 solution located between x = –1 and x = –2. We can easily apply this to systems of linear equations as well. If we have a system of linear equations, then the solution to the equation is the point on the graph where the lines cross. If the graphs of a system of equation have no intersection, then the system of equation has no solution. Linear Equations Graphical Approach – 77 Solve the system of equations graphically. 4x – 5y = 17 2x + 2y = 4 and solution To solve this problem graphically, we have to use a handy graphing tool. You can use a calculator or a software program to draw the graph. As always, we use MathCAD 2000®. Most graphing tools will only graph linear equations in slope-intercept form, so we first need to transform the equations so that they are in the form y = mx + b. y = y = 4 5 x− 17 5 –x + 2 Equation 1 in slope-intercept form Equation 2 in slope-intercept form Then the graph of these two lines is given by the following: Based on the graphs of these two lines, we can see that they intersect at the point (3, –1). Therefore, the solution to this linear system is: (x, y) = (3, –1) 78 – Graphical Approach answer Linear Equations Advanced To identify the number of solutions of a linear system by looking at the graph of the system. consistent – a system of linear equations that has at least one solution. inconsistent – a system of linear equations that has no solution. Subrahmanyan Chandrasekhar (1910-1995) did important research into white dwarf stars. He concluded that the radius of a white dwarf is directly proportional to its mass. Furthermore, he theorized that the white dwarf stage is only one possible final stage of a star's evolution. The primary factor in determining a star's final stage is its mass. Stars larger than 1.2 times the mass of our sun will most likely become neutron stars. These are only 9 miles in diameter but are more massive than our own sun. Supermassive stars, many times the mass of our sun, sometimes end up as black holes. Stars with a mass less than or equal to 1.2 solar masses become white dwarfs, roughly the size of Earth but with all the mass of the sun (or a little bit more). Linear Equations → Linear Systems → Graphical Approach → Graphical Interpretation of Solutions In general, for any given system of equations (linear or otherwise) there are three possibilities for the solutions to the system. There can be only one solution, or there can be two or more solutions, or there can be no solution. If we have a system of equations in which all the equations are linear, then if the system has two different solutions, then it has infinitely many solutions. This stems from the fact that two lines can only intersect at more than one point if they are the exact same line! That is, the two lines are exactly identical. For instance, the system of equations below produces only a single line on the same graph: 2x + 2y = 4 Equation 1 x+y = 2 Equation 2 Equation 1 is simply a multiple of Equation 2. When we solve Equation 1 to put it in slope-intercept form, we will find out that it has the same slope as Equation 2 as well as the same y-intercept. Therefore, Equation 1 is identical to Equation 2 and the solution for this system consists of all real numbers: 2x + 2y = 4 Equation 1 2y = –2x + 4 subtract 2x from both sides y = –x + 2 divide both sides by 2 Equation 1 in slope-intercept form x+y = 2 Equation 2 y = –x + 2 subtract x from both sides Equation 2 in slope-intercept form Linear Equations Graphical Interpretation of Solutions – 79 Graphical Interpretation of Solutions For a system of two linear equations in two variables, the number of solutions is given by one of the following: 1.) Number of Solutions Exactly one solution Graphical Interpretation The two lines intersect at one point 2.) Infinitely many solutions The two lines are identical 3.) No solution The two lines are parallel Each of the three cases is illustrated below: Exactly ONE solution Infinitely MANY solutions NO solution Consistent Two lines that intersect One point of intersection Consistent Two lines that coincide Infinitely many points of intersection Inconsistent Two parallel lines No point of intersection A system of linear equations is called consistent if it has at least one solution (remember that if it has two, then it also has infinitely many others). A linear system is called inconsistent if it has no solution. Use a graphing utility to graph the lines in the system. Determine whether the solution is consistent or inconsistent. If it is consistent, determine the number of solutions. 2x + y = 5 Equation 1 x – 2y = –1 Equation 2 solution To graph the equations, we will first need to put them into slope-intercept form. This may also give us a clue as to whether the solution is consistent or not. 2x + y = 5 Equation 1 y = –2x + 5 subtract 2x from both sides slope-intercept form x – 2y = –1 Equation 2 –2y = –x – 1 subtract x from both sides y = 1 2 x+ 1 2 divide both sides by –2 slope-intercept form 80 – Graphical Interpretation of Solutions Linear Equations Ah ha! We have a clue! Notice that the slopes of the lines are negative reciprocals of each other. That right there tells us that these two lines are perpendicular. Two lines that are perpendicular can only intersect at one point. Therefore, we know even before we graph the lines that the solution will be consistent. Let's go ahead and verify this with the graph: As you can see from the graph, our prediction based on the slopes was correct. The solution is consistent. Furthermore, because the lines only intersect at one point, there can be only one solution to this system of linear equations. answer To actually find the solution to the system of equations, we would have to use one of the methods described in this section of the Algebra Brain, such as the Substitution Method or the Method of Elimination. Linear Equations Graphical Interpretation of Solutions – 81 82 – Graphical Interpretation of Solutions Linear Equations Advanced To solve systems of linear equations by eliminating one or more variables. No definitions on this page. Chandrasekhar's (1910-1995) theories of stellar evolution were ridiculed at first by another prominent astronomer, Sir Arthur Stanley Eddington. Eddington had greater status in the scientific community, so he managed to suppress Chandra\s theories about neutron stars or the more massive black holes. It took 20 years before Chandra's theories resurfaced and gained wide acceptance. It was 50 years before Chandra was awarded the Nobel Prize for his white dwarf research. Oddly, despite the scientific disputes between Chandra and Eddington, they retained a close personal relationship. Linear Equations → Linear Systems → Method of Elimination When we solve systems of linear equations, we can use a variety of methods to find the solution. Which method we use is largely a matter of preference, although some methods are more efficient than others. If we have a system of two linear equations with only two unknowns, then we can use the Method of Elimination to solve the system. This method involves "eliminating" one of the variables in the system by adding one equation to a multiple of the other equation so that one of the variables cancels out. Consider the following simple linear system of equations. x+y = 2 x–y = 0 and If we add the two equations together, we get: x+y = 2 Equation 1 x–y = 0 Equation 2 2x = 2 add Equation 1 to Equation 2 x = 1 divide both sides by 2 to solve for x Notice that we eliminated the variable y and obtained a single equation in x. This principle even works if we have "messy" linear equations involving coefficients of x and y. We add the equations in such a way as to get rid of one of the variables. Once we have found one of the variables, we can back-substitute into either one of the original equations to find the other variable: x+y = 2 Equation 1 1+y = 2 substitute in x = 1 y = 1 subtract 1 from both sides Thus, the solution to the linear system given above is: (x, y) = (1, 1) solution to the linear system In general, we can apply the following guidelines to help us solve systems of linear equations using the Method of Elimination: Linear Equations Method of Elimination – 83 Method of Elimination To use the Method of Elimination to solve a system of linear equations in x and y, use the following steps: 1.) Obtain coefficients for x (or y) that differ only in sign by multiplying all terms of one or both equations by suitably chosen constants. 2.) Add the equations to eliminate one variable and solve the resulting equation. 3.) Back-substitute the value obtained in Step 2 into either of the original equations and solve for the other variable. 4.) Check your solution in both of the original equations. Solve the following system of linear equations by the Method of Elimination. 2x – y = 3 Equation 1 4x + 3y = 21 Equation 2 solution According to Step 1, above, we first need to multiply one of the equations by some constant so that the coefficients of one variable are the same in both equations. Let's multiply Equation 1 by 3 and see what happens: 3(2x – y) = 3(3) multiply both sides of Equation 1 by 3 6x – 3y = 9 simplify Notice that we have obtained an equivalent equation to Equation 1. Furthermore, the coefficient of y is the same magnitude as the coefficient of y in Equation 2, but has a different sign. Therefore, when we add the equivalent equation to Equation 2, we obtain: 6x – 3y = 9 equivalent to Equation 1 4x + 3y = 21 Equation 2 10x = 30 add the two equations = 3 x divide both sides by 10 to solve for x Now that we have solved the system for x, we can use this value in either of the original linear equations to solve them for y. We will use Equation 1 because it is slightly easier to solve for y: 2x – y = 3 Equation 1 2(3) – y = 3 substitute in x = 3 6–y = 3 simplify 3 = y add y to both sides subtract 3 from both sides 84 – Method of Elimination Linear Equations Once we have solved one of the equations for y, we can go ahead and conclude that the solution is: (x, y) = (3, 3) answer We are not quite done, though. Many students feel compelled to stop when they have found the solution. However, we also need to verify our solution. Arithmetic errors can lead to all kinds of wrong answers, so we have to check our solution in both of our original equations. 2x – y = 3 Equation 1 2(3) – 3 = 3 substitute in x = 3, y = 3 3 = 3 check 4x + 3y = 21 Equation 2 4(3) + 3(3) = 21 substitute in x = 3, y = 3 21 = 21 check The key to properly solving systems of linear equations using the Method of Elimination is to choose the right constant to multiply one of the equations by. As long as you end up with the coefficients of one variable having the same magnitude and different signs, you should have no problem. Also remember that when we multiply the equation by a constant, we have to multiply both sides of the equation by the constant, including each and every term on both sides. Finally, when we add equations, we add like terms. That is, we add the terms involving x together, then we add the terms involving y-together, and finally we add the constant terms together. Linear Equations Method of Elimination – 85 86 – Method of Elimination Linear Equations Advanced To solve systems of linear equations that involve several variables. row-echelon form (linear system) – a linear system in a stair-step form with the leading coefficients of each line being 1. Regiomontanus (1436-1476), a German trigonometrist and astronomer, also practiced astrology, observing the close ties between celestial objects and events previously linked to superstition. He predicted that sailors would someday use the moon for navigation, which did occur during the Age of Exploration after his death. He brought both plane and spherical trigonometry into the realm of pure mathematics and out of its apprenticeship to astronomy. Regiomontanus is actually his pen name. It is Latin for his hometown of Königsberg, which translates into English as "King's Mountain". Linear Equations → Linear Systems → Multivariable Linear Systems In the other thoughts in this section of the Algebra Brain, we deal strictly with only two equations involving at most two unknowns (x and y). The processes that we use to solve those equations can be extended to linear equations involving three or more variables. The Method of Elimination is extremely useful for solving multivariable linear systems. Ideally, we use transformations to turn one of the equations of the system into an equation involving a single variable, which we can easily solve for. Using that nugget of information, we proceed to back-substitute that value into one of the other equations so that we can solve for another variable. We continue this process of back-substitution until we have found solutions for each and every variable. The total solution to the system consists of all real numbers that satisfy all of the linear equations in the system. Use back-substitution to solve the following system of linear equations: 3x + 4y – 2z = 13 Equation 1 –x + 2z = –3 Equation 2 z = –1 Equation 3 solution Since we are given that z = –1, we can back-substitute this value into Equation 2: –x + 2z = –3 Equation 2 –x + 2(–1) = –3 substitute in z = –1 –x – 2 = –3 simplify –x = –1 add 2 to both sides x = 1 divide both sides by –1 Linear Equations Multivariable Linear Systems – 87 Now that we know x, we can substitute x = 1 and z = –1 to in Equation 1 and then solve the equation for y: 3x + 4y – 2z = 13 Equation 1 3(1) + 4y – 2(–1) = 13 substitute in x = 1 and z = –1 3 + 4y + 2 = 13 simplify 4y = 8 subtract 5 from both sides y = 2 divide both sides by 4 We now have values for x, y and z, so the solution is the ordered triple: (x, y, z) = (1, 2, –1) answer We leave it up to you to verify that this is the correct solution for this linear system. In the example above, we illustrated the power of back-substitution. Unfortunately, very few linear systems are so neat and tidy. The good news is that we can transform linear systems into a form that will allow us to use back-substitution to solve the system. Gaussian Elimination One method we can use to solve multivariable linear systems is called Gaussian elimination, named after Carl Friedrich Gauss (1777–1855). Two systems are said to be equivalent if the same solution satisfies both systems. In order to solve a system of linear equations, we convert the system into an equivalent system that is in row-echelon form. This refers to the fact that we end up with a "stair-step" pattern in our new linear system. The leading coefficient of each equation in the new system is 1. In order to produce equations that are equivalent to our original system of equations, we apply the following operations on our system: Operations that Produce Equivalent Systems Each of the following row operations on a system of linear equations produces an equivalent system of linear equations. 1.) Interchange two rows 2.) Multiply one of the equations by a non-zero constant. 3.) Add a multiple of one equation to another equation to replace the latter equations Let's see how these operations can help us solve a system of linear equations. Strap yourself in and feel the g's! Solve the system of linear equations and check the solution algebraically. 2x – y + 4z = –8 Equation 1 x + 2y – 3z = 11 Equation 2 4x + 2y + z = 6 Equation 3 solution Allll righty, then. Let's get started. The first thing we notice is that the leading coefficient of Equation 2 is 1. Remember that row-echelon form has the leading coefficient of each equation being 1. Therefore, we will move Equation 2 up to the top of the stack. That is, we will interchange Equation 1 with Equation 2: 88 – Multivariable Linear Systems Linear Equations x + 2y – 3z = 11 2x – y + 4z = –8 4x + 2y + interchange Equation 1 with Equation 2 z = 6 Our next steps will involve eliminating the x-variables in the second and third rows. From now on, we will refer to our operations using the notation Rn, where R represents that we are operating on a row of the linear system and n indicates which row is being manipulated. Our first step above, then, would be represented by "interchange R1 with R2". Now when we refer to Row 1, we will be referring to the current top row of the linear system. We write the operation we are performing next to the row that is being changed. Back to our problem. To eliminate the x-variable in Row 2, we subtract 2 times Row 1 from Row 2 (we write this as R2 – 2R1). Note that we multiply all the terms in Row 1 by the constant 2: x + 2y – 3z = 11 –5y + 10z = –30 4x + 2y + R2 – 2R1 (subtract 2 times Row 1 from Row 2) z = 6 To eliminate the x-variable in Row 3, we subtract 4 times Row 1 from Row 3 and place the result in Row 3: x + 2y – 3z = 11 – 5y + 10z = –30 – 6y + 13z = –38 R3 – 4R1 (subtract 4 times Row 1 from Row 3) At this point we notice that we have completely eliminated the variable x from both the second and third rows of the linear system. Now we can work on one of the other variables. Let's start getting rid of the y's in the third row. First, though, we need to make sure that the leading coefficient of the second row is 1. To do that, we multiply Row 2 by –1/5 (or divide Row 2 by –5, however you prefer to look at it). x + 2y – 3z = 11 y – (–1/5)R2 (multiply Row 2 by –1/5) 2z = 6 – 6y + 13z = –38 Next, we are going to add 6 times Row 2 to Row 3 to get rid of the variable y in Row 3: x + 2y – 3z = 11 y – 2z = 6 z = –2 R3 + 6R2 (add 6 times Row 2 to Row 3) Let's take a moment to see what we have done so far. Right now, the leading coefficient of each equation is 1. Also, the equations are arranged in a "stair step" pattern. Therefore, we conclude that we indeed have transformed our original linear system into row-echelon form. Furthermore, we can see that we have solved the system for one of the variables, namely z = –2. Thus, we can start back-substituting and work our way up through the equivalent linear system. We let z = –2 in Row 2 and solve that equation: y – 2z = 6 Row 2 y – 2(–2) = 6 substitute in z = –2 y+4 = 6 simplify y = 2 subtract 4 from both sides Linear Equations Multivariable Linear Systems – 89 We now know two of the three variables in our linear system: z = –2 and y = 2. We can back-substitute again by putting these values in for y and z into Row 1 and solving for x: x + 2y – 3z = 11 Row 1 (originally this was Equation 2) x + 2(2) – 3(–2) = 11 substitute in y = 2 and z = –2 x+4+6 = 11 simplify x + 10 = 11 simplify x = 1 subtract 10 from both sides We have just found the last piece of the puzzle. Our solution can be written as the ordered triple: (x, y, z) = (1, 2, –2) answer Of course, we will never be absolutely sure that this is the correct solution unless we can verify that it works for all three original equations: 2x – y + 4z = –8 Equation 1 2(1) – 2 + 4(–2) = –8 substitute in x = 1, y = 2, and z = –2 –8 = –8 simplify check x + 2y – 3z = 11 Equation 2 1 + 2(2) – 3(–2) = 11 substitute in x = 1, y = 2, and z = –2 11 = 11 simplify check 4x + 2y + z = 6 Equation 3 4(1) + 2(2) + (–2) = 6 substitute in x = 1, y = 2, and z = –2 6 = 6 simplify check Since the solution fits all three linear equations, we can conclude that this is indeed the correct solution. answer Note: It is actually possible to check our answer graphically if we move from a two-dimensional plane to three-dimensional space. The solution (1, 2, –2) is represented by the point of intersection of three lines in space. Although we have shown a system where we have a unique solution, not all linear systems are so fortunate as to have one. If, for some reason, we are clicking merrily along and come to a point where we end up with an equation that doesn't make sense. For instance: 0 = –2 inconsistency doesn't make sense because this statement is clearly FALSE. Therefore, any system that results in the statement just given must have no solution. On the other hand, if a linear system does not have one solution or no solutions, then it must, by definition, have infinitely many solutions. If we have a system of linear equations where one equation is just a constant multiple of another equation (exactly a constant multiple), then we have a system where we have fewer equations than we have unknowns. In that case, we have infinitely many solutions. 90 – Multivariable Linear Systems Linear Equations In general, a linear system has exactly ONE solution if and only if we have the same number of equations as we have variables. This kind of linear system is called square. One final thought: As we are sure you realize, systems of linear equations are very tricky beasts to handle. One of the most effective methods for solving systems of linear equations involves Matrices (described elsewhere in the Algebra Brain). When we plug the coefficients of a linear system into a matrix, it becomes much, much easier to find the solutions performing exactly the same sort of operations as we perform above. Linear Equations Multivariable Linear Systems – 91 92 – Multivariable Linear Systems Linear Equations Advanced To solve systems of linear inequalities. No definitions on this page. In 1474, Regiomontanus (1436-1476) printed his Ephermides, an almanac of daily planetary rotations and eclipses for the years 1475-1506. Christopher Columbus used this work as a reference to impress the Jamaican natives with awe-inspiring magical powers. One of the celestial objects Regiomontanus researched was the comet of 1472, which would be renamed Halley's Comet 210 years later upon its third return. Linear Equations → Linear Systems → Systems of Inequalities In the rest of the Algebra Brain, when we refer to inequalities, we usually refer to inequalities involving a single variable. Now we are going to expand our discussion to inequalities involving two variables. Furthermore, we are going to talk about systems of inequalities. This is very similar to systems of equations. Consider the system of inequalities: –x + y ≤ 1 Inequality 1 2x + 3y > 13 Inequality 2 If we were talking about equations, then the solution to this system would be all values (a, b) such that when a and b are substituted into each part of the system, the entire system is satisfied. However, these are inequalities. This means that the solution set of the system will be a range of values that have to satisfy both of the inequalities. To sketch the graph of a system of inequalities, we first sketch the graph of the corresponding equations. That is, we replace the inequality symbol with "=" and draw those equations. If we replace "<" or ">" with "=", then we draw a dotted line for the equation. The solution set then consists of all points that fall inside a region in the plane. For instance, in the example above, we would want all points that are below the line x + y = 5 and above the line 2x + 3y = 10, as shown in the shaded region on the graph below: Linear Equations Systems of Inequalities – 93 Let's test the two points given in the graph above and see what happens when we plug them into the system of inequalities. First, we will use the point (–2, 3): –x + y ≤ 1 Inequality 1 –(–2) + 3 ≤ 1 substitute in x = –2 and y = 3 2+3 ≤ 1 simplify 5 ≤ 1 FALSE 2x + 3y > 13 Inequality 2 2(–2) + 3(3) > 13 substitute in x = –2 and y = 3 –4 + 9 > 13 simplify 5 > 13 FALSE Both statements are false, so we conclude that the point (–2, 3) cannot be a solution to the system of equation. In order for a given point to be a solution, both statements have to be TRUE. Let's test the other point, (4, 3): –x + y ≤ 1 Inequality 1 –4 + 3 ≤ 1 substitute in x = 4 and y = 3 –1 ≤ 1 TRUE 2x + 3y > 13 Inequality 2 2(4) + 3(3) > 13 substitute in x = 4 and y = 3 8+9 > 13 simplify 17 > 13 TRUE Both statements are true, so we conclude that all points inside this region are indeed solutions to the system of inequalities. Note: In this system, the point of intersection of the two lines (2, 3) is not a solution to the system of inequalities because we cannot use points that are on the line 2x + 3y = 13. The lines given by the system of inequalities divide the plane into several regions. Inside each region of the plane, exactly one of the following must be true: 1.) All points in the region are solutions of the inequality. 2.) No point in the region is a solution of the inequality. Therefore, we can test each region of the plane by simply trying a point inside that region. If a single point in a region satisfies the inequality, then all points in that region must also satisfy the inequality. Sketching the Graph of an Inequality in Two Variables 1.) Replace the inequality sign by an equal sign, and sketch the graph of the resulting equation. Use a dashed line for < or > and a solid line for ≤ or ≥. 2.) Test one point in each of the regions formed by the graph in Step 1. If the point satisfies the inequality, shade the entire region to denote that every point in the region satisfies the inequality. 94 – Systems of Inequalities Linear Equations Sketch the graph of the solution to the system of inequalities. –x + 3y ≤ 12 Inequality 1 2x + y > 7 Inequality 2 solution First, we will replace the inequality symbol in each inequality by "=" so that we can graph the corresponding lines: –x + 3y ≤ 12 Inequality 1 3y ≤ x + 12 add x to both sides y ≤ y = 2x + y > 7 Inequality 2 y > –2x + 7 subtract 2x from both sides y = –2x + 7 replace > with = to find the corresponding equation x 3 x 3 +4 divide both sides by 3 +4 replace with = to find the corresponding equation We graph the corresponding equations as shown below. However, we use a dashed line to graph the second corresponding equation because we replaced > with =. This means that points that fall on the line y = –2x + 7 are not part of the solution. Once we have graphed both lines, we shade the points in the plane that satisfy both inequalities. That is, we want points that satisfy y ≤ x/3 + 4 and points that satisfy y > –2x + 7. The shaded region shown below gives us these points. answer Linear Equations Systems of Inequalities – 95 96 – Systems of Inequalities Linear Equations Unit 3 Functions Functions – 97 98 – Functions Advanced To learn the basic definition of a function. function – an association of exactly one object from one set (the range) with each object from another set (the domain). A relationship in which different ordered pairs have different first coordinates. domain (of a function) – the set of values which are allowable substitutions for the independent variables. range – the set of values of the function evaluated at points contained within the domain. The function notation f(x) was first introduced by Leonhard Euler in 1734 in Commentarii Academiae Scientiarum Petropolitanae. Other mathematicians later adapted the same sort of notation to represent other commonly occurring types of functions in mathematics, such as gamma functions, beta functions, and Riemann Zeta functions. Functions What is a function? According to the mathematical definition of a function: Definition of Function A function f from a set A to a set B is a rule of correspondence that assigns to each element x in the set A exactly one element y in the set B. The set A is the domain (or set of inputs) of the function f, and the set B contains the range (or set of outputs) of the function f. What does this mean, exactly? A function establishes a relationship. That means that we want a connection between two numbers. One number is going to be dependent on the other number. An element of the domain is paired with an element of the range. This is the connection that we want to establish. But what makes up the domain and what makes up the range? The domain determines what numbers will be in the range. Furthermore, each element inside the domain will produce one and only one element inside the range. The typical example is with two groups of numbers, one designated as the domain and the other designated as the range: function Functions Introduction to Functions – 99 Note that we have "mapped" two elements of the domain onto the same element on the range. This is perfectly legal. However, it is not permitted to map the same element of the domain onto two elements of the range and still call it a function. In graphic terms: not a function Let's look at how this works from a more practical standpoint. Imagine that One-Eyed Squirrel Industries has a stock that sells for $34 a share in March. In April, the stock has raised to $52 a share. In May, the stock market takes a major hit and One-Eyed Squirrel Industries stock plummets to a mere $6 a share. However, in June, it picks itself back up again and climbs up to its original value of $34 a share. Now let's put this scenario in terms of domains and ranges. In this case, the amount the stock is worth is dependent on the month of the year. So that means that the month is the domain and the worth of the stock is the range: We can plot this information on a graph: One-Eyed Squirrel Industry Stock Prices 60 52 50 price 40 30 34 34 Series1 20 10 0 March 6 April May June month If we were just given the graph and not the domain and range that we plotted above, how could we tell if this graph was a function or not? There is a really simple way to tell. If we can draw a vertical line through any 100 – Introduction to Functions Functions point of the graph and it only intersects our graph at exactly one point, then this is a function. This is known as the Vertical Line Test: Vertical Line Test If a vertical line intersects a graph at more than one point, then the graph is not the graph of a function. For most practical applications, a function takes in a number, manipulates it, and spits another number back out the other end. For example, if we plug the numbers 2, 3, 4, and 5 into the function f(x) = 3x + 2, we will get the numbers 8, 11, 14, 17 back out: f(x) = 3x + 2 example of a function f(2) = = 3(2) + 2 8 substitute in 2 for x simplify f(3) = = 3(3) + 2 11 substitute in 3 for x simplify f(4) = = 3(4) + 2 14 substitute in 4 for x simplify f(5) = = 3(5) + 2 17 substitute in 5 for x simplify We could easily plug in any number for x, and get a different number as a result. Functions Introduction to Functions – 101 102 – Introduction to Functions Functions Advanced To review the basic terminology associated with functions. dependent variable – a variable whose value(s) always depend on the values of other variable(s). independent variable – in a formula, a variable upon whose value other variables depend. Leonhard Euler (1707-1783) was a Swiss geometer and number theorist. Among his countless contributions to mathematics is the notation f(x) for a function of x. At the age of 19, he entered a contest involving the best place on a ship to set the masts. He only won honorable mention for his efforts, not bad considering he had never seen a ship before. The greatest accomplishments of his career occurred during the last quarter of his life, when he was totally blind. Euler's final achievement was the calculation of the orbit of the newly discovered planet Uranus. Functions → Function Terminology There is a lot of terminology associated with functions, so we have come up with a handy reference list of the common terms associated with functions. Function Terminology Function: A function is a relationship between two variables such that to each value of the independent variable there corresponds exactly one value of the dependent variable. Another way to put it is: Given two sets, M and N, let f denote a rule that assigns each element in M one and only one element in N. Function Notation: y = f(x), read as "y equals f of x". f is the name of the function y is the dependent variable x is the independent variable f(x) is the value of the function at x Domain: The domain of a function is the set of all values (inputs) of the independent variable for which the function is defined. If x is in the domain of f, we say f is defined at x. If x is not in the domain of f, we say that f is undefined at x. Range: The range of a function is the set of all values (outputs) assumed by the dependent variable (that is, the set of all function values). Implied Domain: If f is defined by an algebraic expression and domain is not specified, the implied domain consists of all real numbers for which the expression is defined. Functions Function Terminology – 103 104 – Function Terminology Functions Advanced To review the definition of polynomial functions and why they are useful. constant function – a function in which f(x) is simply equal to a real (or complex) number. linear function – a polynomial function of the first degree. quadratic function – a polynomial function of the second degree. Niels Henrik Abel (1802-1829), a Norwegian algebraist, proved that fifth and higher order equations have no algebraic solution. He also begun a general proof of the binomial theorem, which at the time had only been proven for specific cases. He died on April 6, 1829, from tuberculosis. Functions → Polynomial Functions Simply put, a polynomial function is any function that is composed of the sum or difference of monomials all set equal to some value determined by the value of x in the polynomial. A formal definition is as follows: Definition of a Polynomial Function Let n be a nonnegative integer and let an, an-1, . . . , a2, a1, a0 be real numbers with an ≠ 0. The function given by: f(x) anxn + an-1xn-1 + . . . + a2x2 + a1x + a0 = polynomial function of degree n is called a polynomial function of x with degree n. Polynomial functions are classed by degree. For instance, the polynomial given by: f(x) = a a≠0 is called a constant function and has degree 0, because n = 0. The graph of this type of function is a horizontal line. The polynomial f(x) = ax + b a≠0 is called a linear function and has degree 1. The graph of any function of this sort is a line whose slope is given by a and whose y-intercept is given by (0, b). Polynomial functions of degree 2 are also called quadratic functions. They take the form: f(x) = ax2 + bx + c a≠0 The word "quadratic" comes from the Latin word for "square", as in "x-squared". Quadratic functions are so important there is an entire section of the Algebra Brain devoted to nothing but quadratics. Polynomial functions of higher degree do not really have special names, although we sometimes refer to thirddegree polynomials (f(x) = ax3 + bx2 + cx + d) as "cubic" functions. Functions Introduction to Polynomial Functions – 105 In general, we use polynomial functions to model different relationships. For example, we might use a quadratic equation to model how much a company spends on advertising versus how much it makes in profit. We might want to model cigarette consumption in the United States to see if putting a warning label on cigarettes is actually doing any good. If we are into marketing, we might want to model how many people bought our product over a course of time. The advantage of modeling data is that we can use the model to make a prediction of the future. Modeling is used extensively in manufacturing to test products before they are even built. It is far cheaper in the long run to test a rocket's trajectory on a computer than it is to actually build one and see if it goes where we want it to go. This is where polynomial functions come in. 106 – Introduction to Polynomial Functions Functions Advanced To learn how to determine just how many solutions a given polynomial has. multiplicity of a root r – for a root r of a polynomial equation P(x) = 0, the highest power of x – r that appears as a factor of P(x). repeated roots (or zeros) – a solution to a polynomial equation that occurs more than once. The Indian astronomer Ǻryabhata had the idea in 510 C.E. to use the thirty-three letters of the Indian alphabet to represent all the numbers between 1 and 1018. This sort of notation is what is known as a 'numeral alphabet'. Functions → Polynomial Functions → Fundamental Theorem - Algebra One of the most elementary operations performed in algebra is finding solutions to polynomial equations. Most of the time, it is convenient if we can tell at a glance how many solutions we will find when we actually try and solve it. The brilliant mathematician Karl Gauss (1777 – 1855) demonstrated a theorem that greatly simplifies the process of finding zeros or roots. We call his result the Fundamental Theorem of Algebra. We present it without proof because in order to prove it, we would need to resort to complex variables, which is far beyond the scope of this current level of instruction. The Fundamental Theorem of Algebra If f(x) is a polynomial of degree n, where n > 0, then f(x) has at least one zero in the complex number system. As a consequence of the Fundamental Theorem of Algebra, any polynomial of degree n > 0 can be written as the product of n factors. This property is called the Linear Factorization Theorem and is written formally as: Linear Factorization Theorem If f(x) is a polynomial of degree n, where n > 0, f(x) = anxn + an – 1xn – 1 + . . . + a1x + a0 then f(x) has precisely n linear factors f(x) = an(x – c1)(x – c2) . . . (x – cn) where c1, c2, . . . , cn are complex numbers and an is the leading coefficient of f(x). Each of the linear factors of f(x) corresponds to each zero of f(x). This theorem can be proved using the Fundamental Theorem of Algebra to conclude that f must have at least one zero, c1. Therefore, (x – c1) is a factor of f(x) and we can rewrite f(x) as f(x) = (x – c1)f1(x) where f1(x) is formed from the remaining factors of f(x) If the degree of f1(x) is greater than 0, then we can apply the Fundamental Theorem of Algebra again to conclude that f1(x) must have a zero, which we will call c2. Thus, we can rewrite f(x) as: f(x) Functions = (x – c1)(x – c2)f2(x) Fundamental Theorem of Algebra – 107 By this time, we should start to see a pattern emerging. The degree of f1(x) is n – 1, where n is the degree of our original polynomial f(x). Then for each successive function, we subtract 1 from the degree of the preceding function. Thus the degree of f2(x) is n – 2, the degree of f3(x) is n – 3 and so on. By repeatedly applying the Fundamental Theorem of Algebra n times, we obtain: f(x) = an(x – c1)(x – c2) . . . (x – cn) where an is the coefficient of the polynomial f(x) Note: Neither of these two theorems tells us how to find the solutions of a polynomial equation nor do they tell us how many solutions are real and how many are complex. Instead they simply tell us that such solutions exist. Thus, if we are solving a cubic equation, then we better end up with three solutions (real, complex, or some combination of real and complex), or else! 1. f(x) = x – 4 has exactly 1 zero located at x = 4. 2. f(x) = x2 – 8x + 16 = (x – 4)(x – 4) has exactly 2 zeros, both located at x = 4. These are known as repeated zeros (or double zeros if only two are the same). Example 2 illustrates the multiplicity of a root r. In this case, we say that 4 is a root with multiplicity 2 (because it occurs twice). In general, the multiplicity of a root r is the highest power of (x – r) that appears as a factor of the polynomial. 3. f(x) = x3 + 9x = x(x2 + 9) = x(x + 3 i)(x – 3 i) has exactly 3 zeros located at x = 0, x = –3 i and x = 3 i. Note: 2 of these zeros are complex. 4. f(x) = x4 – 1 = (x2 – 1)(x2 + 1) = (x – 1)(x + 1)(x – i)(x + i) has exactly 4 zeros located at x = 1, x = –1, x = i and x = –i. Note: 2 of these zeros are complex. The French mathematician Jean Le Rond d'Alembert (1717 – 1783) worked independently of Karl Gauss to prove the Fundamental Theorem of Algebra. As a result of his efforts, in France, the Fundamental Theorem of Algebra is often called the Theory of d'Alembert (not to be confused with any other theories of d'Alembert that you may have heard of). 108 – Fundamental Theorem of Algebra Functions We deal with four fundamental operations in algebra: 1. 2. 3. 4. Addition Subtraction Multiplication Division We can apply these same operations to any combination of polynomial expressions. In the thoughts below, we will discuss how to add and subtract two polynomials, how to multiply two polynomials together, and two distinct methods of dividing polynomials. In general, adding or subtracting polynomials is very similar to adding and subtracting real numbers. Multiplying two polynomials requires one to fully understand how the Distributive Property works. Dividing two polynomials is either a matter of long division, as we do with real numbers, or synthetic division for special cases of polynomial division. Synthetic division is a handy short cut that eliminates all the messiness associated with long division. Functions Polynomial Operations – 109 110 – Polynomial Operations Functions Advanced To simplify, add and subtract polynomials. like terms – terms that have exactly the same variables to the same powers. Isaac Newton (1643-1727) is one of the most famous mathematicians of all time. He invented the calculus, which relies on comparing the rates of change between two or more quantities. With calculus, we can analyze the physics of moving objects using pure mathematics. Newton also did important work in astronomy and optics. His telescope so impressed the Royal Society in London that they invited him to join. After tiring of the academic world, Isaac Newton became Master of the Royal Mint in 1700. He became wealthy from this position because he received a commission on the coins minted. Functions → Polynomial Functions → Polynomial Operations → Adding / Subtracting Polynomials Just as we can add, subtract, multiply and divide real numbers, we can perform the same operations on polynomial expressions. In order to add two polynomial expressions, we have to see if there are any like terms , or terms having the same variables to the same powers. For example, 3x2 and 2x2 are like terms, but 3x2 and 2x3 are not, because one term has x risen to the second power and the other term has x risen to the third power. Once we see that we have like terms in our expression then we can simply add the coefficients of like terms together. Thus, if we have 3x2 and 2x2, which we have already stated as being like terms, then their sum is: 3x2 + 2x2 = (3 + 2)x2 add coefficients of like terms = 5x2 simplify = (3 – 2)x2 subtract coefficients of like terms = 1x2 simplify And their difference is: 3x2 – 2x2 Simplify the following expression (5x2 + 2x) – (x2 + 3x – 6) solution First, remove the parentheses, distributing the negative sign in front of the second set of parentheses. This makes it easier to rearrange the terms so all the like terms are together. Then add the coefficients of like terms. Functions Adding / Subtracting Polynomials – 111 (5x2 + 2x) – (x2 + 3x – 6) 5x2 + 2x – x2 – 3x + 6 remove parentheses (easier to simplify) 5x2 – x2 + 2x – 3x + 6 group like terms (5 – 1)x2 + (2 – 3)x + 6 4x2 – x + 6 Note: given expression add coefficients of like terms to simplify answer When simplifying polynomial expressions, it is generally convenient to list the x-terms in order of decreasing exponents. Thus when grouping the terms together to simplify, place all the terms of the highest-power of x first, then group the terms of the next highest power of x and so on down to the lowest power of x, x0 = 1. A common mistake when distributing the negative sign inside the parentheses is to fail to change the sign of each term inside the parentheses. For instance, –(x2 + 3x – 6 ) = –x2 – 3x + 6 –(x2 + 3x – 6) ≠ –x2 + 3x – 6 112 – Adding / Subtracting Polynomials and Functions Advanced To multiply two or more polynomials together. No definitions on this page. Claudius Ptolemy (c.70-c.130 A.D.) was a famous Greek astronomer, geometer, and geographer. Ptolemy's great work, the Almagest, provides a comprehensive overview of Greek geometry and trigonometry. Ptolemy's geocentric (Earth-centered) view of the universe was so compelling that it remained in force for over 1400 years, until Copernicus set forth his heliocentric (sun-centered) view of the solar system. Ptolemy the geographer attempted to map the Known World at that time, but since his knowledge of geography was effectively limited to the Roman Empire, his maps were very inaccurate. He did, however, introduce the first systematic use of longitudes and latitudes. His directions for creating maps were still being used by cartographers as late as the Renaissance. Functions → Polynomial Functions → Polynomial Operations → Multiplying Polynomials The Distributive Property is used to multiply two polynomials together. We treat one of the polynomials as a single quantity and multiply that single quantity by both terms in the second polynomial. Essentially, we apply the Distributive Property twice. For example, if we want to multiply (3x + 7) by (4x – 1), then we have the following: (3x + 7)(4x – 1) = 3x(4x – 1) + 7(4x – 1) let (4x – 1) be a single quantity and apply the Distributive Property = (3x)(4x) – (3x)(1) + (7)(4x) – (7)(1) apply the Distributive Property again = 12x2 – 3x + 28x – 7 multiply terms = 12x2 + 25x – 7 add like terms to simplify If we take a closer look at what we did, we will see a pattern emerge. We multiplied the First terms of the polynomials together, followed by the Outside terms of the polynomials, followed by the Inside terms, and followed finally by the Last terms. We call this the FOIL method of multiplying two binomials together. Note that the outer (O) and inner (I) terms are like terms and can thus be combined into one term. FOIL Method First terms: (3x)(4x) Outside terms: (3x)(–1) Inside terms: (7)(4x) Last terms: (7)(–1) This method works great for simple binomials, but what happens when we try to multiply two trinomials together? It turns out that we perform a similar series of operations. In effect, we need to multiply each term of the first trinomial by each term of the second trinomial. Functions Multiplying Polynomials – 113 Extended Distributive Property To multiply two polynomials multiply each term in the first polynomial by each term in the second polynomial. Multiply (x2 – 4x + 6) by (5x2 + 2x – 3). solution (x2 – 4x + 6)(5x2 + 2x – 3) Note: = x2(5x2 + 2x – 3) – 4x(5x2 + 2x – 3) + 6(5x2 + 2x – 3) multiply each term of the first trinomial by the second trinomial = (5x4 + 2x3 – 3x2) – (20x3 + 8x2 – 12x) + (30x2 + 12x – 18) apply Distributive Property = 5x4 + 2x3 – 3x2 – 20x3 – 8x2 + 12x + 30x2 + 12x – 18 remove parentheses = 5x4 – 18x3 + 19x2 + 24x – 18 combine like terms together answer When multiplying out polynomials, it is a good idea to not skip any steps. By taking care to multiply each and every term together, it is far less likely that you will make any sign errors or other mistakes. We can easily (so to speak) extend the multiplication of trinomials to polynomials with even more terms. The trick is to be careful and take your time to avoid any inadvertent errors. The most common errors in multiplying any two polynomials are sign errors. In general, if one polynomial has m terms and the other polynomial has n terms, then the total number of terms obtained by multiplying the two polynomials together will be m · n terms. If we multiplied this result by a polynomial with q terms, then we would obtain a polynomial that contains m · n · q terms. 114 – Multiplying Polynomials Functions Advanced To recognize patterns when multiplying polynomials. No definitions on this page. Christiaan Huygens (1629–1695) was a Dutch astronomer, mathematician, and physicist. He was a Cartesian, which meant he followed the ideas of René Descartes rather than the ideas of Isaac Newton. He is generally known as for two important contributions to science. The first is a pendulum clock inspired by the work of Galileo. The principles behind his clock are still used today in grandfather and cuckoo clocks. Huygens is also famous for his wave theory of light, which went against Newton's theories of optics. As it turns out, light is both a wave and a particle, but neither Huygens nor Newton was aware of this fact at the time. Finally, Huygens discovered the rings of Saturn, disproving Galileo's idea that Saturn was actually three planets. He was the first to notice markings on the surface of Mars and discovered Saturn's largest moon, Titan. Functions → Polynomial Functions → Special Product Patterns Oftentimes in algebra, we have to multiply two (or more) binomials together. Obviously, we can always resort to the FOIL method for multiplication, but that can become tedious very quickly. In addition, it is easy to make mistakes, particularly sign errors. However, if we can instead recognize patterns amongst binomials, then we can simply apply one of the following formulas. This requires you to actually memorize the formulas. Sum and Difference (u + v)(u – v) = u2 – v2 = x2 + 4x – 4x – 16 FOIL method = x2 – 16 simplify answer Multiply. (x – 4)(x + 4) solution (x – 4)(x + 4) Square of a Binomial Functions (u+ v)2 = u2 + 2uv + v2 (u – v)2 = u2 – 2uv + v2 Special Product Patterns – 115 Multiply. (5x + 7)2 solution (5x + 7)2 = 25x2 + 2(5x)(7) + 49 apply Square of Binomial formula = 25x2 + 70x + 49 simplify answer Cube of a Binomial (u + v)3 = u3 + 3u2v + 3uv2 + v3 (u – v)3 = u3 – 3u2v + 3uv2 – v3 Multiply. (2x – 4)3 solution (2x – 4)3 = (2x)3 – 3(2x)2(4) + 3(2x)(4)2 – (4)3 apply Cube of Binomial formula = 8x3 – 12(4x2) + (6x)(16) – 64 simplify = 8x3 – 48x2 + 96x – 64 simplify answer 116 – Special Product Patterns Functions Advanced To recognize patterns when factoring polynomials. No definitions on this page. Friedrich Wilhelm Bessel (1784-1846) was a self-taught mathematical and astronomical genius. He was the first to accurately calculate the distances between stars. He is also the first to use the term 'light year', which refers to the distance light travels in a year, roughly 6 trillion miles (give or take a few million miles). In 1840, he discovered odd movements in Uranus's orbit. He theorized that there was another celestial body exerting gravitational influence on Uranus. Unfortunately for him, Neptune, the cause of the disturbances, was not discovered until after Bessel had already died. Functions → Polynomial Functions → Special Factoring Patterns There are a number of useful formulas to help in remembering how to factor. These formulas make it easy to factor a polynomial if we can recognize its pattern. Each is also fairly straightforward to prove and follows from the Distributive Property of Multiplication. Perfect Square Trinomial u2 + 2uv + v2 = (u + v)2 u2 – 2uv + v2 = (u – v)2 Proof: (u + v)2 (u – v)2 = (u + v)(u + v) rewrite = u2 + uv + uv + v2 use FOIL method = u2 + 2uv + v2 simplify = (u – v)(u – v) rewrite = u2 – uv – uv + v2 use FOIL method = u2 – 2uv + v2 simplify proof Factor. 25 – 10x + x2 solution 25 – 10x + x2 Functions = 52 – 2(5)(x) + x2 rewrite = (5 – x)2 answer Special Factoring Patterns – 117 Difference of Two Squares u2 – v2 = (u + v)(u – v) Proof: (u + v)(u – v) = u2 – vu + vu – v2 use FOIL method = u2 – v2 simplify proof Factor. 16 – 9x2 solution 16 – 9x2 = (4)2 – (3x)2 rewrite = (4 – 3x)(4 + 3x) answer Sum of Two Cubes u3 + v3 (u + v)(u2 – uv + v2) = Proof: (u + v)(u2 – uv + v2) = u3 – u2v + uv2 + u2v – uv2 + v3 multiply out all terms = u3 + v3 simplify proof Factor. 8x3 + 27 solution 8x3 + 27 = (2x)3 + (3)3 rewrite = (2x + 3)((2x)2 – (2x)(3) + 32) apply Sum of Two Cubes formula = (2x + 3)(4x2 – 6x + 9) simplify answer 118 – Special Factoring Patterns Functions Difference of Two Cubes u3 – v3 = (u – v)(u2 + uv + v2) Proof: (u – v)(u2 + uv + v2) = u3 + u2v + uv2 – u2v – uv2 – v3 multiply out all terms = u3 – v3 simplify proof Factor. 64 – 27x3 solution (64 – 27x3) = (4)3 – (3x)3 rewrite = (4 – 3x)(42 + (4)(3x) + (3x)2) apply Difference of Two Cubes formula = (4 – 3x)(16 + 12x + 9x2) simplify answer Factoring by Grouping au3 + acu2 + bu + bc = au2(u + c) + b(u + c) = (au2 + b)(u + c) where a and c are constants Proof: (au2 + b)(u + c) = au3 + acu2 + bu + bc use FOIL method proof Factor. 5x3 – 10x2 + 3x – 6 solution 5x3 – 10x2 + 3x – 6 Functions = 5x3 – (2)(5)x2 + 3x – (2)(3) factor coefficients = 5x2(x + (–2)) + 3(x + (–2)) a = 5, b = 3, c = –2 = (5x2 + 3)(x – 2) Distributive Property in reverse answer Special Factoring Patterns – 119 120 – Special Factoring Patterns Functions Advanced 1.) Divide a polynomial by a monomial or binomial factor. 2.) Use polynomial long division. No definitions on this page. Multiplication and division in Ancient Egypt only required one to memorize one\'s two-times tables. They used an ingenious method of doubling and addition to multiply and divide numbers. Functions → Polynomial Functions → Polynomial Operations → Polynomial Division Dividing numbers is a tricky business, and dividing polynomials can be even more so. When we divide two numbers, either one of two things can happen: the numbers divide cleanly, or we obtain a remainder. Consider the example of 9 divided by 3. Everybody knows that the answer is 3, so that tells us that (3)(3) = 9. Now what if we divided 9 by 4? Then we know that 4 goes into 9 evenly 2 times, 2 x 4 only equals 8, so we end up with a remainder of 1. That is, 9 divided by 4 equals 2 R 1. This tells us that 4 and 2 are not factors of 9. In more familiar terms, we can write this as: 2 2 9 –8 1 R 1 We can attribute the same behavior to division of polynomials. If we divide the polynomial f(x) by another polynomial d(x) and obtain a remainder of zero, then we can conclude that the divisor d(x) and the quotient q(x) are both factors of the original polynomial f(x). In the same vein, if f(x) does not divide by d(x) evenly, then we will be left with a remainder function r(x), which, when added to the product of the divisor d(x) and the quotient q(x), will total our original function f(x). Using the same familiar model above, we can write this as: d(x) q(x) f(x) R r(x) This leads us to the following mathematical process of division formally known as the Division Algorithm and is valid for ANY function f(x) (including when f(x) is simply a real number such as the aforementioned 9). Functions Polynomial Division – 121 The Division Algorithm If f(x) and d(x) are polynomials such that d(x) ≠ 0 and the degree of d(x) is less than or equal to the degree of f(x), there exist unique polynomials q(x) and r(x) such that f(x) = d(x)q(x) + r(x) f(x): d(x): q(x): r(x): dividend divisor quotient remainder where r(x) = 0 or the degree of r(x) is less than the degree of d(x). If the remainder r(x) is zero, d(x) divides evenly into f(x) so d(x) and q(x) are both factors of f(x) in that instance. As usual when presenting an algorithm or formula, it is often helpful to see the theorem in action, so let's do a quick example of how this might work in practice: Divide x3 – 1 by x – 1. solution Because there is no x2-term or x-term in the dividend, you need to line up the subtraction by using zero coefficients (or leaving spaces) for the missing terms. divisor x2 x – 1 x + 0x2 – (x3 – x2) x2 – (x2 3 + x + 1 quotient + 0x – 1 dividend subtract x3 – x2 from dividend + 0x bring down the 0x term from the divisor – x) subtract x2 – x x – 1 bring down 1 from dividend – (x – 1) subtract x – 1 0 remainder Thus, x – 1 divides evenly into x3 – 1 and you can write 3 x −1 x−1 = x2 + x + 1 x≠1 answer If we want to check our answer, then we multiply x2 + x + 1 by x – 1 and see if we obtain x3 – 1: (x2 + x + 1)(x – 1) Note: = x3 + x2 + x – x2 – x – 1 multiply together using Distributive Properties = x3 + (x2 – x2) + (x – x ) – 1 regroup like terms = x3 – 1 cancel like terms check In this example, we basically do the same thing that we do anytime we divide two numbers. We first note that the term (x – 1) goes into the term (x3) x2 times, just as 2 goes into 9 four times. Multiplying x2 by (x – 1) gives us x3 – x2, which we subtract from f(x) = x3 + 0x2 + 0x – 1. We continue this process until there we either end up with 0 or a factor which x – 1 is unable to go into evenly. 122 – Polynomial Division Functions Advanced To simplify rational expressions using a special form of polynomial division. No definitions on this page. Thales of Miletus (c.640-546 B.C.) was the first Greek mathematician and philosopher who introduced the basic concepts of geometry to the Greeks. He founded his school in Miletus, teaching mathematics and astronomy. In his younger days, he used his almost mystical ability to foresee the future to corner the olive oil market. Thales foresaw a good season for olives one year and bought every local olive press he could find. With the wealth he gained from this venture, he was able to finance his school. Thales was so honored and revered for his wisdom and knowledge that he became known as one of the Seven Wise Men of Greece. Functions → Polynomial Functions → Polynomial Operations → Polynomial Division → Synthetic Division Dividing two polynomials using long division can be tedious. It is also easy to make mistakes. Therefore, we usually like to try and find a shortcut method for dividing polynomials. One such shortcut can be used when the divisor (the denominator) takes the form x – k. We call this shortcut synthetic division. This is not so much a formula as a technique for solving certain types of problems. This is an important distinction. You will need to be able to apply the technique, not just plug the problem into a specific formula. We give the details of synthetic division for a cubic polynomial. To apply this to higher-degree polynomials, we simply continue the pattern established below. Synthetic Division for a Cubic Polynomial To divide ax3 + bx2 + cx + d by x – k, use the following pattern: Vertical pattern: Add terms Diagonal pattern: Multiply by k The resulting expression gives us: = Functions Synthetic Division – 123 Notes: Synthetic division only works for divisors of the form x – k. [remember that x + k = x – (–k)]. You cannot use synthetic division to divide a polynomial by a quadratic such as x2 + 5. Synthetic division for higher degree polynomials follows a similar pattern as the one given above. If we have any missing terms, then we use a zero for a placeholder just as we do for regular long division of polynomials. Let's look at an example: Use synthetic division to divide 3x3 – 17x2 + 15x – 25 by x – 5. solution Set up an array like the one given below. Apply the pattern for synthetic division given above. Now we can rewrite our quotient (and remainder) in standard form: 3 2 3x − 17x + 15x − 25 = x−5 3x2 – 2x + 5 answer Note that in this example, we have a remainder of 0. To double-check our answer, we can multiply our quotient 3x2 – 2x + 5 by the divisor x – 5 and see if we obtain the dividend: (3x2 – 2x + 5)(x – 5) 124 – Synthetic Division = 3x3 – 15x2 – 2x2 + 10x + 5x – 25 multiply two polynomials using Distributive Property = 3x3 – 17x2 + 15x – 25 add like terms check Functions Advanced To find solutions of a polynomial equation using the Factor Theorem. No definitions on this page. Marin Mersenne (1588-1648) was a French number theorist and good friend of René Descartes. His chief role in mathematics was not developing his own theories, but in spreading the information developed by other mathematicians. It was through Mersenne that both Descartes and Galileo achieved recognition throughout Europe. He did do his own work and attempted to find a rule which would allow him to calculate large prime numbers. Unfortunately his ideas were flawed, but his work stimulated other mathematicians to tackle the problem of finding large prime numbers, which today are called "Mersenne Numbers". Functions → Polynomial Functions → Polynomial Operations → Polynomial Division → Synthetic Division → Factor Theorem The Remainder Theorem tells us that when we divide a polynomial f(x) by a divisor x – k, then the remainder is equal to that function evaluated at x = k. But what happens when the remainder is 0? As it turns out, a remainder of 0 indicates that the divisor x – k is a factor of the polynomial f(x). This result is known as the Factor Theorem: The Factor Theorem A polynomial f(x) has a factor (x – k) if and only if f(k) = 0. This is a simple statement that has a few surprising ramifications, so let's take a moment to see just why the Factor Theorem is true. We can use the Division Algorithm with the factor (x – k) to write down f(x), which is a polynomial of degree n, as: f(x) = (x – k)q(x) + r(x) Division Algorithm By the Remainder Theorem, we know that r(x) = r = f(k), which means that we can now write f(x) as: f(x) = (x – k)q(x) + f(k) Remainder Theorem where q(x) is a polynomial of degree n – 1. If f(k) = 0, then we have: f(x) = (x – k)q(x) From this we can tell that (x – k) is a factor of f(x). Conversely, if (x – k) is a factor of f(x), division of f(x) by (x – k) yields us a remainder of 0. Therefore, by the Remainder Theorem, we have f(k) = 0. Now that we have demonstrated the principle, it only makes sense to look at it from an applied standpoint. Functions Factor Theorem – 125 Use synthetic division to show that x is a solution of the following third-degree polynomial equation. Use the result to factor the polynomial completely. List all real zeros of the function. Polynomial Equation x3 – 7x + 6 = 0 Value of x x = 2 solution First, we divide the polynomial x3 – 7x + 6 by x – 2, using synthetic division. 2 1 1 0 –7 6 2 4 –6 2 –3 0 Since we end up with a remainder of 0, we can apply the Factor Theorem to conclude that (x – 2) is a factor of the polynomial f(x) = x3 – 7x + 6. From the Division Algorithm, we can write f(x) as: x3 – 7x + 6 = (x – 2)(x2 + 2x – 3) + 0 Division Algorithm Now, if we substitute in x = 2, then we end up with one factor becoming zero, which subsequently turns the whole expression to 0, and therefore we can say that x = 2 is indeed a solution to the polynomial equation: x3 – 7x + 6 = 0 We are also asked to factor the polynomial completely. We can apply standard factoring techniques to the quadratic expression (x2 + 2x – 3) to obtain: x3 – 7x + 6 = (x – 2)(x – 1)(x + 3) factor quadratic answer Now that we have factored our expression completely, we can see that the roots of the polynomial equation we were given are: x = 2 x = 1 x = –3 126 – Factor Theorem answer Functions Advanced To evaluate a function for a given value using synthetic division instead of direct substitution. No definitions on this page. Pythagoras of Samos (c.580-500 B.C.), after whom the Pythagorean Theorem is named, established a school in southern Italy to teach philosophy and mathematics. Students at his school lived by very strict rules. They were conscientious vegetarians, took a vow of silence for the first five years of their membership, and were forbidden to keep any written records. Pythagoras opened up his school to both men and women, women even being allowed to teach. Members disdained from taking credit for individual discoveries, instead attributing them to Pythagoras or to the group. Functions → Polynomial Functions → Polynomial Operations → Polynomial Division → Synthetic Division → Remainder Theorem The remainder obtained from synthetic division has an important interpretation. This is given in the Remainder Theorem: The Remainder Theorem Let f(x) be a polynomial of positive degree n. Then for any number k, f(x) = q(x)(x – k) + f(k) remainder theorem where q(x) is a polynomial of degree n – 1. The Remainder Theorem has the interesting effect of allowing us to evaluate a polynomial function. To evaluate the polynomial function f(x) when x = k, divide f(x) by x – k. The remainder will be f(k). To see why this is true, consider the Division Algorithm, which allows us to write f(x) as: f(x) = (x – k)q(x) + r(x) Division Algorithm By the Division Algorithm, we know that either r(x) = 0 or the degree of r(x) is less than the degree of x – k. This second piece of information tells us that r(x) must be a constant. That is, r(x) = r. If we evaluate f(x) at x = k, we get: f(k) f(k) = (k – k)q(k) + r evaluate f(x) at x = k = 0q(k) + r simplify = r Remainder Theorem Let's see an example of how this works. Functions Remainder Theorem – 127 Express the function in the form f(x) = (x – k)q(x) + r for the given value of k and demonstrate that f(k) = r. Function f(x) 15x4 + 10x3 – 6x2 + 14 = Value of k 2 k = − 3 solution In order to get the function in the form f(x) = (x – k)q(x) + r, we need to divide f(x) by x – k using synthetic division. − 2 15 3 10 –6 0 –10 0 4 0 –6 4 15 14 − 8 3 Using the bottom row as the coefficients of the terms in q(x), we have q(x) 15x3 – 6x + 4 = and r 34 = remainder 3 Thus we can write our function f(x) as: f(x) = (15x3 − 6x + 4) + ⎟ 3⎠ 3 ⎛x + ⎜ ⎝ 2⎞ 34 answer In order to demonstrate that f(k) = r, we simply substitute in x = k into our original function: f ⎛⎜ − 2⎞ ⎟ ⎝ 3⎠ ⎟ ⎝ 3⎠ 2 ⎛ 2⎞ ⎛ 2⎞ ⎟ + 10 ⎜ − ⎟ − 6 ⎜ ⎟ + 14 ⎝ 3⎠ ⎝ 3⎠ ⎝ 3⎠ = 15⎛⎜ 16 ⎞ simplify = 15⎛⎜ 16 ⎞ convert all fractions to LCD 8 4 + 10⎛⎜ − ⎞⎟ − 6⎛⎜ ⎞⎟ + 14 ⎟ ⎝ 81 ⎠ ⎝ 27 ⎠ ⎝ 9 ⎠ = 2⎞ 3 15 ⎛⎜ − = f ⎛⎜ − 2⎞ 4 = = 128 – Remainder Theorem 24 36 81 − 10⎛⎜ ⎞⎟ − 6⎛⎜ ⎞⎟ + 14⎛⎜ ⎞⎟ ⎟ ⎝ 81 ⎠ ⎝ 81 ⎠ ⎝ 81 ⎠ ⎝ 81 ⎠ 240 81 918 81 34 3 − 240 81 − 216 81 + 1134 81 substitute in x = k simplify simplify reduce to a fraction in lowest terms answer Functions Advanced To determine the number of real zeros for a given polynomial function. No definitions on this page. René Descartes (1590-1650) was more than just a mathematician. In his younger years he served as a soldierof-fortune in various armies throughout Europe, including the Bavarian Army in present-day Germany. It was during this time, on November 10, 1619, he fell into a fevered delirium and had a mystical experience that changed his life. Supposedly he experienced three vivid dreams in succession. As a result, he became convinced it was his divine mission to found a new philosophical system in which everything in the sciences is connected by a chain of mathematical logic. Furthermore, physics was nothing but geometry. The really strange bit is that modern physics relies more and more on geometry to explain what goes on at the quantum level. Functions → Polynomial Functions → Real Zeros For any polynomial function f of degree n, the following statements are true. Remember that the zeros of a function are the x-values for which the function equals zero. 1. The graph of f has, at most, n – 1 turning points. Turning points are points at which the graph changes from increasing to decreasing and vice-versa. 2. The function f has, at most, n real zeros. This statement comes from the Fundamental Theorem of Algebra. Finding the zeros of a polynomial function is one of the most important applications of algebra. The zeros represent solutions to polynomial equations. It is important to realize the connection between an algebraic approach and a graphical approach to a problem. Given a graph, we can often use that graph to help us determine the location of any zeros of a function. On the other hand, we can use algebraic techniques to determine the shape of a graph based on the solutions to an equation. In general, we can make the following statements about the real zeros of a polynomial function. Real Zeros of Polynomial Functions If f(x) is a polynomial function and a is a real number, the following statements are equivalent: Note: 1. x = a is a zero of the function f. 2. x = a is a solution of the polynomial equation f(x) = 0. 3. (x – a) is a factor of the polynomial f(x). 4. (a, 0) is an x-intercept of the graph of f. Finding real zeros of polynomial functions is closely related to factoring and finding x-intercepts. Let's take a look at an example of how we can use the statements above to find the solutions to a polynomial equation. Functions Real Zeros – 129 Find all the real zeros of the following polynomial function. f(t) t3 – 5t2 + 6t = solution First, we can factor out a t from all three of the terms. Then we can factor the remaining quadratic equation. Then we apply Statement 3 from above to determine the real zeros of the polynomial function. f(t) = t3 – 5t2 + 6t original function = t(t2 – 5t + 6) factor out common monomial factor t = t(t – 2)(t – 3) factor remaining quadratic Now that we have reduced f(t) to its factored form, we want values of t which would make f(t) = 0. According to our factors, t = 0, t = 2 or t = 3. Thus the real zeros are: t = 0 t = 2 t = 3 answer Here is a graph of the function plotted with MathCAD 2000: The corresponding t-intercepts to t = 0, t = 2, and t = 3 are (0, 0), (2, 0), and (3, 0). This figure also has two turning points. Since this is a third-degree polynomial it can have at most two turning points. The Intermediate Value Theorem We can use the values of a function at two distinct points to determine if a function has a zero between those two points. The Intermediate Value Theorem is an existence theorem in that it tells us such zeros exist, but not how to go about finding them. If (a, f(a)) and (b, f(b)) are two points on the graph of a polynomial function such that f(a) f(b), then for any number d between f(a) and f(b) there must be a number c between a and b such that f(c) = d. When we are looking for zeros of a function, then we want f(c) = 0. In even more formal terms, this is written as: 130 – Real Zeros Functions Intermediate Value Theorem Let a and b be real numbers such that a < b. If f is a polynomial function such that f(a) ≠ f(b), then in the interval [a, b], f takes on every value between f(a) and f(b). We can use this theorem to help us find zeros as follows. If we can find a number x = a such that f(a) > 0, and another number x = b such that f(b) < 0, then the function must cross the x-axis at least once somewhere between x = a and x = b. The point(s) where the function crosses the x-axis are the zeros for that function. Let's see how this works. Find at least one interval of length 1 for the following function in which the given function is guaranteed to contain a zero. f(x) = x3 – 3x2 + 3 solution First, let's start with a few trial numbers. A good range of numbers to start with is positive and negative numbers near zero. Then we evaluate the function f at those numbers. When f(x) switches from negative to positive or from positive to negative, we know that we have crossed the x-axis. Therefore, the zero will be contained in that interval. x f(x) –2 –17 –1 –1 0 3 1 1 2 –1 3 3 Because f(–1) is negative and f(0) is positive, you can conclude from the Intermediate Value Theorem that there must be a zero somewhere in the interval [–1, 0]. Similarly, because f(1) is positive and f(2) is negative, there is another zero in the interval [1, 2]. Finally, a third zero is found in the interval [2, 3]. If we look at the graph of this function, we can see that it will give us the same information: answer Functions Real Zeros – 131 132 – Real Zeros Functions Advanced To find the number of real (and imaginary) roots of a polynomial function. positive real zeros – positive real x-values for which the function equals zero. negative real zeros – negative real x-values for which the function equals zero. variation in sign – two consecutive coefficients in a polynomial function having opposite signs. In 1649, René Descartes (1590-1650) traveled to Sweden to tutor 20-year old Queen Christina. She preferred to start her days at 5 am. The combination of extremely cold winters and early rising resulted in Descartes contracting a fatal case of pneumonia. He died early spring of 1650. In 1666, his remains were exhumed and returned to France. The French ambassador received permission to remove Descartes's right forefinger from the corpse. Descartes's skull is said to have been removed by a guard and sold several times. Supposedly, Descartes's skull is currently on display at the Museé de l'Homme in the Palais de Chaillot in France. Functions → Polynomial Functions → Real Zeros → Descartes's Rule of Signs From the previous thought, we know that any nth-degree polynomial can have at most n real zeros. Of course, many polynomials do not have that many real zeros. For example, f(x) = x2 + 1 has no real zeros and f(x) = x3 + 1 has only one real zero. However, no polynomial can have more than n real zeros. We can use the following theorem, called Descartes's Rule of Signs to shed more light on the zeros of a polynomial function. Descartes's Rule of Signs Let f(x) = anxn + an – 1xn – 1 + . . . + a2x2 + a1x + a0 be a polynomial with real coefficients and a0 ≠ 0. 1. The number of positive real zeros of f is either equal to the number of variations in signs of f(x) or less than that number by an even integer. This means that for every change of sign (from +an to –an – 1 to + an – 2, etc), you will have a zero for each change in sign or two. If you do not have a zero for each change in sign, then you will have two less than the maximum number of zeros. If you do not have that many, then you will have four less than the maximum, etc… These are all for positive real zeros. 2. The number of negative real zeros of f is either equal to number of variations of sign of f(–x) or less than that number by an even integer. The same rule that applies to the positive real zeros applies to the negative zeros as well. For example, If you have 4 changes in sign, then you will have either 4 zeros, 2 zeros, or no zeros. You will never have 3 zeros or 1 zero. If you have 5 changes in sign, you can have 5 zeros, 3 zeros or 1 zero. You will always have at least one zero for an odd number of changes in sign. You will never have an even number of zeros if you have an odd number of changes in sign. Functions Descartes's Rule of Signs – 133 Notes: Variation in sign means that two consecutive coefficients have opposite signs. Where there is only one variation in sign, Descartes’s Rule of Signs guarantees the existence of exactly one positive (or negative) real root. Descartes's Rule of Signs does not tell us what the zeros are. It simply states how many zeros we will find, as well as whether they will be positive or negative. We need to use other methods to actually find the zeros. We must note the qualifier that the roots given by Descartes's Rule of Signs are all real. According to the Fundamental Theorem of Algebra, the degree of the polynomial tells us the total number of roots. Therefore, if Descartes's Rule of Signs yields us three real roots (positive, negative, or both) for a 5th degree polynomial, then we must conclude that the remaining 2 roots are imaginary. Use Descartes's Rule of Signs to determine how many real solutions the following function has as well as if they are positive or negative. h(x) = 3x4 – 4x + 7 given function solution Our given function has coefficients that are positive, negative, and positive again. This translates to 2 variations in sign. Descartes's Rule of Signs tells us that the number of positive real solutions is equal to the number of changes in sign or less than that by an even number. Thus, we can have either 2 positive real roots or (2 – 2) = 0 positive real roots. In order to determine our negative real roots, we have to replace "x" with "–x" in the function and simplify, noting any variations in sign again: h(–x) = 3(–x)4 – 4(–x) + 7 replace "x" with "–x" = 3x4 + 4x + 7 simplify All the coefficients are positive. Thus, we conclude from Descartes's Rule of Signs that there are no negative real roots, since there are no variations in sign. Thus we have at most, 2 positive real roots. answer Since we have just found how many real roots there are, we can use the Fundamental Theorem of Algebra to determine that since we have a 4th-degree polynomial, we must obtain 4 solutions. We have already stated that at most 2 of these solutions are real. Therefore, we must conclude that there are at least 2 complex solutions. (Complex solutions always come in pairs, so we can have either two complex solutions or four complex solutions, but not three complex solution.) Use Descartes's Rule of Signs to determine how many real solutions the following function has as well as if they are positive or negative. f(x) = 2x3 + x2 + 3x + 3 given function solution In our original function, we can see that the coefficients are all positive. Since this is the case, we can conclude from Descartes's Rule of Signs that there are no positive real roots. To find the negative real roots, we substitute "–x" in for "x" in our original function and simplify: 134 – Descartes's Rule of Signs Functions f(–x) = 2(–x)3 + (–x)2 + 3(–x) + 3 replace "x" with "–x" = –2x3 + x2 –3x + 3 simplify Now we have a negative coefficient, followed by a positive coefficient, followed by a negative coefficient, followed last by a positive constant. There are three variations in sign (– to + to – to +). We can then use Descartes's Rule of Signs to conclude that there are either 3 negative real roots or (3 – 2) = 1 negative real root. If we later find out that there is only 1 negative real root, then, by elimination, we must conclude that the remaining two roots are complex. Thus we have at most 3 negative real roots. answer Functions Descartes's Rule of Signs – 135 136 – Descartes's Rule of Signs Functions Advanced To find rational zeros of polynomial functions. No definitions on this page. Although Albert Einstein (1879-1955) is most famous for his theories on relativity, he won his Nobel Prize for his work on the photoelectric effect. When light rays strike metal, electrons are released from the metal to fly freely through the air. This is the "photoelectric effect". The number of electrons released is not a function of the intensity or brightness of the light. Einstein demonstrated that the number of electrons released is a function of the color of the light. Color is determined by the wavelength of light. A shorter wavelength corresponds to higher energy. Energy is delivered via packets known as quanta. The more quanta of energy that strike the metal, the higher the probability that an electron will be released. His paper on the photoelectric effect was published in 1905, but he didn't get the Nobel Prize until 1921. Functions → Polynomial Functions → Real Zeros → Rational Zero Test For polynomials with integer coefficients, the zeros of that polynomial may be found by looking at the factors of the constant term and the factors of the leading coefficient. All possible zeros can be expressed as a ratio of one factor of the constant term over one factor of the leading coefficient. However, this just gives us the possible zeros. We have to test each and every zero in the actual polynomial to see if works. We call this procedure the Rational Zero Test, and explain it below: The Rational Zero Test If the polynomial f(x) = anxn + an-1xn–1 + . . . + a2x2 + a1x + a0 has integer coefficients, every rational zero of f has the form Rational zero = where p and q have no common factors other than 1, and p = a factor of the constant term a0 q = a factor of the leading coefficient an Possible rational zeros are formed by listing all rational numbers whose numerators are coefficients of the constant term (a0) and whose denominators are factors of the leading coefficient (an). Possible rational zeros = factors of constant term factors of leading coefficient In essence, we divide each of the factors of the constant term by each of the factors of the leading coefficient. Note that we will need to look at both positive and negative factors. The factors of 6, for example, are ±1, ±2, ±3, ±6 because we can have: Functions Rational Zero Test – 137 (1)(6) = 6 and (–1)(–6) = 6 and (2)(3) = 6 and (–2)(–3) = 6 Dividing the factors will give us a list of possible zeros. Any possible zeros that crop up more than once are disregarded. For example, if we have 6 as the constant term and 2 as the leading coefficient, we end up with: possible rational zeros ±1, ±2, ±3, ±6 ±1, ±2 = factors of 6 factors of 2 After dividing each factor of 6 by each factor of 2, we obtain a list of: ±1, ±1/2, ±2, ±2/2, ±3, ±3/2, ±6, ±6/2 We can reduce some of the fractions to get: ±1, ±1/2, ±2, ±1, ±3, ±3/2, ±6, ±3 Since the factors ±1 and ±3 appear more than once, we really only need to list them once: ±1, ±1/2, ±2, ±3, ±3/2, ±6 possible rational zeros of a polynomial with leading coefficient 2 and constant term 6 Once the list of possible zeros is formed, trial and error methods are needed to determine, which, if any, the actual zeros of the polynomial are. Note that this does not tell us how many zeros we have. We can use Descartes Rule of Signs to tell us how many real zeros we have then apply the Rational Zero Test to determine which of the possible zeros we come up with will work. Once we have found as many zeros (both positive and negative) as are allowed by the degree of the polynomial, we can stop. Find the rational zeros of f(x) = x3 + x2 + 1 solution Here we have a leading coefficient of 1 and a constant of 1. The only possible zeros we can have, from the Rational Zero Test are ±1. However, when we test this, we will find that neither 1 nor –1 will make our function equal to zero: f(1) f(–1) = 1 3 + 12 + 1 substitute in x = 1 = 3 simplify = (–1)3 + (–1)2 + 1 substitute in x = –1 = 1 simplify Thus, we can conclude from the Rational Zero Test that this polynomial function does not have any rational zeros. However, if we graph this function, we can see that it does have a real zero (between –2 and –1): 138 – Rational Zero Test Functions answer Let's look at another example where the leading coefficient is 1 but the constant term is not 1: Find the rational zeros of: f(x) = x3 – 7x – 6 solution Here the leading coefficient is 1 and the constant term is 6, so by the Rational Zero Test, we have the following possible rational zeros: ±6, ±3, ±2, ±1 ±1 factors of 6 (constant term) factors of 1 (leading coefficient) Which we can simplify to: ±6, ±3, ±2, ±1 possible rational zeros Note that we have a total of eight possible zeros: –6, –3, –2, –1, 1, 2, 3, 6. We have to check each of them in our polynomial function and see which ones will make our function equal to zero. Testing these possible zeros, we find the following three rational solutions: x = –1 x = –2 x = –3 These are the only three that work. answer Of course, many polynomials do not have a convenient leading coefficient of 1. When that happens, we dramatically increase the number of possible zeros. We can use a number of methods to shorten the search. First, if we happen to have an expensive calculator, we can probably use its built-in polynomial solving functions. The HP 48G, for example, allows us to simply input the coefficients of the polynomial and the calculator finds the roots for us. Not all calculators are quite so sophisticated. However, a graphing calculator can also be of use. By looking at the graph of a polynomial function, in particular those points where it crosses the x-axis, we can narrow our focus to those possible roots that are likely to be in the region where the graph crosses the x-axis. Finally, if we do not have a polynomial-solving or graphing calculator, we can resort to synthetic division. Synthetic division let's us know if one of our potential zeros is an actual zero if we divide the polynomial by x – k, where k is one of our possible roots. If we obtain a zero remainder, then we know that we have a root. As always, we only need to continue until we obtain n roots, where n is the degree of the polynomial. Let's see an example of a more complicated polynomial function and apply the Rational Zero Test. Functions Rational Zero Test – 139 Find all the rational zeros of the following function f(x) 4x3 – 12x2 – x + 15 = solution The leading coefficient is 4 and the constant term is 15, so we have the following possible zeros: ±1, ±3, ±5, ±15 ±1, ±2, ±4 factors of 15 (constant term) factors of 4 (leading coefficient) Written out, this gives us: ±1, ±1/2, ±1/4, ±3, ±3/2, ±3/4, ±5, ±5/2, ±5/4, ±15, ±15/2, ±15/4 for a total of 24 possible zeros. This would be extremely tedious to try and plug each one into our function to see if it works. Let's graph the function and see if we can eliminate some zeros. Since this is a cubic polynomial, we expect to have at most three solutions, but not all of them need be rational. Here is the graph: Based on this graph, it looks as though we have a zero located at x = –1. Let's use synthetic division to check it out. –1 4 4 –12 –1 15 –4 16 –15 –16 15 0 Since we have a remainder of 0, we can conclude that x = –1 is a rational solution of f(x). Another possible solution is located at x = 3/2. Again, we use synthetic division to check it out: 3 2 4 4 140 – Rational Zero Test –12 –1 15 6 –9 –15 –6 –10 0 Functions Again, we end up with a remainder of 0, so we conclude that x = 3/2 is a rational zero of the function f(x). Finally, we see that there is a third (and final) possible zero located at x = 5/2. Let's try it and see: 5 4 2 4 –12 –1 15 10 –5 –15 –2 –6 0 Once again, lo and behold, we end up with a remainder of 0! We have found three rational zeros: x = x = x = –1 3 2 5 2 and and answer Since our polynomial is cubic, we know that we have found all three of the possible solutions. Functions Rational Zero Test – 141 142 – Rational Zero Test Functions Advanced To add, subtract, multiply and divide complex numbers. complex number – any number, real or imaginary, of the form a + bi, where a and b are real numbers and imaginary number – a number which is the square root of a negative real number. complex conjugate – the complex conjugate of a + bi is a – bi, and vice versa. The product of complex conjugates is a real number. Girolamo Cardano (1501-1576) was more than just an Italian algebraist and physician. He was the illegitimate son of a noted lawyer, a compulsive gambler, a popular astrologer, a former prisoner of the Inquisition (largely due to his astrology practice), and the father of a convicted murderer. Despite the many sordid details of his life, he still made significant contributions to mathematics and medicine. Cardano published over 200 papers on human medicine, physics, natural science, philosophy, and music. Cardano was one of the first to recognize the existence of the square root of negative numbers, otherwise known as imaginary numbers, although he did not quite know how to deal with them. Functions → Polynomial Functions → Complex Numbers For certain quadratic equations, we are unable to find any real solutions to the equation. The classic example of such an equation is: x2 + 1 = 0 equation with no real solutions We cannot factor this equation. If we subtract 1 from both sides, we end up with: x2 = –1 There is no real number x which we can square to give us –1. However, mathematicians refused to let this stop them from finding a solution to the equation. Since they were unable to find a real solution, they decided that there could only be imaginary solutions (sometimes called fictitious solutions). The imaginary solutions became an entire new system of numbers, the basis of which is the solution to the equation given above. They defined this new unit, called i, as: i = −1 imaginary unit Thus, i2 = –1. Any imaginary number is written as a multiple of the imaginary unit above. For example, if we have the equation: x2 + 4 = 0 another equation with no real solution Then the solutions of this equation are: Functions Complex Numbers – 143 x2 = x = −4 take square root of both sides = 4( −1) factor out a 4 = 4⋅ −1 use a property of radicals to separate –4 subtract 4 from both sides = 2 −1 simplify first radical = 2i substitute in i = (–1) We do not have to work with imaginary numbers by themselves. We can combine real numbers with imaginary numbers. The result is what is known as a complex number. All complex numbers can be written in the standard form, a + bi. Definition of Complex Numbers For real numbers a and b, the number a + bi is a complex number written in standard form. If a = 0 and b ≠ 0, then the complex number bi is an imaginary number. If b = 0 and a ≠ 0, then the complex number a is a real number. The above definition leads to an interesting conclusion. Although the set of real numbers and the set of imaginary numbers are independent of each other, both are subsets of the set of complex numbers. That is, the set of complex numbers contains all the real numbers AND all the imaginary numbers, as shown in the diagram below: Equality of Complex Numbers Two complex numbers a + bi and c + di, written in standard form, are equal to each other a + bi = c + di equality of two complex numbers if and only if a = c and b = d. Operations with Complex Numbers Naturally, it is not enough to know what a complex number is. We also have to know how to use them. Adding and subtracting complex numbers is fairly straightforward. We add (or subtract) the real parts together. We then add (or subtract) the complex parts together. 144 – Complex Numbers Functions Addition and Subtraction of Complex Numbers If a = bi and c + di are two complex numbers written in standard form, then their sum and difference are defined as follows: Sum (a + bi) + (c + di) = (a + c) + (b + d)i adding complex numbers Difference (a + bi) – (c + di) = (a – c) + (b – d)i subtracting complex numbers Simplify. a.) (4 – i) + (7 – 3i) b.) (5 + 5i) + 3i c.) (3 – 2i) + (4 + 2i) a.) (4 – i) + (7 – 3i) solution b.) c.) Note: (5 + 5i) + 3i = 4 – i + 7 – 3i remove parentheses = 4 + 7 – i – 3i group like terms = (4 + 7) – (1 + 3) i put inside parentheses = 11 – 4i simplify into standard form answer = 5 + 5i + 3i remove parenetheses = 5 + (5 + 3)i factor out an i-term = 5 + 8i simplify into standard form answer (3 – 2i) + (4 + 2i) = 3 – 2i + 4 + 2i remove parentheses = 3 + 4 – 2i + 2i group like terms = 7 simplify answer In example (c), we ended up with a real number for the answer. Remember that a real number can be written in complex form as a + 0b. Multiplying two complex numbers involves applying the Distributive Property as we would do when multiplying two polynomials. In the case of complex numbers, we treat i as we would treat any variable, such as x. Functions Complex Numbers – 145 Simplify. d.) (i)(4 – 3i) e.) (2 – i)(5 – 3i) f.) (3 + 2i)(3 – 2i) d.) (i)(4 – 3i) solution e.) f.) Note: = 4i – 3i2 Distributive Property = 4i – 3(–1) remember that i2 = –1 = 3 + 4i rewrite in standard form answer (2 – i)(5 – 3i) (3 + 2i)(3 – 2i) = 10 – 6i – 5i + 3i2 FOIL method = 10 – 11i + 3i2 simplify = 10 – 11i + 3(–1) remember that i2 = –1 = 10 – 3 – 11i group like terms = 7 – 11i simplify in standard form answer = 9 – 6i + 6i – 4i2 FOIL method = 9 – 4(–1) remember that i2 = –1 6i terms cancel each other = 9+4 simplify = 13 simplify answer Example (f) shows us that we can also obtain a real number by multiplying two complex numbers. This phenomenon is explained in more detail below. Complex Conjugates and Division Whenever we see any type of rational expression or function, we like to try and simplify the denominator because that makes it a whole lot easier to work with. Complex rational expressions are no exception. In general, we like to try to convert the denominator to a real number. Example (f) from above showed us how this is possible. We can obtain a real number from complex numbers by multiplying a complex number by its complex conjugate. A pair of complex numbers of the form a + bi and a – bi are called complex conjugates. When we multiply these two numbers together, we end up with: (a + bi)(a – bi) = a2 – abi + abi – b2i2 = a2 – b2(–1) remember that i2 = –1 = a2 + b2 real number result If we want to find the quotient of a + bi and c + di where c and d are both not equal to zero, then multiply the numerator and the denominator by the complex conjugate of the denominator. We are essentially multiplying the top and bottom of the rational expression by a very special form of "1": 146 – Complex Numbers Functions a + bi a + bi ⎛ c − di ⎞ ⎜ = c + di ⎟ c + di ⎝ c − di ⎠ ( ac + bd ) + ( bc − ad ) i = 2 c +d 2 multiply numerator and denominator by complex conjugate multiply numerator and denominator Now we have a complex number in the numerator and a real number in the denominator. Simplify. 4 2+ i solution 4 2+ i = = = = Note: Functions 4 2+ i ⋅ ⎛⎜ ⎟ ⎝2−i⎠ 8 − 4⋅ i 4+ 1 8 − 4⋅ i 5 8 5 − 4 5 2−i⎞ i multiply top and bottom by complex conjugate of denominator expand numerator and denominator simplify denominator rewrite in standard form answer From this example, we can see that converting the denominator to a real number allows us to write the complex number in standard form. Complex Numbers – 147 148 – Complex Numbers Functions Advanced To review the definition of rational functions. rational function – a function defined by a simplified rational expression. domain – the set of values which are allowable substitutions for the independent variable. asymptote – a line approached by the graph of a function. Thales of Miletus (c.640-546 B.C.) was an extremely well-traveled mathematician. He went to Egypt to either pursue business opportunities or to study Egyptian science and philosophy or both. It is in Egypt where he learned the basic principles of geometry. According to one legend, Thales was observing the Great Pyramid of Cheops one day when Pharaoh and his entourage were passing by. Pharaoh had heard of Thales's reputation as a wise man and asked Thales to calculate the height of the pyramid. One way he might have accomplished this is with the "magical" Egyptian shadow-stick, which, when placed in the ground allows one to establish a relationship between the height of the stick's shadow and the height of the object to be measured. A more plausible method is that Thales measured the pyramid's shadow at the same time when his own shadow appeared to be the same length as himself, thus assuring the Great Pyramid's shadow accurately represented its true height. Functions → Rational Functions A rational function, contrary to popular belief, is not a function that makes sense. The name comes from the fact that a rational function consists of a ratio of two other functions. Consider the expression: 5 4 This is the ratio of 5 to 4, also written as 5:4. In the same way, if we have a function p(x) = 3x and wish to express this function in relation to some other function q(x) = 4x2 + 1, then we can write it in exactly the same form as above: 3x 2 4x + 1 ratio of p(x) = 3x to q(x) = 4x2 + 1 In more general terms: Definition of Rational Function A rational function can be written as: f(x) = q(x) ≠ 0 where p(x) and q(x) are both polynomials and q(x) is not equal to zero. For the time being, we will assume that p(x) and q(x) have no common factors. (If they do, then we can apply the principles of polynomial division in order to simplify the polynomial f(x)). The domain of a rational function of x includes all real numbers except those values which would cause the denominator q(x) to return a value of zero. In our rational expression above, we can use all real numbers because there are no real numbers that will cause the denominator q(x) = 4x2 + 1 to equal zero. Functions Rational Functions – 149 In this section of the Algebra Brain, we will examine rational functions in some detail, paying particular attention to the asymptotes of rational functions. We will also look at the notion of partial fractions, whereby we write rational expressions as the sum of two or more simpler rational expressions. 150 – Rational Functions Functions Advanced To find the asymptotes of a rational function. asymptote – a line approached by the graph of a function such that the graph never touches or crosses the line. vertical asymptote – a vertical line approached by the graph of a function such that the graph never touches or crosses the vertical line. horizontal asymptote – a horizontal line approached by the graph of a function such that the graph never touches or crosses the horizontal line. The Pythagoreans shared a mystical belief that the order inherent in the universe could be revealed in numbers. From this belief, they developed representations of numbers: 1 was the point, 2 the line, 3 the surface, and 4 the solid. They even assigned moral qualities to numbers. 4, for example, represented justice, while 10 (a.k.a. the tetractys) indicated the sum of all nature because it was the sum of 1 + 2 + 3 + 4 (the point, line, plane, and solid). Functions → Rational Functions → Asymptotes Rational functions do not have the same sort of graphs as other polynomial functions. Remember that a rational function is defined as one polynomial function divided by another polynomial function. Let's discuss the behavior of the simplest rational function. This will give us the tools needed to make some generalizations about their behavior. Consider the function: f(x) = 1 x rational function The domain of f(x) is all real values of x such that the denominator of f(x) does not equal zero. In this case, if x = 0, then f(x) becomes undefined and therefore x = 0 is excluded from the domain of f(x). Let's take a look at what happens to f(x) at values near x = 0. x f(x) –1 –1 –0.5 –2 x f(x) 0← ← 1 1 –0.25 –4 0.5 2 –0.1 –10 0.25 4 –0.01 –100 0.1 10 0.01 100 –0.001 –1000 → 0 →– 0.001 1000 We can graph these values of x and f(x) to obtain the following: Functions Asymptotes – 151 Note: As x approaches 0 from the left, f(x) decreases without bound (goes to – ). As x approaches 0 from the right, f(x) increases without bound (goes to + ). Also note that the graph of f(x) never crosses either the x- or y-axes. Because of this fact, we call the x- and y-axes the asymptotes of the graph of f(x). In other words, the line x = 0 is the vertical asymptote of the graph of f and the line y = 0 is the horizontal asymptote of the graph of f. This means that the values of f(x) = 1/x approach zero as x increases or decreases without bound (as x moves toward ). The formal definition of vertical and horizontal asymptotes of a function is as follows: Definition of Vertical and Horizontal Asymptotes The line x = a is a vertical asymptote of the graph of f if f(x) → as x or f(x) → – a, either from the right or from the left. The line y = b is a horizontal asymptote of the graph of f if f(x) → b as x → or x → – . Eventually (as x → or x → – ) the distance between the graph of f and the asymptote must approach zero. However, by definition, the graph never actually touches the asymptote. You will learn more about this astonishing behavior when you study calculus. Now that we know the definition of asymptotes, we need some method by which we can go about finding them for a particular rational function. 152 – Asymptotes Functions Asymptotes of a Rational Function Let f be the rational function given by: f(x) = = where p(x) and q(x) have no common factors. 1. The graph of f has vertical asymptotes at the zeros of q(x). 2. The graph of f has one or no horizontal asymptotes determined as follows: a. If n < m, the graph of f has the x-axis (y = 0) as a horizontal asymptote. b. If n = m, the graph of f has the line y = as a horizontal asymptote c. If n > m, the graph of f has no horizontal asymptotes. The theorem basically boils down to comparing the degree of the polynomial in the numerator to the degree of the polynomial in the denominator. In the example we used to introduce asymptotes, the degree of p(x) = 0 and the degree of q(x) = 1, which means n < m, therefore we used part (2a) of our theorem on asymptotes to determine what the horizontal asymptote was (y = 0). Similarly, the only time f(x) = 1/x has a zero is when x is infinitely large, which means that the vertical asymptote is x = 0. Let's look at an example. Find the domain of the function and any vertical or horizontal asymptotes. f(x) 2+ x = 2−x solution First of all, the domain of this function includes all values of x except those values that cause the function to be undefined. In the case of rational functions, they are undefined whenever the denominator is zero. We want to find values of x which will make q(x) = x – 2 = 0. Those values of x are excluded from our domain. In this example, if x = 2, then q(x) = 0 and f(x) becomes undefined. Therefore, the domain of f(x) is: all real numbers except x = 2 domain of f(x) answer We use the numbers excluded from the domain of f(x) to find our vertical asymptotes. In other words, vertical asymptotes are located at values of x that make the denominator of f(x) go to zero. We have already found such a number for this function. Therefore, the vertical asymptote for f(x) is: x Functions = 2 vertical asymptote answer Asymptotes – 153 Horizontal asymptotes are found by comparing the degrees of the polynomials in numerator and denominator. Both happen to be of degree "1" in this example. Since they are equal, then according to our definition above, a horizontal asymptote is located at: y = y = y = a1 b1 1 −1 –1 horizontal asymptote when p(x) and q(x) have same degree substitute in a1 = 1 and b1 = –1, leading coefficients of p(x) and q(x), respectively simplify answer The graph of this function looks like: Applications of Asymptotes Believe it or not, asymptotes have an important role in real-world applications. Of particular interest is costbenefit analysis. A particular process, whether it is manufacturing or something else, often has a function associated with it that determines how much it costs to obtain a certain percentage benefit. For example, it may cost a power plant only $10,000 to remove 10% of the pollutants it emits, but it may cost them $100,000 to remove 30% of the pollutants and $1,000,000 to remove 70% of the pollutants. As the percentage of pollutants removed increases, the cost also increases. Removing 100% of the pollutants it emits would bankrupt the power plant. It just does not have the capacity to remove every single piece of unclean material from its emissions. This is important information to know because laws determine how much the power plant needs to remove in order to be compliant with the law. The bean-counters in Accounting then have to decide where to find the extra money they need when the law starts tightening its emission controls. Another interesting example of rational functions is population control. When a population of animals is introduced into an environment, it increases dramatically at first, but tapers off as the resources in the environment decrease. The population can only grow if there are sufficient resources. 154 – Asymptotes Functions Advanced To use a set of guidelines to sketch rational functions. slant asymptote – an asymptote to a rational function that occurs when the degree of the numerator is greater than the degree of the denominator. Pythagoras (c.580-500 B.C.) was a noted astronomer. He was the first to declare the spherical nature of the Earth. He also deduced that the sun, moon, and planets each rotate around their own axes. Pythagoras noted that celestial bodies orbit around a central point which he mistakenly identified as the Earth. Later Pythagoreans deduced there was another central point not on the Earth, but never identified this point as the sun. Finally, Pythagoras established the period of the Earth's rotation: 24 hours. Functions → Rational Functions → Sketching Rational Functions Rational functions can be difficult to graph if we don't know what we are doing. Fortunately, there are several guidelines we can use to help us sketch the graph of a rational function. Guidelines for Graphing Rational Functions Let f(x) = p(x)/q(x), where p(x) and q(x) are polynomials with no common factors. Note: 1. Find and plot the y-intercept (if any) by evaluating f(0). 2. Find the zeros of the numerator (if any) by solving the equation p(x) = 0. Then plot the corresponding x-intercepts. 3. Find the zeros of the denominator (if any) by solving the equation q(x) = 0. Then sketch the corresponding vertical asymptotes. 4. Find and sketch the horizontal asymptote (if any) by using the rule for finding the horizontal asymptote of a rational function. 5. Plot at least one point between and one point beyond each x-intercept and vertical asymptote. 6. Use smooth curves to complete the graph between and beyond the vertical asymptote. Testing for symmetry can also be very useful. For example, the graph of f(x) = 1/x is symmetrical with respect to the origin and the graph of g(x) = 1/ x2 is symmetrical with respect to the y-axis. It is usually not enough to simply give you some guidelines. It is also helpful to see how these guidelines can be applied. Sketch the graph of the rational function. As sketching aids, check for intercepts, symmetry, vertical and horizontal asymptotes. g(x) Functions = x 2 x −9 Sketching Rational Functions – 155 solution Let's go down the list of guidelines given above. 1. The y-intercept occurs at g(0): g(0) = = 0 2 0 −9 let x = 0 0 Thus, the y-intercept is located at (0, 0). 2. The x-intercepts occur when y = 0, which only happens when p(x) = 0. In this case, p(x) = x, so that implies that p(0) = 0 and we get: x = 0 x-coordinate of x-intercept Thus, the x-intercept is also located at (0, 0) in this particular example. 3. Vertical asymptotes occur when q(x) = 0. So we solve the equation x2 – 9 = 0: x2 – 9 = 0 q(x) = 0 (x – 3)(x + 3) = 0 factor x = 3 one solution vertical asymptote = –3 other solution vertical asymptote or x Sketch these asymptotes on your graph. 4. To find horizontal asymptotes, we compare the degree of the numerator to the degree of the denominator. Here we have p(x) = x, which is degree 1. We also have q(x) = x2 – 9, which is degree 2. Since 1 < 2, we can conclude that the horizontal asymptote is located at y = 0. This is just the x-axis, so it is probably already part of your graph and we don't need to sketch it. 5. We next plot one point on each side of the left hand vertical asymptote x = –3. We plot one point on each side of the x-intercept x = 0. We plot one point on each side of the right hand vertical asymptote x = 3. We should end up with 6 points total. x g(x) 6. –4 –0.571 –3 undefined –2 0.4 –1 0.125 0 0 1 –0.125 2 –0.4 3 undefined 4 0.571 Finally, we connect our points with smooth curves. Remember that the graph can never cross an asymptote! We show you the graph twice, once with all the bells and whistles and once without all the fancy sketching tricks we used. 156 – Sketching Rational Functions Functions answer If we remove the asymptotes (remember that they are not actually part of the graph) we have: Zooming out on the graph, we can get an even clearer picture of what it looks like: Functions Sketching Rational Functions – 157 Slant Asymptotes Not all rational functions have just horizontal or vertical asymptotes. If the degree of the polynomial in the numerator is exactly one more than the degree of the polynomial in the denominator, then the rational function has a slant asymptote. The exact equation of the slant asymptote is found by long division. For instance, if we divide x2 – x by x + 1, we end up with: 2 f(x) x −x = divide two polynomials numerator is one degree higher than polynomial x+1 x−2+ = 2 x+1 The slant asymptote is the quotient part of the resulting expression, not the remainder part. In the above example, the slant asymptote would be: y = x–2 quotient part Sketch the graph of the function. 2 f(x) = x −x x+1 solution To sketch the graph of this function we use all the same tricks and tools that we did before. 1. The y-intercept is located at (0, 0) because f(0) = 0. 2. The x-intercepts can be found by factoring the numerator and solving the equation p(x) = 0. In this case factoring the equation produces: x2 – x = 0 p(x) = 0 x(x – 1) = 0 factor x or x = 0 x-coordinate of one intercept = 1 x-coordinate of other intercept The x-intercepts are located at (0, 0) and (0, 1). 3. The vertical asymptotes are located wherever the denominator q(x) = 0. In this case, that happens to occur when x = –1, so this is also the one and only vertical asymptote for this function. Sketch the vertical asymptote. 4. The horizontal asymptote is found by comparing the degree of the numerator to the degree of the denominator. Here the degree of p(x) is 2 and the degree of q(x) is 1. Since 2 > 1, we can conclude that this rational function has no horizontal asymptote. However, since the degree of p(x) is exactly one more than the degree of q(x), it does have a slant asymptote, which we have already determined exists at y = x – 2. Sketch this slant asymptote. 5. Now we need a few points. We plot points on either side of each asymptote and on either side of each x-intercept. x f(x) –3 –6 –2 –6 –1 undefined 158 – Sketching Rational Functions –0.5 1.5 0 0 0.5 –0.167 1 0 2 0.667 3 1.5 Functions 6. Finally, we sketch the graph of this rational function, using the asymptotes to help us: answer Functions Sketching Rational Functions – 159 160 – Sketching Rational Functions Functions Advanced To decompose a rational function into its component parts. partial fraction decomposition – the method by which we transform one rational function into the sum of two or more simpler rational functions. S. I. Ramanujan (1887-1920) was an Indian number theorist and one of the most unusual mathematicians of all time. Before he had completed high school, he came across G. S. Carr's A Synopsis of Elementary Results in Pure and Applied Mathematics, a compilation of five thousand formulae with few proofs. Ramanujan decided to prove the theorems in Carr's compilation. Since Ramanujan was unfamiliar with Western mathematical tools, he quickly fashioned his own and often wandered into strange mathematical realms of his own creation. Had Ramanujan not caught the attention of mathematics professors in England, we might never have uncovered his Notebooks, theorems of his own which baffle mathematicians to this very day. Functions → Rational Functions → Partial Fractions Sometimes we are faced with a rational function problem that seems too complicated for us. For example, if we have the function: f(x) = −5x + 15 2 x + 3x − 4 rational function how do we go about simplifying this expression? As it turns out, this expression is nothing more than: f(x) = 2 x−1 − 7 x+4 equivalent rational function We have a difficult time seeing the direct connection. We are about to show you how to transform the first rational function above into the second rational function. The method is known as partial fraction decomposition. Before we demonstrate this method with algebraic expressions, let's first use a concrete example with some real numbers. The method is exactly the same regardless if we use real numbers or algebraic expressions. Recall that when you simplify the expression: 1 3 + 1 2 you find the least common denominator, which in this case is 6. Then you can rewrite the fraction as: 1 3 + 1 2 = = 2 6 + 3 6 5 6 Now what if you had simply been given the fraction 5/6 and had been told to break it down into its parts? How would you go about solving it? Functions Partial Fractions – 161 The answer lies with partial fraction decomposition. In a sense, it is the reverse of simplification of fractions. To "decompose" the fraction 5/6, we first need to write the denominator in its factored form: 5 6 5 = factor denominator ( 2) ( 3) Now we can write the fraction as the sum of two fractions, each having in its denominator one of the two factors: 5 6 = 2 + 3 the black squares represent unknown quantities. In order to fill in the blanks, let's assign the numerators values of A and B respectively: 5 6 A = 2 + B 3 Next, we want to get rid of the 6 on the left, so we multiply both sides by 6: 5 = 5 = 6A 2 + 6B multiply both sides by 6 3 3A + 2B simplify fractions on right Now to find A and B, we have to do a little bit of guesswork since we have 1 equation and 2 unknowns. In general, you will end up with n equations with n unknowns. Then you can solve the system of linear equations using standard algebraic techniques. If we let A = 1, then we have: 5 = 3(1) + 2B let A = 1 2 = 2B subtract 3 from both sides 1 = B divide both sides by 2 We have determined that A and B are both 1. Therefore, we can now write our original fraction 5/6 in its decomposed form: 5 6 = 1 2 + 1 3 partial fraction decomposition Now we can do the same sort of tricks to transform −5x + 15 2 x + 3x − 4 into 2 x−1 − 7 x+4 partial fraction decomposition solution First, we factor the denominator, like so: −5x + 15 2 x + 3x − 4 162 – Partial Fractions = −5x + 15 ( x − 1) ( x + 4) factor denominator Functions Next, we let the expression equal the sum of two fractions, each having in its denominators one of the two factors. Again, we use A and B to represent the unknown quantities of the numerator. −5x + 15 A = ( x − 1) ( x + 4) x−1 B + x+4 rewrite fraction as sum of two fractions partial fraction decomposition Then we multiply both sides of the equation by the denominator on the left: −5x + 15 ( x − 1) ( x + 4) ⎛ A + B ⎞ ( x − 1) ( x + 4) ⎜ −1 ⎟ x + 4⎠ ⎝x multiply both sides by (x – 1)(x + 4) A( x − 1) ( x + 4) Distributive Property on right hand side simplify left hand side ( x − 1) ( x + 4) = –5x + 15 = –5x + 15 = A(x + 4) + B(x – 1) cancel common factors in each rational expression –5x + 15 = Ax + 4A + Bx – B Distributive Property x−1 + B( x − 1) ( x + 4) x+4 Now at this point it looks like we have 1 equation with 3 unknowns because we still have no idea what A and B are. However, for both sides to be equal the coefficients of x on both sides have to be the same and the constant terms must also add up to the same number. Therefore, the sum of A and B is equal to –5 and the quantity 4A – B is equal to 15. In other words, we can rewrite the last step above as: –5x + 15 = (A + B)x + 4A – B rewrite last step from above We next break this down into a system of two linear equations with two unknowns: A+B and 4A – B = –5 coefficients of x = 15 constant terms We can solve this system by substitution: 4A – 15 = B solve second linear equation for B A + (4A – 15) = –5 substitute in B = 4A – 15 into first linear equation 5A = 10 combine like terms and add –15 to both sides A = 2 divide both sides by 5 2+B = –5 substitute in A = 2 into first linear equation B = –7 subtract 2 from both sides Once we have found A and B, we can plug these into our partial fraction decomposition: −5x + 15 2 x + 3x − 4 = 2 x−1 − 7 x+4 answer It really works! Although it does take a great deal of effort, you will be glad you learned about how to do this when you take calculus. In general, there are five basic situations that occur with partial fraction decomposition. The denominator of a rational function can have distinct linear factors, distinct quadratic factors, repeated linear factors, repeated quadratic factors, or any combination of the previous four situations. Functions Partial Fractions – 163 Decomposition of N(x) / D(x) into Partial Fractions 1. Divide if improper: If N(x) / D(x) is an improper fraction, divide the numerator into the denominator to obtain: = (polynomial) + where N1(x) is the remainder from the division of N(x) by D(x). Then apply Steps 2, 3, and 4 (below) to the proper rational expression N1(x) / D(x). 2. Factor denominator: Completely factor the denominator into the form: (px + q)m and (ax2 + bx + c)n where (ax2 + bx + c) is irreducible. 3. Linear factors: For each factor of the form (px + q)m, the partial fraction decomposition must include the following sum of m fractions: 4. Quadratic factors: For each factor of the form (ax2 + bx + c)m, the partial fraction decomposition must include the following sum of n fractions: Instead of providing an example of each possible situation here, we have decided to give each situation its own thought. When decomposing fractions, you will usually end up with a "basic equation" that contains the numerator of the original fraction on the left and an expression formed from the factors of the denominator of the original fraction on the right. Solving this "basic equation" varies depending on if we have linear or quadratic factors. Here are some guidelines for solving these "basic equations": Guidelines for Solving the Basic Equations Linear Factors 1. Substitute the roots of the distinct linear factors into the basic equation 2. For repeated linear factors use the coefficients determined in step 1 to rewrite the basic equation. Then substitute other convenient values of x and solve for the remaining coefficients (for example, let x = 0). Quadratic Factors 1. Expand the basic equation. 2. Group terms according to powers of x. 3. Equate the coefficients of like powers to obtain a system of linear equations involving A, B, C, and so on. 4. Solve the system of linear equations. 164 – Partial Fractions Functions Advanced To decompose a rational function that has only distinct linear factors in the denominator. No definitions on this page. Leonardo Pisano Fibonacci (1170?-1250?), an Italian number theorist, is most famous for his recursive sequence of numbers: 1, 1, 2, 3, 5, 8, 13, 21, etc. This sequence stems from a problem involving rabbits. A pair of rabbits are kept in an enclosure and allowed to breed, producing offspring at the rate of one pair of rabbits per month, beginning with the second month. How many rabbits will exist at the end of the year, assuming none of the rabbits die? Each number in the Fibonacci Sequence corresponds to the number of pairs of rabbits at the end of each month. That is, in January and February you would have 1 pair of rabbits, in March you have 2 pairs, in April you have 3 pairs, and so on. Functions → Rational Functions → Partial Fractions → Distinct Linear Factors In the previous thought, we introduced you to the notion of partial fractions. Now we are going to give an indepth example of one of the five possible cases that occur with rational functions. In this example, we will be looking at the case where the denominator of the rational function consists of distinct linear factors. By "distinct", we mean that each linear factor only occurs one time in the denominator of the rational function. We also reproduce the method for decomposing fractions into their component parts (we leave out the part about quadratic factors in the denominator because it doesn't apply here). Decomposition of N(x) / D(x) into Partial Fractions 1. Divide if improper: If N(x) / D(x) is an improper fraction, divide the numerator into the denominator to obtain: = (polynomial) + where N1(x) is the remainder from the division of N(x) by D(x). Then apply Steps 2, 3, and 4 (below) to the proper rational expression N1(x) / D(x). 2. Factor denominator: Completely factor the denominator into the form: (px + q)m and (ax2 + bx + c)n where (ax2 + bx + c) is irreducible. 3. Functions Linear factors: For each factor of the form (px + q)m, the partial fraction decomposition must include the following sum of m fractions: Distinct Linear Factors – 165 Decompose the following rational function into its partial fraction decomposition. 2 f(x) = x + 3x + 6 3 x − 2x − x + 2 solution According to the rules of partial fraction decomposition given above, the first thing we need to do is see if we can divide the numerator by the denominator. That only happens when the degree of p(x) is greater than or equal to the degree of q(x). In this case the degree of p(x) is 2 and the degree of q(x) is 3, so we cannot use polynomial long division. Next, we factor the denominator if possible. After examining the denominator for awhile, we notice that we can apply the FOIL method in reverse: 2 x + 3x + 6 3 x − 2x − x + 2 2 = x + 3x + 6 FOIL method in reverse (x2 − 1)( x − 2) 2 = x + 3x + 6 factor difference of two squares completely factored denominator ( x − 1) ( x + 1) ( x − 2) Once we have completely factored the denominator, we can begin the process of decomposing our rational function. First, we set up the basic equation, where the rational function is set equal to the sum of its parts: 2 x + 3x + 6 ( x − 1) ( x + 1) ( x − 2) = A x−1 + B x+1 + C x−2 Basic Equation Next, we multiply both sides of the basic equation by the denominator on the left and simplify the right hand side of the equation. 2 x + 3x + 6 ( x − 1) ( x + 1) ( x − 2) ( x − 1) ( x + 1) ( x − 2) = ⎛ A + B + C ⎞ ( x − 1) ( x + 1) ( x − 2) ⎜ x − 1 x + 1 x − 2⎟ ⎝ ⎠ multiply both sides by denominator on left x2 + 3x + 6 = A(x + 1)(x – 2) + B(x – 1)(x – 2) + C(x – 1)(x + 1) simplify both sides x2 + 3x + 6 = A(x2 – x – 2) + B(x2 – 3x + 2) + C(x2 – 1) FOIL method on right hand side x2 + 3x + 6 = Ax2 – Ax – 2A + Bx2 – 3Bx + 2B + Cx2 – C Distributive Property x2 + 3x + 6 = Ax2 + Bx2 + Cx2 – Ax – 3Bx – 2A + 2B – C regroup terms according to degree of x x2 + 3x + 6 = (A + B + C)x2 – (A + 3B)x – 2A + 2B – C factor out an x2 and an x At this point, we take a moment to step back and see what we have accomplished. In order for both sides to be equal, the coefficients of the polynomial on the left have to equal the coefficients of the polynomial on the right. For instance, on the left, the coefficient of the x2-term is 1, so the quantity (A + B + C) has to be equal to 1. Using this line of reasoning, we set up a system of linear equations where the coefficients of the left hand side have to be equal to the sums of A, B and C on the right hand side: 166 – Distinct Linear Factors Functions 1 = A+B+C coefficients of x2 3 = –(A + 3B) coefficients of x 7 = –2A + 2B – C constant terms We use substitution to solve this, although we could use a variety of methods, including a method involving a matrix, but we won't do that to you here. Solve the second linear equation for A: 3 = –(A + 3B) second linear equation coefficients of x 3 = –A – 3B Distributive Property A = –3 – 3B solve for A Now substitute this value for A into the remaining two linear equations and solve: 1 = (–3 – 3B) + B + C let A = 3 – 3B in first linear equation coefficients of x2 1 = –3 – 2B + C simplify 2 + 2B = C solve for C Substitute A = 3 – 3B and C = –2 + 2B into the third linear equation 6 = –2(–3 – 3B) + 2B – (2 + 2B) substitute in A = –3 – 3B and C = 2 + 2B constant terms 6 = 6 + 6B + 2B – 2 – 2B Distributive Property 2 = 6B simplify = B divide both sides by 6 1 3 Now that we know B, we can back-substitute this into the second linear equation. 3 – 3B −3 − 3⎛⎜ 1⎞ ⎟ ⎝ 3⎠ –4 = A second linear equation = A substitute in B = 1/3 = A simplify So B = 1/3 and A = –4. Now we can find C by substituting these values into the first or third linear equations. For the sake of argument, we will substitute our values for A and B into the first linear equation: Functions Distinct Linear Factors – 167 1 = A+B+C 1 = −4 + = C 8 3 1 3 +C first linear equation coefficients of x2 substitute in A = –4 and B = 1/3 solve for C Therefore, we have: A = B = C = –4 1 3 and and 8 3 so our Basic Equation looks like: 2 x + 3x + 6 ( x − 1) ( x + 1) ( x − 2) = = 168 – Distinct Linear Factors A x−1 −4 x− 1 + B x+1 + 1 + 3 x+ 1 C x−2 8 + 3 x− 2 Basic Equation partial fraction decomposition answer Functions Advanced To decompose a rational function that has only distinct quadratic factors in the denominator. No definitions on this page. Pierre de Fermat (1601-1655) was a French number theorist. He is regarded by some mathematicians as the inventor of differential calculus, having set out the fundamentals of differential calculus in a 1636 paper. Among his more famous contributions to math and science is Fermat\s Principle of optics, wherein he states that light travels by the path of shortest duration. Fermat's son published much of Fermat's work after his death, including a theorem scribbled in a margin that had no corresponding proof. The solution to Fermat's Last Theorem puzzled mathematicians for almost 300 years until Andrew J. Wiles developed the first acceptable proof in 1944. Functions → Rational Functions → Partial Fractions → Distinct Quadratic Factors If we do not have any linear factors in the denominator of a rational function, then we must have an irreducible quadratic expression in the denominator. An irreducible quadratic expression is one that has no real linear factors. The most common example of an irreducible quadratic is: x2 + 1 irreducible quadratic This expression can only be factored by using complex numbers. There are no real numbers that will make this expression equal zero. If we have distinct irreducible quadratic factors in our denominator, we can still use the method of partial fractions to break the rational function up into the sum of its parts. However, the numerator of each of the component bits is a linear expression in x (or some other independent variable), whereas with linear factors in the denominator, we had a partial fraction decomposition involving constant terms in the numerators. An example of what we are talking about would probably be helpful right about now: A x+1 + Ax + B 2 x +1 B x−1 + partial fraction decomposition distinct linear factors constant terms in numerators Cx + D 2 x +4 partial fraction decomposition distinct irreducible quadratic factors linear expressions in numerators Note the differences between the two partial fraction decompositions. For your convenience, we once again provide you with the method of partial fraction decomposition, this time omitting the part about linear factors. Functions Distinct Quadratic Factors – 169 Decomposition of N(x) / D(x) into Partial Fractions 1. Divide if improper: If N(x) / D(x) is an improper fraction, divide the numerator into the denominator to obtain: = (polynomial) + where N1(x) is the remainder from the division of N(x) by D(x). Then apply Steps 2, 3, and 4 (below) to the proper rational expression N1(x) / D(x). 2. Factor denominator: Completely factor the denominator into the form: (px + q)m (ax2 + bx + c)n and where (ax2 + bx + c) is irreducible. 3. (the part about linear factors goes here) 4. Quadratic factors: For each factor of the form (ax2 + bx + c)m, the partial fraction decomposition must include the following sum of n fractions: Let's see an example of how this works for distinct irreducible quadratic factors. Distinct Quadratic Factors Write the partial fraction decomposition for the following rational expression. 3x 4 2 x + 5x + 4 solution Before we do anything else, we first need to make sure that this is a "proper" fraction. A proper rational expression is one where the degree of the numerator is less than the degree of the denominator. If it is not, then we have to perform long division in order to transform the rational expression into a polynomial plus a "proper" rational expression. However, in this case we don't need to worry ourselves about that, as the degree of the numerator, 1, is clearly less than the degree of the denominator, 4. Now we need to see if we can factor the denominator. By carefully looking at the denominator, we realize that it can indeed be factored: 3x 4 = 2 x + 5x + 4 3x (x2 + 1)(x2 + 4) factor the denominator Note that each factor in the denominator is an irreducible quadratic. Now we use the method of partial fraction decomposition in order to write the basic equation for this rational expression: (x 3x 2 )( 2 ) +1 x +4 170 – Distinct Quadratic Factors = Ax + B 2 x +1 + Cx + D 2 x +4 Basic Equation Functions We next need to multiply both sides of the Basic Equation by the denominator on the left: 3x = ⎛ Ax + B + Cx + D ⎞ ( x2 + 1) ( x2 + 4) ⎜ 2 ⎟ 2 x +4 ⎠ ⎝ x +1 multiply both sides by (x2 + 1)(x2 + 4) 3x = (Ax + B)(x2 + 4) + (Cx + D)(x2 + 1) Distributive Property cancel common factors in numerator and denominator 3x = Ax3 + 4Ax + Bx2 + 4B + Cx3 + Cx + Dx2 + D multiply out all binomials 3x = Ax3 + Cx3 + Bx2 + Dx2 + 4Ax + Cx + 4B + D rearrange terms in decreasing order of powers of x 3x = (A + C)x3 + (B + D)x2 + (4A + C)x + 4B + D factor out all x-terms In order for both sides to be equal, the coefficients of all x-terms have to be equal. Since there are no x3-terms on the left hand side, the sum (A + D), which is the coefficient of x3 on the right hand side, must also be equal to zero. We compare the remaining coefficients of the x-terms and construct a system 4 linear equations with 4 unknowns (A, B, C, and D). 0 = A+C coefficient of x3 0 = B+D coefficient of x2 3 = 4A + C coefficient of x 0 = 4B + D constant term We have several choices with which we can begin solving this system. Notice that both the second and fourth equations involve D and both equal 0. By toying with these two equations, we can discover: 0 = B+D second linear equation B = –D solve for D 0 = 4B + D fourth linear equation 0 = 4(–D) + D substitute in B = –D 0 = –3D simplify 0 = D divide both sides by –3 which implies that B = D = 0. Thus, B = 0 and D = 0. Next, we solve the first and third equations. Functions 0 = A+C first linear equation A = –C solve for A 3 = 4A + C third linear equation 3 = 4(–C) + C substitute in A = –C 3 = –3C simplify –1 = C divide both sides by –3 Distinct Quadratic Factors – 171 which also gives us A = –C = –(–1) = 1. Thus, A = 1 and C = –1. Altogether, we have: A = 1 and B = 0 and C = –1 and D = 0 We plug these values for A, B, C and D into our Basic Equation and get: (x 3x ) = (x2 + 1)(x2 + 4) = 2 )( 2 +1 x +4 3x Ax + B 2 + x +1 x 2 x +1 Cx + D 2 x +4 − x 2 x +4 Basic Equation substitute in A = 1, B = 0, C = –1, D = 0 partial fraction decomposition answer As we can see from this example, partial fraction decomposition is a nasty, convoluted, exhausting process. Many people would rather have their teeth extracted without any anesthetic rather than decompose rational functions. The only time we actually want to perform partial fraction decomposition is in calculus when we have to integrate unpleasant-looking rational functions. Then partial fraction decomposition is a positive boon. 172 – Distinct Quadratic Factors Functions Advanced To decompose a rational function that contains repeated linear factors in the denominator. No definitions on this page. Leonardo Pisano Fibonacci (1170?-1250?) is credited with bringing the Hindu-Arabic number system to the rest of Western Europe. He studied the Hindu-Arabic system while in Africa and quickly saw the advantages of using Hindu-Arabic numbers for calculation as opposed to the cumbersome Roman numerals in use at the time. His book Liber abaci (Book of Calculations), published in 1202, was intended to introduce Hindu-Arabic numbers and show how they could be used in business transactions. Functions → Rational Functions → Partial Fractions → Repeated Linear Factors Using the method of partial fraction decomposition is very much the same when we have repeated linear factors as when we have distinct linear factors. There is only one main difference. If we have a denominator in our rational expression that can be factored into repeated linear factors, then its partial fraction decomposition takes the form: A ( x − c) + B ( x − c) 2 C + ( x − c) 3 + .... partial fraction decomposition repeated linear factors c is an integer Note that we specified that c must be an integer. Partial fractions will still work if we have "messy" real numbers for c (such as radicals or fractions), but it gets horribly, horribly complicated when c is not a "nice" number, like 2. Again, we provide you with the partial fraction decomposition method for reference: Decomposition of N(x) / D(x) into Partial Fractions 1. Divide if improper: If N(x) / D(x) is an improper fraction, divide the numerator into the denominator to obtain: = (polynomial) + where N1(x) is the remainder from the division of N(x) by D(x). Then apply Steps 2, 3, and 4 (below) to the proper rational expression N1(x) / D(x). 2. Factor denominator: Completely factor the denominator into the form: (px + q)m and (ax2 + bx + c)n where (ax2 + bx + c) is irreducible. 3. Functions Linear factors: For each factor of the form (px + q)m, the partial fraction decomposition must include the following sum of m fractions: Repeated Linear Factors – 173 Note: We omit the decomposition into partial fractions when we have quadratic factors because that doesn't apply to this case. Repeated Linear Factors Write the partial fraction decomposition for the rational expression. Check your result algebraically. 2 4x + 3 ( x − 5) 3 solution As always, the first thing we need to do is check to make sure this is a proper fraction. Since the degree of the numerator is 2 and the degree of the denominator is 3 and 2 < 3, then we conclude that this is indeed a proper fraction and we can proceed to factor the denominator. The denominator can easily be factored: 2 2 4x + 3 ( x − 5) = 3 4x + 3 factor the denominator ( x − 5) ( x − 5) ( x − 5) Once we have factored the denominator, we are ready to build our Basic Equation, using the guidelines outlined above: 2 4x + 3 ( x − 5) ( x − 5) ( x − 5) = A x−5 B + ( x − 5) 2 + C ( x − 5) 3 Basic Equation As we do every time we apply partial fraction decomposition, we multiply both sides by the denominator on the left: 4x2 + 3 = ⎡ A + B + C ⎤ ( x − 5) ( x − 5) ( x − 5) ⎢x − 5 2 3⎥ ( x − 5) ( x − 5) ⎦ ⎣ multiply both sides by (x – 1)(x – 1)(x – 1) 4x2 + 3 = A(x – 5)(x – 5) + B(x – 5) + C Distributive Property cancel common factors in numerator and denominator 4x2 + 3 = A(x2 – 10x + 25) + Bx – 5B + C start multiplying out everything 4x2 + 3 = Ax2 – 10Ax + 25A + Bx – 5B + C Distributive Property 4x2 + 3 = Ax2 + (B – 10A)x + 25A – 5B + C rearrange terms At this point, we can make the brilliant leap of logic that A = 4, since the coefficients of x2 on each side have to be equal in order for the equation to be valid. We also note that the quantity (B – 10A) = 0 since there are no xterms on the left hand side. Finally, we can conclude that 25A – 5B + C = 3, which is the constant term on both sides. In short, we have the following system of three linear equations with three unknowns: 4 = A coefficient of x2 0 = B – 10A coefficient of x 3 = 25A – 5B + C constant term 174 – Repeated Linear Factors Functions Since we already know A, we can substitute it into the second equation to find B. Once we know A and B, we substitute them into the third linear equation and solve for C: 0 40 = B – 10A second linear equation = B – 10(4) substitute in A = 4 = B add 40 to both sides Next we substitute A = 4 and B = 40 into the third linear equation: 3 = 25A – 5B + C third linear equation 3 = 25(4) – 5(40) + C substitute in A = 4 and B = 40 3 = 100 – 200 + C simplify 103 = C add 100 to both sides to solve for C Now that we know: A = 4 and B = 40 and C = 103 we can plug these values into our Basic Equation to complete the partial fraction decomposition of our original rational expression: A 2 4x + 3 ( x − 5) ( x − 5) ( x − 5) = x−5 4 2 4x + 3 ( x − 5) ( x − 5) ( x − 5) = x−5 B + ( x − 5) 2 40 + ( x − 5) 2 C + ( x − 5) 3 Basic Equation 3 partial fraction decomposition answer 103 + ( x − 5) In order to check our partial fraction decomposition, we multiply each numerator on the right by the missing factors in its denominator. We then simplify the resulting expression and compare it to the numerator on the left. If they match, then the partial fraction decomposition is correct. 4 x−5 + 40 ( x − 5) 2 + 103 ( x − 5) 3 = = 4( x − 5) ( x − 5) + 40( x − 5) + 103 ( x − 5) ( x − 5) ( x − 5) (2 ) 4 x − 10x + 25 + 40x − 200 + 103 ( x − 5) ( x − 5) ( x − 5) 2 = 4x − 40x + 100 + 40x − 200 + 103 ( x − 5) ( x − 5) ( x − 5) 2 = Functions 4x + 3 ( x − 5) ( x − 5) ( x − 5) rewrite partial fraction decomposition in terms of LCD multiply out polynomial in numerator expand polynomial in numerator simplify numerator check Repeated Linear Factors – 175 176 – Repeated Linear Factors Functions Advanced To decompose a rational function that contains repeated quadratic factors in the denominator. No definitions on this page. Benoit B. Mandelbrot (1924- ) is most famous for his psychedelic Mandelbrot Set, an application of fractal geometry. The basic idea behind fractal geometry is that a very simple set of instructions can produce a highly complex situation, such as a picture or even a living organism. Furthermore, it is infinitely complex in that attempting to zoom in on a particular feature yields just as much complexity as the whole situation. Mandelbrot's theories have led to advances in engineering, chemistry, biology, and physics. Functions → Rational Functions → Partial Fractions → Repeated Quadratic Factors In related thoughts, we looked at examples of partial fraction decomposition involving distinct linear factors, distinct quadratic factors, and repeated linear factors. Now we will look at an example where the denominator of a rational expression has repeated quadratic factors. The principles of how to solve this type of problem remain the same as in all the other examples; we just add one slight wrinkle. If we have repeated quadratic factors in our denominator, then the partial fraction decomposition takes the form: B 1⋅ x + C 1 2 ax + bx + c + B 2⋅ x + C 2 (ax 2 + bx + c ) 2 + .... + Bn ⋅ x + C n (ax 2 + bx + c ) n partial fraction decomposition repeated quadratic factors In other words, if the denominator has the factor (ax + bx + c) repeated 3 times, then we end up with the sum of 3 fractions, which take the form given above. As always, we once again give you the summary of the method of partial fraction decomposition (omitting the part about repeated linear factors, because it just doesn't apply here). Functions Repeated Quadratic Factors – 177 Decomposition of N(x) / D(x) into Partial Fractions 1. Divide if improper: If N(x) / D(x) is an improper fraction, divide the numerator into the denominator to obtain: = (polynomial) + where N1(x) is the remainder from the division of N(x) by D(x). Then apply Steps 2, 3, and 4 (below) to the proper rational expression N1(x) / D(x). 2. Factor denominator: Completely factor the denominator into the form: (px + q)m (ax2 + bx + c)n and where (ax2 + bx + c) is irreducible. 3. (the part about linear factors goes here) 4. Quadratic factors: For each factor of the form (ax2 + bx + c)m, the partial fraction decomposition must include the following sum of n fractions: Repeated Quadratic Factors Write the partial fraction decomposition of the following rational expression. Check your result algebraically. 2 2x + x + 8 4 2 x + 8x + 16 solution First, we check to make sure that we don't have to perform long division. Since the degree of the numerator is 2 and the degree of the denominator is 4 and 2 < 4, we don't have to perform long division to get this into a proper rational expression. It is already proper. Our next step is to factor the denominator if possible. After inspecting the denominator, we can see that we can factor it as follows: 2 2x + x + 8 4 2 x + 8x + 16 2 2x + x + 8 = (x2 + 4)(x2 + 4) factor the denominator Notice that we have a repeated quadratic factor. Next, we build the Basic Equation, according to the guidelines we established at the top of the page: Ax + B 2 2x + x + 8 (x2 + 4)(x2 + 4) = 178 – Repeated Quadratic Factors 2 x +4 + Cx + D (x2 + 4)2 Basic Equation Functions Now we continue solving the problem as we do for any partial fraction decomposition problem. We multiply both sides by the denominator on the left and cancel common factors in the numerators and denominators on the right: 2x2 + x + 8 = ⎡ Ax + B + Cx + D ⎤ ( x2 + 4) ( x2 + 4) ⎢ x2 + 4 (x2 + 4)2 ⎥⎦ ⎣ multiply both sides by (x2 + 4)(x2 + 4) 2x2 + x + 8 = (Ax + B)(x2 + 4) + Cx + D cancel common factors in numerators and denominators 2x2 + x + 8 = Ax3 + 4Ax + Bx2 + 4B + Cx + D multiply out all polynomials 2x2 + x + 8 = Ax3 + Bx2 + 4Ax + Cx + 4B + D rearrange terms in decreasing order of powers of x 2x2 + x + 8 = Ax3 + Bx2 + (4A + C)x + 4B + D factor out an x-term Now we can compare the coefficients of all the x-terms on both sides of this equation. We have 0 x3-terms on the left and A x3-terms on the right, so this implies that A = 0. We have 2 x2-terms on the left and B x2-terms on the right, so this implies that B = 2. We can use A = 0 and B = 2 to find the remaining 2 unknown quantities. We set up a system of two linear equations: 1 = 4A + C coefficient of x 8 = 4B + D constant terms We already said that A = 0 and B = 2, so we can substitute these into the linear equations and solve for the unknown quantities: 1 = 4(0) + C substitute in A = 0 1 = C simplify 8 = 4(2) + D substitute in B = 2 8 = 8+D simplify 0 = D subtract 8 from both sides Thus, we have: A = 0 and B = 2 and C = 1 and D = 0 Finally, we substitute our values for A, B, C and D into our Basic Equation to complete the partial fraction decomposition: Functions Repeated Quadratic Factors – 179 Ax + B 2 2x + x + 8 (x2 + 4)(x2 + 4) = 2 2 (x2 + 4)(x2 + 4) (x2 + 4)2 x +4 2 2x + x + 8 Cx + D + = + 2 x +4 x (x2 + 4)2 Basic Equation substitute in A = 0, B = 2, C = 1 and D = 0 partial fraction decomposition answer To check our answer, we try to reverse the process we just completed. We simplify the partial fraction decomposition so that it matches our original rational expression. 2 2 x +4 + x (x2 + 4)2 = = (2 ) x + 2 (x2 + 4) (x2 + 4)2 2x +4 (2 ) (x2 + 4)2 2x +4 +x 2 = 180 – Repeated Quadratic Factors 2x + 8 + x (x2 + 4)2 rewrite expression in terms of LCD combine fractions with like denominators expand polynomial in numerator matches our original rational expression check Functions Advanced To decompose a rational function into its component parts using the method of partial fractions. No definitions on this page. Leonardo Pisano Fibonacci (1170?-1250?) was so famous in his time he even attracted the attention of royalty. In 1225, Holy Roman Emperor Frederick II traveled to Pisa to hold a mathematical tournament to test Fibonacci\'s skills. Fibonacci was the only competitor to correctly answer all three of the tournament questions. Here is the third question, posed as a riddle: Three men owned, respectively, a half, a third, and a sixth of an unknown quantity of money. Each man took an unspecified amount, leaving none left. Then each man returned a half, a third, and a sixth of what he had first taken. The returned money was divided into equal thirds and redistributed to the men. This resulted in each man acquiring his fair share. How much money did each man own? Functions → Rational Functions → Partial Fractions → Mixed Factors Before you look at this thought, you should go and review the other thoughts under Partial Fractions. In this thought, we will combine all the techniques we used in the other thought to decompose rational expressions into their partial fraction components. If we have a mixture of linear and quadratic factors in the denominator, then we apply the method of partial fractions separately for each of those factors. For instance, if we have x(x2 + 1) in the denominator, then the partial fraction decomposition takes the form: A x + Bx + C 2 x +1 partial fraction decomposition mixed linear and quadratic factors Let's look at an example of how it works in practice. Again, we give you the method of partial fraction decomposition for your convenience. Functions Mixed Factors – 181 Decomposition of N(x) / D(x) into Partial Fractions 1. Divide if improper: If N(x) / D(x) is an improper fraction, divide the numerator into the denominator to obtain: = (polynomial) + where N1(x) is the remainder from the division of N(x) by D(x). Then apply Steps 2, 3, and 4 (below) to the proper rational expression N1(x) / D(x). 2. Factor denominator: Completely factor the denominator into the form: (px + q)m (ax2 + bx + c)n and where (ax2 + bx + c) is irreducible. 3. Linear factors: For each factor of the form (px + q)m, the partial fraction decomposition must include the following sum of m fractions: 4. Quadratic factors: For each factor of the form (ax2 + bx + c)m, the partial fraction decomposition must include the following sum of n fractions: Mixed Linear and Quadratic Factors Write the partial fraction decomposition for the rational expression. Check the result algebraically. x ( 2 ) ( 2x − 1) 2x + 1 solution The degree of the numerator is 1 and the degree of the denominator is 3. Since 1 < 3, we do not have to perform long division on this rational expression and can proceed with factoring the denominator. x ( 2 ) ( 2x − 1) 2x + 1 = x ( ) 2 ( 2x − 1) 2x + 1 denominator is already factored Note that we have a mixture of linear and irreducible quadratic factors in the denominator. We now build a Basic Equation using the guidelines outlined above for each linear and quadratic factor. The Basic Equation turns out to look like: x ( 2 ) ( 2x − 1) 2x + 1 182 – Mixed Factors = A 2x − 1 + Bx + C 2 2x + 1 Basic Equation Functions As always, our next step is to multiply both sides of the Basic Equation by the denominator on the left and cancel common factors in the numerators and denominators on the right. x = ⎛ A + Bx + C ⎞ ( 2x − 1) ( 2x2 + 1) ⎜ 2x − 1 ⎟ 2 2x + 1 ⎠ ⎝ multiply both sides by (2x – 1)(2x2 + 1) x = A(2x2 + 1) + (Bx + C)(2x – 1) cancel common factors in numerators and denominators x = 2Ax2 + A + (2Bx2 – Bx + 2Cx – C) start expanding polynomials x = 2Ax2 + 2Bx2 – Bx + 2C + A – C rearrange terms in order of decreasing powers of x x = (2A + 2B)x2 + (–B + 2C)x + A – C factor out x-terms Once again, we compare coefficients on both sides of the equation. We set up a system of three linear equations with three unknowns: 0 = 2A + 2B coefficient of x2 1 = –B + 2C coefficient of x 0 = A–C constant term If we solve the third equation for A, we get: 0 = A–C third linear equation C = A solve for C If we solve the first linear equation for A, we get: 0 = 2A + 2B first linear equation 0 = A+B divide both sides by 2 –B = A subtract B from both sides We substitute A = –B and C = A into the second equation to find a value for A: 1 = –B + 2C second linear equation 1 = A + 2A substitute in A = –B and C = A 1 = 3A simplify = A divide both sides by 3 1 3 Once we have found A, we back-substitute into the first and third equations to find out that B and C are: Functions Mixed Factors – 183 A = B = C = 1 and 3 − 1 and 3 1 3 Now that we have found values for A, B and C, we can plug these values into our Basic Equation: x ( ) 2 ( 2x − 1) 2x + 1 2 ) = ) = ( 2x − 1) 2x + 1 x ( 2x − 1 Bx + C + −1 2 ( 2x − 1) 2x + 1 3 2⋅ x − 1 Basic Equation 2 2x + 1 1 x ( A = 3 + 1 ⋅x + 3 substitute in values for A, B and C 2 2x + 1 1 3( 2x − 1) − x−1 ( rewrite rational expression partial fraction decomposition answer ) 2 3 2x + 1 In order to check our answer, we rewrite all rational expressions in terms of the Least Common Denominator (LCD): 1 3( 2x − 1) − ( x−1 2 ) 3 2x + 1 = ( ) 2 1 2x + 1 ( ) 2 ( x − 1) ( 2x − 1) − ( 2 = 2x + 1 − ( x − 1) ( 2x − 1) ( = ( 2 = = 184 – Mixed Factors ) 2x + 1 − 2x − 3x + 1 ( ) 2 3( 2x − 1) 2x + 1 2 = ) 2 3( 2x − 1) 2x + 1 2 2 ) 3( 2x − 1) 2x + 1 3( 2x − 1) 2x + 1 rewrite rational expressions in terms of LCD combine fractions with like denominators expand polynomial in numerator 2 2x + 1 − 2x + 3x − 1 ( ) 2 3( 2x − 1) 2x + 1 3x ( ) 2 3⋅ ( 2⋅ x − 1) ⋅ 2⋅ x + 1 x ( 2 ) ( 2x − 1) 2x + 1 simplify numerator simplify numerator cancel like factors in numerators and denominators check Functions There are two main categories of functions studied in algebra. By now you should already be familiar with algebraic functions, which include polynomial functions and rational functions. In this section of the Algebra Brain, we will study the second main category: transcendental functions. An algebraic function is one that can be expressed as a finite number of sums, differences, multiples, quotients, and radicals involving xn. A transcendental function is one that does not include any of the above operations. Common transcendental functions include logarithmic functions, exponential functions, trigonometric functions (including inverse trigonometric functions) and hyperbolic functions. In the Algebra Brain, we will only concentrate on exponential and logarithmic functions. An exponential function is one in which the independent variable becomes the exponent. Logarithmic functions are used to solve exponential functions for the independent variable. A logarithm is the inverse of an exponent just as subtraction is the inverse of addition. Functions Transcendental Functions – 185 186 – Transcendental Functions Functions Advanced To define the exponential function and see how it can be used to model real world situations. exponential function – a function with the independent variable as an exponent. A function with an equation of the form y = abx. base – the number b in the expression bn. Gottfried Wilhelm Leibnitz (1646-1716), one of the fathers of calculus, first wrote of the binary numbers (base 2) in his treatise De Progressione Dyadica. He also corresponded with a Jesuit missionary in China, Pere Joachim Bouvet. Through Bouvet, Leibnitz realized a direct connection between the hexagrams of the Chinese I Ching and his own binary system, as shown below: Functions → Transcendental Functions → Exponential Functions Recall that a polynomial function usually involves powers of x, such as x4, x3, and x2. Here we have a variable number raised to some integer power. Now we are going to flip things around and raise an integer number to some variable power. For example, we might need to deal with 2x. Note that 2x ≠ x2 unless x happens to equal 2. A function that involves real numbers raised to variable powers is called an exponential function, because the variable is now the exponent. The real number is called the base of the exponential function and is read "base a", where a is the real number, such as 2. Definition of Exponential Function The exponential function f with base a is denoted by f(x) bax = exponential function where b ≠ 0, a > 0, a ≠ 1, and x is any real number. Note: The base a = 1 cannot be used in an exponential function because then we have: f(x) = 1x = 1 which is a constant function and not an exponential function. In order to be able to work effectively with exponential functions, you will need to be very familiar with the properties of exponents. It might be a good time at this point for you to review how exponents work before continuing with this discussion. Evaluating exponential functions can be tricky. It is relatively simply to evaluate 4x = 64 when x = 3 or 4x = 2 when x = 1/2. However, to evaluate 4x for any real number x, it is necessary to include all the irrational numbers as well, such as √2. For most applications involving irrational exponents, we have to work with approximate values that we obtain by using a calculator. Functions Exponential Functions – 187 Let's look at the behavior of exponential functions. Their graphs are quite a bit different from any other type of function you may have studied so far. However, the graph of an exponential function is distinctive enough to be immediately recognizable as such once you know what to look for. In the same coordinate plane, sketch the graph of each function. a.) f(x) = 2x b.) g(x) = 4x and solution We plot points using standard plotting techniques. We assign integer values for x, then evaluate f(x) and g(x) at those points, listing them in a table and then plotting them in the plane. x f(x) g(x) –2 1 4 1 16 –1 1 2 1 4 0 1 2 3 1 2 4 8 1 4 16 64 Note that the function g is increasing much more rapidly than the function f. This will easily show up on our graph, which is shown below: answer For values of x > 1, the function g increases faster than f. For values of 0 < x < 1, the function g decreases faster than f. This is true in general of exponential functions. If we have two exponential expressions ax and bx such that a > b, then ax > bx for values of x > 1 and ax < bx for values of x < 1. In the same coordinate plane, sketch the graph of each function. a.) F(x) = 2–x b.) G(x) = 4–x 188 – Exponential Functions and Functions solution Again we are not restricting x to any particular domain, allowing for x to be any real number. We list some values for each function based on convenient integer values for x. We could just as easily let x be irrational. x –3 –2 –1 0 F(x) 8 4 2 1 G(x) 64 16 4 1 1 1 2 1 4 2 1 4 1 16 Note that this table is basically the mirror image of the table in the previous example. Also note that both graphs are decreasing. The graph of G(x) = 4–x is decreasing more rapidly than the graph of F(x) = 2–x. The graph of these functions looks like: answer The graphs of F and G are the mirror images of f and g. That is, F is a reflection of f (in the y-axis). Notice that each of these graphs has only one y-intercept. They also have a horizontal asymptote located at y = 0. They are also continuous for all real values of x. We can summarize the behavior of the graphs of y = ax, a > 1 and y = a–x, a > 1 as follows. Graph of y = ax, a > 1 · Domain: (– , ) · Range: (0, ) · Intercept: (0, 1) · Increasing · x-axis is horizontal asymptote (ax → 0 as x → – ) · Continuous Graph of y = a–x, a > 1 · Domain: (– , ) · Range: (0, ) · Intercept: (0, 1) · Decreasing · x-axis is horizontal asymptote (a–x → 0 as x → ) · Continuous Finally, we need to see what happens when we transform the function f(x) = ax so some new function of the form: g(x) = b ± ax + c Let's see if we can’t guess what will happen to the function. If we add or subtract b from the function f, all we are doing is moving the y-intercept of the function. So if we have the function h(x) = 2x – 1, the graph of this function will look identical to the function f(x) = 2x given in the first example except that we move the entire function 1 unit down the y-axis so that its new y-intercept is (0, 0). If we have k(x) = 2x + 1, then we are also changing the y-intercept of the function. The new function k will have its y-intercept located at the point (0, 2) when x = 0, because k(0) = 20 + 1 = 21 = 2. The function k(x) = 2x + 1 will increase faster than f(x) = 2x. Functions Exponential Functions – 189 If we have j(x) = –2x, this is equivalent to –f(x), which means that the graph of j(x) can be obtained by reflecting f(x) in the x-axis. Finally, if we have m(x) = 2–x, this is equivalent to f(–x), which means that the graph of m(x) can be obtained by reflecting f(x) in the y-axis. Each of these four situations is shown below. f(x) = 2x h(x) = 2x – 1 f(x) = 2x k(x) = 2x + 1 f(x) = 2x j(x) = –2x f(x) = 2x m(x) = 2–x 190 – Exponential Functions Functions Advanced To learn the definition of logarithmic functions. common logarithmic function – logarithmic function that uses base 10. John Napier (1550-1617), the Scottish mathematician who invented logarithms, also created an early calculating device in the form of a chess board. He expressed his numbers in normal base ten notation, but the means of calculation relied on base two. Essentially, he used the fact that the sum or product of any whole number can be written as the sum or product of powers of 2. For instance, the number 17 is equal to 24 + 20 = 16 + 1 = 17. Functions → Transcendental Functions → Logarithmic Functions What are logarithms and what is a logarithmic function? We can define a logarithm in several different ways. One way is graphically. To understand the graphical interpretation of the logarithm, you will first need to understand inverse functions and the exponential function. A logarithmic function can be defined as the inverse of the exponential function. That is, the domain of an exponential function is the range of a logarithmic function and the range of an exponential function is the domain of a logarithmic function. The main reason why exponential functions all have an inverse is because they pass the Horizontal Line Test. In other words, any horizontal line drawn across the graph of the exponential function only crosses the exponential function once. The formal definition of a logarithmic function is as follows: Definition of Logarithmic Function For x > 0 and 0 < a ≠ 1, y = loga x if and only if x = ay The function given by f(x) = loga x is called the logarithmic function with base a. Note: The equations y = loga x and x = ay are equivalent. The first equation is in logarithmic form and the second equation is in exponential form. This will become very useful knowledge when we begin solving exponential and logarithmic equations. A logarithm is an exponent, plain and simple. The expression loga x is the exponent the base a must be raised to obtain x. For example, log 2 16 = 4 because 2 must be raised to the fourth power (24) to get 16. A logarithm can have any base we need. Common bases include base 2 (binary), base 8 (octal), and base 16 (hexadecimal). The most common base by far is base 10 (decimal). Since it is so common, we don't even write log10 x, we simply use log x. A logarithmic function with base 10 is called the common logarithmic function. Logarithmic functions have a number of properties that we can exploit when dealing with them. Functions Logarithmic Functions – 191 Properties of Logarithms 1.) loga 1 = 0 because a0 = 1 2.) loga a = 1 because a1 = a 3.) loga ax = x because ax = ax 4.) If loga x = loga y, then x = y Graphs of Logarithmic Functions What does the graph of a logarithmic function look like? We have already stated that logarithmic functions are inverses of exponential functions. By definition, the graphs of inverse functions are reflections of each other in the line y = x. In the same coordinate plane, sketch the graph of each function. a.) f(x) = 3x b.) g(x) = log3 x solution a.) For the function f(x) = 3x, we can construct a table of values as follows: x f(x) –2 1 9 –1 1 3 0 1 2 3 1 3 9 27 We can then plot these points and connect them with a smooth curve to obtain the red curve on the graph shown below. b.) Since g(x) = log3 x is the inverse of f(x) = 3x, the graph of g(x) is the reflection of f(x) about the line y = x, as shown by the blue curve on the graph shown below. answer Compare the logarithmic curve (blue) to the exponential curve (red) above. The exponential function has one yintercept and one horizontal asymptote located at y = 0. By contrast, the logarithmic function as one x-intercept and one vertical asymptote located at x = 0. This behavior is typical for logarithmic functions, although we can 192 – Logarithmic Functions Functions move them around the Cartesian plane in much the same fashion as we move exponential functions. The basic characteristics of logarithmic graphs are summarized below: Graph of y = loga x, a > 1 · domain: (0, ) · range: (– , ) · intercept: (1, 0) · increasing · y-axis is a vertical asymptote (loga x → – as x → 0–) · continuous · reflection of the graph of y = ax about the line y = x Find the domain, vertical asymptote and x-intercept of the following logarithmic function. Then sketch its graph. f(x) = –log6 (x + 2) solution First of all, remember that a logarithm of any kind is just an exponent. The negative sign tells us that this is a negative exponent. The graph of an exponential function with a negative exponent is reflected about the y-axis. Since a logarithmic function is the inverse of an exponential function, it stands to reason that a negative logarithmic function is reflected about the x-axis. Therefore, the graph of this function looks like the upsidedown image of the logarithmic function given above. Next, we need to find the domain of this function. Logarithmic functions are only defined when the argument of the function is greater than zero. In this case, the argument of f(x) is (x + 2). In order for (x + 2) = 0, x has to be equal to –2. Therefore, the domain of this function is all values of x such that x > –2. answer The vertical asymptote also occurs at the point where the argument of the function goes to zero. We already stated that f(–2) = log6 (–2 + 2) = log6 (0) = undefined, so the vertical asymptote for this function is the line x = –2. answer The x-intercept occurs at the point f(x) = 0. Therefore, we need to solve the equation f(x) = 0: –log6 (x + 2) = 0 set f(x) = 0 x+2 = 1 equivalence of logarithm and exponent: 60 = (x + 2) = 1 x = –1 subtract 2 from both sides Thus, the x-intercept is located at (–1, 0). The graph of this function looks like: Functions Logarithmic Functions – 193 answer 194 – Logarithmic Functions Functions Advanced To review the properties of logarithms, which are closely related to the properties of exponents. No definitions on this page. Logarithms are a result of John Napier's (1550-1617) interest in astronomy. Astronomy relies heavily on tedious calculation of trigonometric functions. Napier invented logarithms to speed the calculations. Henry Biggs worked with Napier and constructed a set of standardized logarithmic tables. These tables were used until computers and calculators became powerful enough to render the tables obsolete (around the mid to latter part of the 20th century). Functions → Transcendental Functions → Logarithmic Functions → Properties of Logarithms Logarithms are exponents. As such, it should come as no surprise that the logarithm operator has several properties associated with it. Many of these properties are similar to those exhibited by exponents. Again, this is because a logarithm is exactly the same as an exponent. Here is the list of basic properties, which are also found in the thought Logarithmic Functions: Properties of Logarithms 1.) loga 1 = 0 because a0 = 1 2.) loga a = 1 because a1 = a 3.) loga ax = x because ax = ax 4.) If loga x = loga y, then x = y Change of Base In addition to these basic properties, we often find it useful or necessary to change the base of a logarithm to something more practical. For instance, many scientific calculators cannot evaluate log5 12 because they are not equipped to handle base 5 numbers. However, they can evaluate logs of base 10 (common logs). So if we can somehow transform log5 to log10, we can go ahead and use our calculator to evaluate the expression log5 12 once it has been converted from base 5 to base 10. This is known as "change-of-base". The formula for changing from one base to another base is given by: Change-of-Base Formula Let a, b, and x be positive real numbers such that a 1 and b 1. Then loga x is given by: loga x Note: Functions = Change-of-Base Formula One way to think of the change-of-base formula is that logarithms to base a are simply constant multiples of logarithms to base b. The constant multiplier is 1 / (logb a). Properties of Logarithms – 195 Evaluate the logarithm using the change-of-base formula. Round the answer to three decimal places. a.) log3 7 b.) log9 0.4 c.) log15 1250 solution Although we are free to change the base of each logarithm to any base we want, a convenient base is 10, since most calculators can deal with log base 10 rather easily. Thus, we will use the change-of-base formula to convert each logarithm to base 10. a.) log3 7 log10 7 = log10 3 0.845 ≈ b.) log9 0.4 log 10 0.4 = log 10 9 −0.398 0.954 ≈ log15 1250 simplify, rounding to three decimal places answer 1.771 ≈ c.) use a calculator to evaluate each logarithm round to three decimal places 0.477 ≈ Change-of-Base Formula let b = 10, x = 7 and a = 3 –0.417 = ≈ ≈ Change-of-Base Formula let b = 10, x = 0.4, and a = 9 use a calculator to evaluate each logarithm round to three decimal places simplify, rounding to three decimal places answer log 10 1250 log 10 15 3.097 1.176 2.634 Change-of-Base Formula let b = 10, x = 1250, and a = 3 use a calculator to evaluate each logarithm round to three decimal places simplify, rounding to three decimal places answer Simplifying Logarithms Many times we are given a logarithm that is unnecessarily complicated. There are a number of properties we can use to transform a complicated logarithm into something that is much simpler for us to work with. These are collectively given as some more properties of logarithms, below: 196 – Properties of Logarithms Functions More Properties of Logarithms Let a be a positive number such that a 1. Let n be a real number. If u and v are positive real numbers, then the following properties are true. 1.) loga (uv) 2.) loga un 3.) Note: = loga u + loga v Product Property of Logarithms = loga u – loga v Quotient Property of Logarithms = n loga u Power Property of Logarithms There is no general property that can be used to rewrite loga (u that loga (u v) = loga u + loga v. v). More specifically, it is not true In order to see why the first property is true, consider the following. Let x = loga u y = loga v and Then their corresponding exponential forms are given by: ax = u exponential form ay = v exponential form If we multiply u and v, we get: uv = a xa y multiply u and v together = ax + y add exponents of like bases The corresponding logarithmic form of uv is: loga (uv) = x+y logarithm form of uv = loga u + loga v substitute in x = loga u and y = loga v proof We can prove the other two properties with a similar approach. Use the properties of logarithms to write the logarithm as the sum, difference, and/or constant multiples of logarithms. Assume all variables are positive. Functions a.) log8 x3 b.) log xyz c.) log ⎛⎜ ⎞ ⎟ ⎝ x + 1⎠ x 2 Properties of Logarithms – 197 solution a.) Here we clearly need to apply the Power Property of Logarithms to move the exponent of x3 out in front of the logarithm: log8 x3 b.) = Power Property of Logarithms answer Since we have a product of three variables as the argument of our logarithm, the Product Property of Logarithms tells us that we can write this as the sum of three logarithms, one for each variable. Note that we do not specify the base of the log, so it is implied that the base is 10. log xyz c.) 3 log8 x = Product Property of Logarithms answer log x + log y + log z This one is tricky. First, we use the Quotient Property of Logarithms to separate the rational expression inside the parentheses into the difference of two logarithms. Then we note that the denominator of the rational expression is the equivalent of (x2 + 1)1/2, which means that we next need to apply the Product Property of Logarithms to the second term in our difference: log ⎛⎜ ⎞ ⎟ ⎝ x + 1⎠ x 2 = ( ) 2 (2 log ( x) − log x + 1 = log ( x) − 1 2 (2 1 )2 = 198 – Properties of Logarithms Quotient Property of Logarithms log ( x) − log x + 1 convert square root to fractional exponent ) log x + 1 Product Property of Logarithms answer Functions Advanced To learn about base e and some of its applications. natural logarithm – logarithms using the base e. David Hilbert's (1862-1943) reputation as a mathematician was founded on his research into invariant theory. An invariant is an expression that remains the same under various transformations. For instance, multiplying a linear equation by 2 does not change the solution of the equation. Most research on invariants up until that time required massive amounts of calculation. Hilbert took a path that did not require explicit calculation. Other mathematicians who studied invariant theory were taken aback. One described Hilbert's approach as "not mathematics, but theology." Functions → Transcendental Functions → Logarithmic Functions → Natural Base e Natural Exponential Function One of the most useful and interesting numbers in mathematics is the number e, where e ≈ 2.718281828…. Along with π, the number e is one of the most famous irrational numbers. The number e has a number of applications, particularly when we use it as a base in exponential functions. The function f(x) = ex is called the natural exponential function. The graph of this function is shown below. Notice its similarity to other exponential functions. What makes the natural exponential function different from all other exponential functions is that it is the only function whose slope at any point along the curve is equal to the function evaluated at that point. For instance, if we draw a line so that it is tangent to the curve at the point (1, e), the slope of the tangent line is e. If we draw the tangent line at the point (2, e2), the slope of the tangent line is e2. You will learn much more about the behavior of e and the slope of curves in calculus. Functions Natural Base e – 199 Since the number e is a base used in exponential functions, all the properties of exponential functions apply equally to the natural exponential function. For example: exe2 = e2 + x add exponents of like bases (ex)2 = e2x multiply exponents One particular application that may be of interest (pardon the pun) is compound interest on a bank account. It can be shown that the expression n ⎛1 + 1 ⎞ ⎜ ⎟ n⎠ ⎝ approaches the number e as n approaches infinity. Compound interest is defined in terms of this expression. For interest that is compounded continuously, n → , and thus, we replaces the expression above with e. Natural Logarithmic Function In addition to its usefulness in exponential functions, e is so useful that we use it as one of the two most common bases for logarithms (base 10 is the other base we usually use). We even give the logarithm of base e a special symbol: ln x logarithm of base e of x The symbol "ln" stands for natural logarithm. The symbol "ln x" is read as "el en of x". More formally, we can define the natural logarithmic function as: The Natural Logarithmic Function The function defined by f(x) = loge x = ln x x>0 is called the natural logarithmic function. All of the properties that apply to common logarithms apply equally to natural logarithms. The number e is just another base, like base 2, base 6, or base 10, so it only stands to reason that it would obey those properties. Properties of Natural Logarithms 1.) ln 1 = 0 because e0 = 1 2.) ln e = 1 because e1 = e 3.) ln ex = x because ex = ex 4.) If ln x = ln y then x = y The domain and the range of the natural logarithmic function are the same as for the common logarithmic function. The argument of the natural logarithm can never be less than or equal to zero. The graph of the natural logarithmic function is shown below. Notice its similarity to the common logarithmic function. 200 – Natural Base e Functions If you compare this to the graph of the natural exponential function given above, you can see that the domain of the natural exponential function is the range of the natural logarithmic function. Since this is the case, we can conclude that the natural exponential function and the natural logarithmic function are inverses of each other. Shown below are both functions on the same graph. Notice that they are reflections of each other about the line y = x, just like the normal exponential function and the normal logarithmic function: Functions Natural Base e – 201 202 – Natural Base e Functions Advanced To solve exponential and logarithmic equations. inverse properties – one property that can be used to undo another property, and vice versa. In 2002, the executives and auditors at Enron, WorldCom, Arthur Andersen and a host of other companies were commended by the Ig Nobel committee (a spoof of the actual Nobel committee) "for adapting the mathematical concept of imaginary numbers for use in the business world." Functions → Transcendental Functions → Exponential Functions → Exponential / Logarithmic Eq So far in this section of the Algebra Brain, we have looked at the definitions and graphs of exponential and logarithmic functions. Now it is time to put the two together and start solving equations involving exponents and logarithms. Let's start simple. Suppose we have the equation 2x = 64. How do we go about solving this equation for x? One of the properties of exponents states that ax = ay if and only if x = y. We can solve 2x = 64 by rewriting the equation in the form 2x = 26. This implies that x = 6, which is indeed the case. Most of the time, the problem is not so simple and we have to rely on other means to solve the equation. What if we had 2x = 9? This is not quite so easily solved using the property we used before. However, we can take advantage of the fact that exponential and logarithmic functions are inverses of each other. Thus, if we are presented with an exponential equation, we can use a logarithm to solve it. If we have a logarithmic equation, we can exponentiate to solve it. The following are the inverse properties of logarithmic and exponential functions: Inverse Properties of Logarithmic and Exponential Functions base a loga ax = x 1.) base e ln ex = x eln x = x 2.) If we use the above properties to solve 2x = 9, we can take the log2 of both sides to obtain: 2x = 9 original equation logz 2x = log2 9 take log2 of both sides x = log2 9 log2 2x = x because 2x = 2x Using the Change-of-Base Formula and a calculator, we can approximate a value for x: x = = ≈ log2 9 log( 9) log( 2) 3.17 value for x in log2 form Change-of-Base Formula approximate logarithms using a calculator As we can see, this means that 23.17 ≈ 9, which makes sense since 9 is between 8 (23) and 16 (24). Functions Exponential / Logarithmic Equations – 203 Here are a few guidelines to use when solving exponential and logarithmic equations: Guidelines for Solving Exponential and Logarithmic Equations 1.) To solve an exponential equation: First isolate the exponential expression, then take the logarithm of both sides and solve for the variable. 2.) To solve a logarithmic equation: Rewrite the equation in exponential form and solve for the variable. As always, it is nice to see these guidelines in action, so here are a couple of examples. Solve the exponential equation algebraically. Round the result to three decimal places. 7 – 2e3x = 1 solution Since we have an exponential equation, we apply the first guideline given above. We start by rearranging the equation so that the exponential term (2e3x) is all by itself. Then we proceed to take the natural logarithm of both sides (since we have base e). This will allow us to solve for the variable by applying one of the inverse properties given above. 7 – 2e3x = 1 given equation 6 = 2e3x subtract 1 from both sides add 2e3x to both sides 3 = e3x divide both sides by 2 ln 3 = ln e3x take natural log of both sides ln 3 = 3x since ln ex = x, it stands to reason that ln e3x = 3x = x divide both sides by 3 ≈ x use a calculator to evaluate the left hand side round to three decimal places answer ln⋅ 3 3 0.366 Solve the logarithmic equation algebraically. Round the result to three decimal places. ln x + ln (x – 2) = 1 solution This is somewhat tricky. First of all, we recognize that we can, in fact, combine the logarithms into a single logarithm. Then we exponentiate both sides. This will allow us to use the second inverse property of logarithms to solve for the variable. 204 – Exponential / Logarithmic Equations Functions ln x + ln (x – 2) = 1 given equation ln (x(x – 2)) = 1 sum of logarithms of same base is equal to logarithm of the product of the arguments of the logarithms ln (x2 – 2x) = 1 simplify expression inside parentheses (2 ) = e1 exponentiate both sides x2 – 2x = e since eln x = x, it makes sense that eln (argument) = (argument) x2 – 2x + 1 = e+1 add 1 to both sides to complete the square on the left (x – 1)2 = e+1 factor left hand side x–1 = take square root of both sides x = add 1 to both sides x ≈ e ln x − 2x 2.928 round to three decimal places answer Note that we discarded the possible negative solution for x. This is because if x is –0.928, we have ln (–0.928 – 2) in our original equation, which will give us ln (–2.928). We can never take the logarithm of a negative number, so we can disregard any negative values of x. Functions Exponential / Logarithmic Equations – 205 206 – Exponential / Logarithmic Equations Functions Exponential and logarithmic functions have a wide range of applications in the real world. Exponential functions are used to model exponential growth and decay, particularly population growth and radioactive decay. We can also use an exponential function to compile statistical data. The famous "bell curve" is one such device. A group of test scores, for example, will often be displayed as a bell curve, where the peak of the curve represents the median scores on the exam. High scores are on the right side of the curve, and low scores are on the left side of the curve. Although an exponential growth model can be used to predict population growth under certain conditions for certain organisms, a much more realistic model is the logistics growth model. An environment can only support so much growth. After a while, the rate of growth tapers off as the resources of the environment diminish. The logistics growth model reflects this more realistic trend of population growth. Finally, we have logarithmic models, which use logarithms to give us information about a situation. Often a logarithmic model is more convenient because the numbers involved are really large or really small. Thus, we only talk about the exponent, which is a more manageable number. Situations that involve logarithms include the pH scale for measuring hydrogen ion concentrations in solution, the decibel scale for measuring relative sound intensity, and the Richter scale, used to measure the magnitude of earthquakes. Thus, we have four basic models where logarithmic and exponential functions come in handy: 1.) Model Exponential Growth / Decay: • If b > 0, exponential growth • If b < 0, exponential decay Basic Form of Equation y = aebx − ( x− b) 2.) Gaussian: y = 3.) Logistics Growth: y = ae 2 c 1 − ( x− c) 1 + be 4.) Logarithmic: d y = a + b ln x y = a + b log10 x or The graphs of the basic forms of these models are shown below: Exponential Growth Model Functions Exponential Decay Model Exponential and Logarithmic Models – 207 Gaussian Model Logistics Growth Model Note: The logistics growth model contains two horizontal asymptotes. Logarithmic Model (base e) Logarithmic Model (base 10) In the thoughts below this one, we will look at each of the models in more detail. 208 – Exponential and Logarithmic Models Functions Advanced To demonstrate how to use exponential functions to model real world situations. No definitions on this page. Omar Khayyam (c.1048-1131) was a Persian mathematician, poet, astronomer, and philosopher. His greatest mathematical achievement was the application of Greek geometric methods to solve cubic equations. Greek algebra used geometry to solve algebraic problems involving quadratic equations. Khayyam used similar techniques to solve cubic equations, using the conic sections. Functions → Transcendental Functions → Exponential Functions → Exponential / Logarithmic Eq → Logarithmic / Exponential Models → Exponential Growth and Decay Exponential growth and decay is one of the more common applications of exponential functions. The basic equation for any type of growth/decay problem is given by: y aebx = exponential growth / decay equation If b > 0, then we have an exponential growth problem. If b < 0, then we have an exponential decay problem. Exponential Growth Many types of organisms experience exponential growth in the initial stages of their population. Bacteria are a prime example, but this also applies to viruses, parasites, even larger organisms, such as frogs and snakes, if they have enough food to eat. Exponential growth equations often take the form: y(t) = Ce kt exponential growth equation where C is the initial amount of whatever is growing, k is a constant of proportionality and t is time. A certain colony of genetically engineered bacteria doubles its population every 3 hours. Suppose we start out with a mere 250 bacteria. How long will it take to for the colony to reach 100,000 members? solution First, we always write down what we are given in any kind of word problem. Here, we know that we start with 250 bacteria. After 3 hours, this number has doubled to 500. Thus we have: C = 250 number of bacteria we started with y(3) = 500 number of bacteria after 3 hours t = 3 length of time needed to double colony size k = ? constant of proportionality we don't know In order to find out how long it will take for the colony to reach 100,000 members, we first need to find k, the constant of proportionality: Functions Exponential Growth and Decay – 209 y(t) = Cekt exponential growth equation 500 = 250e3k substitute in y(3) = 500, C = 250, and t = 3 2 = e3k divide both sides by 250 ln 2 = ln e3k take natural log of both sides ln 2 = 3k Power Property of Logarithms and ln e = 1 = k divide both sides by 3 ≈ k approximate k to three decimal places ln( 2) 3 0.231 Now that we have an approximate value for k, we can go ahead and use the exponential growth equation again, this time letting y(t) = 100,000, C = 250, and k ≈ 0.231. This will let us solve the equation for t. y(t) = Cekt exponential growth equation 100,000 = 250e0.231 t substitute in y(t) = 100,000, C = 250 and k = 0.231 400 = e0.231 t divide both sides by 250 ln 400 = 0.231 t take natural log of both sides = t divide both sides by 0.231 ≈ t round t to three decimal places answer ln( 400) 0.231 25.937 After about 26 hours, our colony of 250 bacteria will have increased to 100,000 members. Exponential Decay This is essentially the opposite of exponential growth. Here we start out with some initial amount of material and over time the amount decreases at an exponential rate. The basic equation for exponential decay looks almost identical to the equation for exponential growth with one slight difference: y(t) = Ce − kt exponential decay equation Note that we have added a negative sign in front of the k. This tells us immediately that we are dealing with exponential decay. The rest of the terms have the exact same meaning as they do for exponential growth. C is our initial amount, y(t) is the amount we have left after time t, and k is the constant of proportionality. The minus sign in front of the k tells us that the initial amount is decreasing over time. Radium-226 (226Ra) has a half-life of 1620 years. This means that if we have a 1-kg sample of radium right now, in 1620 years, the amount of radium we have left will be around 0.5 kg, one-half the mass of our original sample. Suppose we have a 10-g sample of 226Ra now. How much will we have left 10 years from now? 20 years? 10,000 years? 210 – Exponential Growth and Decay Functions solution Once again, we have to determine the constant of proportionality, k. We start be letting C = 10 (our original amount), y(1620) = 5 (one-half our original amount) , and let t = 1620, the half life of radium. 5 = 10e–k (1620) exponential decay equation with y(1620) = 5, C = 10, t = 1620 0.5 = e–1620 k divide both sides by 10 ln 0.5 = –1620 k take natural log of both sides = k divide both sides by –1620 ≈ k approximate value for k −ln( 0.5) 1620 0.0004279 Now that we know k, we can start using k to help us figure out how much radon we have after 10 years, 20 years, and 10,000 years. y(10) = 10e–0.0004279 (10) substitute in k = 0.0004279, C = 10, and t = 10 y(10) ≈ 9.957 evaluate with a calculator answer After 10 years, our 10-g sample has only decreased by about 43 milligrams. Now let's see how much we have left after 20 years: y(20) = 10e–0.0004279 (20) substitute in t = 20 y(20) ≈ 9.915 evaluate with a calculator answer After 20 years, our 10-g sample has only decreased by 85 milligrams. Finally, let's see how much we have left after 10,000 years. We can expect that it should be reduced by a significant amount since half of it disappears after only 1620 years. y(10000) = 10e–0.0004279 (10000) substitute in t = 10000 y(10000) ≈ 0.139 evaluate with a calculator answer We have only 1.4% of our original sample left after 10,000 years. Functions Exponential Growth and Decay – 211 212 – Exponential Growth and Decay Functions Advanced To learn about one of the basic exponential functions used to model data. normal distribution – the continuous function that the binomial distribution approaches as n, the number of trials, increases without bound. bell-shaped curve – The graph of a normal distribution. Nicolaus Mercator (1619?-1687) is a lesser-known, but still important mathematician. In 1668, he published his greatest work, Logarithmicotechnia>, wherein he computes a table of logarithms. He also described an equation that now bears his name. It is a series expansion that computes the area under a hyperbola. Besides his mathematical skills, he was an inventor, designing a pendulum watch named after Christiaan Huygens, creator of the pendulum clock. Functions → Transcendental Functions → Exponential Functions → Exponential / Logarithmic Eq → Logarithmic / Exponential Models → Gaussian One of the most useful models for looking at data is the Gaussian model, named after that paragon of mathematics, Carl Friedrich Gauss. In general, Gaussian models take the form: − ( x− c) y = 2 c ae Gaussian model This type of model is commonly used in probability and statistics to represent populations that are normally distributed . For standard normal populations, the model takes the form: 2 −x y = 1 σ 2π e 2 standard normal distribution model 2σ where σ is the standard deviation (σ is the lowercase of the Greek letter sigma). When we graph the Gaussian model, we end up with a bell-shaped curve . Bell-shaped curves are useful in that they can tell us the percentage of a certain event occurring. Let's look at an example. A certain game requires us to roll two six-sided dice (abbreviated as 2d6) and add the two numbers together. If we roll "1" and "5", the total result is "6". The smallest number we can roll is 2 and the largest number we can roll is 12. The number we roll on 2d6 roughly follows a normal distribution given by: − ( x− 7) y = 0.1445e 15.2 2 2 ≤ x ≤ 12 where x is the number we roll on 2d6. Sketch the graph of this function. From the graph, estimate the average number we can roll on 2d6. Functions Gaussian Model – 213 solution Using an automatic graphing utility, we can graph the function below: The peak of the bell-curve corresponds to the mean, or average. In this case, the average number rolled on 2d6 is 7. answer What the graph illustrates is that we can expect about 50% of our rolls to total 7 or more and about 50% of the rolls to total 7 or less. We will explore probabilities in much more detail in another section of the Algebra Brain. 214 – Gaussian Model Functions Advanced To model real-world situations using logarithmic functions. No definitions on this page. John Napier (1550-1617) was rumored to be a magician as well as a mathematician. Supposedly he possessed supernatural powers and kept a black rooster as a supernatural familiar. Functions → Transcendental Functions → Exponential Functions → Exponential / Logarithmic Eq → Logarithmic / Exponential Models → Logarithmic The logarithmic model is used in a variety of situations. In general, the logarithmic model takes the form: y = a + b ln x base e y = a + b log x base 10 Which one we use depends largely on the situation. Base 10 is very commonly used to measure the magnitude of earthquakes, the intensity of sound, or the hydrogen ion concentration in aqueous solution (called pH). The level of sound β (measured in dB), with an intensity of I (measured in watts per square centimeter W / cm2) is given by: β (I) = 10log 10⎛⎜ I ⎞ ⎟ ⎝ I0 ⎠ level of sound in decibels (dB) where I0 is an intensity of 10–16 W / cm2, the faintest sound that can be heard by the human ear. If the loudest car stereo on earth can produce 165 dB level sounds, what is the intensity of the sound wave it produces? Functions Logarithmic Model – 215 solution We let β(I) = 165 and solve the equation decibel equation for I: 165 = 10log 10⎛⎜ 16.5 = log 10⎛⎜ 16.5 = log I – log 10–16 Quotient Property of Logarithms 16.5 = log I – (–16) simplify second logarithm using Properties of Logarithms 16.5 = log I + 16 simplify 0.5 = log I subtract 16 from both sides 100.5 = 10log I exponentiate both sides 100.5 = I simplify 3.162 ≈ I approximate value of I to three decimal places answer I − 16 ⎝ 10 I − 16 ⎝ 10 ⎞ ⎟ ⎠ ⎞ ⎟ ⎠ substitute in β(I) = 165 and I0 = 10–16 divide both sides by 10 Thus, the intensity of the sound wave produced by the car stereo is about 3.162 W / cm2. This is a considerable amount of energy per unit area. It is a blast loud enough to pop balloons, shatter glass, and even make your hair stand on end. It will also cause you to go permanently deaf if you do not take precautions. Let's compare this car stereo (featured on Ripley's Believe it or Not!) to a commercial jetliner, which produces a noise level of about 140 dB when it takes off. How much more intense is the sound produced by the car stereo above than that of a commercial jetliner? solution First, we need to find the intensity of the sound produced by the jetliner. Since we already stated that a jetliner has a sound level of about 140 dB, we solve for the intensity exactly the same as we did above: 140 = 10log 10⎛⎜ 14 = log 10⎛⎜ 14 = log I – log 10–16 Quotient Rule of Logarithms 14 = log I + 16 simplify second log with Properties of Logarithms –2 = log I subtract 16 from both sides 10–2 = I exponentiate both sides 0.01 = I simplify I − 16 ⎝ 10 I − 16 ⎝ 10 216 – Logarithmic Model ⎞ ⎟ ⎠ ⎞ ⎟ ⎠ substitute in β(I) = 140 and I0 = 10–16 divide both sides by 10 Functions The sound that the jetliner produces has an intensity of about 0.01 W / cm2. If we divide the car stereo sound by the sound of the jetliner, we will find how much louder the stereo is than the jetliner: 3.162 0.01 = 316.2 divide intensity of stereo by intensity of jetliner Thus, the car stereo is 316 times louder than the jetliner. Functions answer Logarithmic Model – 217 218 – Logarithmic Model Functions Advanced To see a more realistic model of population growth. logistics curve – a growth curve described by the equation below: sigmoidal curve – another name for a logistics growth curve. Omar Khayyam (c.1048-1131), in attempting to prove Euclid's Fifth Postulate, inadvertently discovered nonEuclidean geometries. The Fifth Postulate states that, given a line a, only one line parallel to a can be drawn through a point not on a. Khayyam tried to prove the postulate by constructing a quadrilateral with two equal sides that are perpendicular to the base. By definition, the quadrilateral's upper angles must be equal. Khayyam hypothesized three possible scenarios: the angles are right angles, acute angles, or obtuse angles. He rejected the last two hypotheses because the Fifth Postulate says that converging lines must intersect. Although he dismissed the last two hypotheses, other mathematicians later speculated about non-Euclidean spaces where converging lines don't meet. Functions → Transcendental Functions → Exponential Functions → Exponential / Logarithmic Eq → Logarithmic / Exponential Models → Logistics Growth We have already looked at one model of population growth: exponential growth. This model is only accurate under extremely favorable conditions, when the population has abundant resources that it can use to expand itself. This model is generally only accurate within a controlled environment, such as a laboratory. The real world is a much harsher place. In the real world, an environment cannot support population growth indefinitely. Eventually an area will run out of food or water. To ensure the survival of the species, a population of animals automatically adjusts its growth rate so that it does not exceed the maximum capacity of the environment (humans tend to be the exception to this rule). The population initially grows exponentially just as it did before, but after awhile the growth rate declines. The model used to describe this growth pattern is known as the logistics curve given by the function: a y − ( x− c) = 1 + be logistics growth equation d where y is the population and x is the time. A graph of this curve (called a sigmoidal curve) is shown below: Functions Logistics Growth – 219 A conservation organization releases 200 animals of an endangered species into a game preserve. The organization believes that the preserve has a carrying capacity of 2000 animals and that the growth of the herd will be modeled by the logistics curve. p(t) = 2000 1 + 5e − 0.1643 t where t is measured in months. a) Graph the function. Use the graph to determine the horizontal asymptotes and interpret the meaning of the larger p-value in the context of the problem. b) Estimate the population after 6 months c) After how many months will the population be 1600? solution a) The graph of the function looks as follows: From the graph, it can be seen that there are horizontal asymptotes at y = 2000 and y = 0. These represent the boundaries for which the environment is viable. The larger p-value represents the maximum number of animals who can live in this environment. After the population reaches 2000, the environment is no longer able to support the population. b) The graph shows that after 6 months, the population has jumped from 200 to around 600 animals. 220 – Logarithmic Model Functions c) Use the logistics growth formula to find out when the population reaches 1600. 2000 substitute known quantity for p 750 = 750(1 + 5e–0.1643 t) = 2000 multiply both sides by (1 + 5e–0.1643 t) 1 + 5e–0.1643 t = 1.25 divide both sides by 750 5e–0.1643 t = 0.25 subtract 1 from both sides e–0.1643 t = 0.05 divide both sides by 5 ln e–0.1643 t = ln 0.05 take ln of both sides –0.1643t = ln 0.05 simplify using properties of ln t = t ≈ 1 + 5e − 0.1643 t ln( 0.05) −0.1643 18.23 months solve for t evaluate to two decimal places The population will reach 1600 around 18 months and 7 days from the initial time. Functions answer Logistics Growth – 221 222 – Logarithmic Model Functions In this section of the Algebra Brain, we will examine in detail several procedures we can do with functions. Arithmetic Combinations – Adding, subtracting, multiplying and dividing two functions by one another. This creates a new function whose domain consists of all values common to the domains of f and g. Composition of Functions – Using one function as the argument for another function. For example, if we have two functions f(x) = 2x and g(x) = x2, then f(g(x)) = f(x2) = 2(x2) = 2x2. Reflections in Coordinate Axes – Reflecting the graph of a function about the x- or y-axis. Very handy when trying to draw certain functions. If we know one half of the function, then we can use this property to draw the other half of the function. Vertical and Horizontal Shifts – Moving the graph of a function around in the coordinate plane. This is extremely important when analyzing conic sections. Functions Translations and Combinations – 223 224 – Translations and Combinations Functions Advanced To learn how to add, subtract, multiply and divide two or more functions. arithmetic combination – the sum, difference, product, or quotient of two or more functions. Nicole d'Oresme (c.1320-1382) was a French algebraist, physicist, philosopher, and theologian. Among his contributions to mathematics is the realization that exponents can be, and often are, negative integers or even irrational numbers. He used a graphical approach in his mathematical analysis which became a cornerstone for the development of analytic geometry in the 17th century by René Descartes. d'Oresme even understood basic properties of exponents, such as xm + xn = xm+n. Functions → Translations and Combinations → Arithmetic Combinations We can combine two functions by the operations of addition, subtraction, multiplication, and division. The result is a new function that is the sum/difference/product/quotient of the original two functions. For instance, if we have the functions f(x) = 3x2 – 2 g(x) = 2x + 1 and then the sum, difference, product, and quotient of f and g are as follows: f(x) + g(x) f(x) – g(x) f(x)g(x) f ( x) g ( x) = (3x2 – 2) + (2x + 1) add two functions together = 3x2 + 2x – 1 simplify sum of functions f and g = (3x2 – 2) – (2x + 1) subtract one function from another function = 3x2 – 2 – 2x – 1 Distributive Property = 3x2 – 2x – 3 simplify difference of functions f and g = (3x2 – 2)(2x + 1) multiply two functions together = 6x3 + 3x2 – 4x – 2 FOIL method product of functions f and g 2 = 3x − 2 2x + 1 x ≠ –1/2 divide one function by another function As always, we have to keep the domains of our given functions in mind when we combine them. When we divided f by g, for instance, we had to note that the value x = –1/2 is excluded from the domain because otherwise we have a division by zero, which is a big no-no in algebra. In other words, g(x) ≠ 0 when we divide two functions f and g. In more general terms, the domain of the arithmetic combination of functions consists of all those terms that are common to both f and g. If the domain of f is [–5, 3] and the domain of g is [–1, 7], Functions Arithmetic Combinations – 225 then the domain of any combination of f and g will be [–1, 3], because the numbers inside this range are common to both f and g. Sum, Difference, Product, and Quotient of Functions Let f and g be two functions with overlapping domains. Then, for all x common to both domains, the sum, difference, product, and quotient of f and g are defined as follows. Note: 1.) Sum: (f + g)(x) = f(x) + g(x) 2.) Difference: (f – g)(x) = f(x) – g(x) 3.) Product: (fg)(x) = f(x) · g(x) 4.) Quotient: = g(x) ≠ 0 While here we only list the sum/difference/product/quotient of two functions, these properties easily extend to three or more functions. That is, we can have m(x) = f ( x) + g ( x) − p ( x) q ( x) arithmetic combination of 5 functions k ( x) Find (a) (f + g)(x), (b) (f – g)(x), (c) (fg)(x), and (d) (f / g)(x). What is the domain of f / g? f(x) = x2 + 5 g(x) = 1–x solution a.) b.) c.) d.) f(x) + g(x) f(x) – g(x) f(x)g(x) f ( x) g ( x) = (x2 + 5) + (1 – x) add the two functions together = x2 – x + 6 write in standard form answer = (x2 + 5) – (1 – x) subtract g from f = x2 + x – 4 rewrite in standard form answer = (x2 + 5)(1 – x) multiply f and g together = x2 – x3 + 5 – 5x FOIL method = –x3 + x2 – 5x + 5 rewrite in standard form 2 = x +5 x−1 divide f by g answer The domain of f / g includes only those values of x such that g(x) ≠ 0. In this case, g(x) = x – 1, so that means we exclude x = 1 from our domain. Therefore, our domain for f / g is all real numbers except x = 1. answer 226 – Arithmetic Combinations Functions Advanced To learn how to take a function of a function. composition of functions – the function that results from first applying one function, then another; denoted by the symbol o. S. I. Ramanujan (1887-1920) was an Indian mathematician of great genius. He was also a devoutly religious man who was reluctant to travel because of certain religious restrictions. Ramanujan attributed his mathematical gifts to his household deity, the goddess Namagiri. He combined his passion for mathematics with his devout faith, once commenting to a friend, "An equation for me has no meaning unless it expresses a thought of God." Functions → Translations and Combinations → Composition of Functions Suppose we have the functions f(x) = x2 + 1 g(x) = x+1 and One way to combine these functions is to take a function of a function. That is, the function g(x) becomes the argument for the function f(x). We write this as: f(g(x)) take a function of a function When we substitute in g(x) = x + 1 inside the parentheses, we get: f(x + 1) = (x + 1)2 + 1 substitute in g(x) = x + 1 = x2 + 2x + 1 + 1 expand binomial = x2 + 2x + 2 simplify We call this ability to take a function of a function composition of functions, and denote it as f o g. Definition of Composition of Two Functions The composition of the function f with the function g is given by: (f o g)(x) = f(g(x)) composition of f with g Similarly, the composition of the function g with the function f is given by: (g o f)(x) = g(f(x)) composition of g with f The domain of (f o g) is the set of all x in the domain of g such that g(x) is in the domain of f. Note: Composition of function is not a commutative operation. That is, f(g(x)) Functions ≠ g(f(x)) composition is not commutative! Composition of Functions – 227 Also, we can extend this definition for more than two functions. Thus, we can find (f o g o h)(x) as: (f o g o h)(x) = f(g(h(x))) composition of three functions In such cases, we usually start with the inner function first and work our way to the outer function. Given f(x) = 2x – 3 and g(x) = x + 6, find: a.) (f o g)(2) b.) (g o f)(2) solution a.) To compose f with g, we find f(g(x)) and evaluate it for x = 2: (f o g)(2) b.) = f(g(2)) definition of composition of functions = f(2 + 6) substitute in g(2) = 2 + 6 = 2(8) – 3 let f(2 + 6) = 2(2 + 6) – 3 = 13 simplify answer Now we reverse the order of the composition and evaluate it for x = 2: (g o f)(2) = g(f(2)) definition of composition of functions = g(2(2) – 3) substitute in f(2) = 2(2) – 3 = g(1) simplify inside parentheses = 1+6 evaluate g(1) = 7 simplify answer Here is a classic example of the fact that (f o g)(x) ≠ (g o f)(x), thus proving the point that this is not a commutative operation. 228 – Composition of Functions Functions Advanced To learn how to reflect a graph of a function in the coordinate plane. No definitions on this page. The original discoverer of conic sections was Manaechmus, a fourth century B.C. Greek geometer. He used simple properties of the parabola and the rectangular hyperbola to solve the problem of the mean proportionals. It is unknown exactly how he proved that the curves were conics. Fifty years after his time, a considerable amount of work had been done on the conics. At that time, the parabola, the ellipse, and the hyperbola were known, respectively, as the sections of a right-angled, acute-angled, and obtuse-angled cone. Later it was shown that all three conics could be derived from the same cone. Functions → Translations and Combinations → Reflections in Coordinate Axes One of the most common types of ways of transforming a function is to reflect the function about either the xor the y-axis. This means that the function on one side of the axis is a mirror image of the function on the other side of the axis. There are two ways of transforming a function so that it has a reflection. To reflect a function about the x-axis, we let our new function h(x) = –f(x). Suppose we let f(x) = x2, then h(x) = –x2. When we graph f(x) and h(x) in the same coordinate plane, we obtain the following graph: To reflect the function f about the y-axis, we let h(x) = f(–x). That is, we replace x in the function f with –x. Suppose we have f(x) = x3, then h(x) = f(–x) = (–x)3. As a result, when x = –1, we have h(–1) = (–(–1))3 = (1)3 = 1. That is, when x is a negative number, we obtain a positive result. If we graph f(x) = x3 and h(x) = (–x)3 on the same coordinate plane, we obtain the following graph: Functions Reflections in Coordinate Axes – 229 To summarize, we can use the following guidelines for reflecting functions along the axes: Reflections in the Coordinate Axes Reflections in the coordinate axes of the graph of y = f(x) are represented as follows: 1.) Reflections in the x-axis: h(x) = –f(x) 2.) Reflections in the y-axis: h(x) = f(–x) 230 – Reflections in Coordinate Axes Functions Advanced To learn how to move the graph of a function around in the Cartesian plane. No definitions on this page. The Greek word arithmetike—derived from the Greek word arithmos, meaning "number"—does not refer to calculation. Instead it denotes theoretical, speculative thinking about numbers. Nichomachus of Gerasa (c.60c.100 A.D.) discusses numbers in his work Introduction to the Arithmetic. He introduces various kinds of numbers, such as odd, even, prime, and perfect. He considered them to have human-like qualities, like goodness. One theorem he developed involves grouping positive odd integers like so: 1; 3 + 5; 7 + 9 + 11; 13 + 15 + 17 + 19; etc. Nichomachus discovered the sum of each grouping is equal to the cubed integers: 13 = 1; 23 = 3 + 5; 33 = 7 + 9 + 11; 43 = 13 + 15 + 17 + 19; and so forth. Functions → Translations and Combinations → Vertical and Horizontal Shifts Suppose we have the simple function of f(x) = x. This is a diagonal line of slope 1 that passes through the origin. If we want to move the line up along the y-axis by 1 unit, then we simply add 1 to the overall function to obtain a new function: f(x) = x original function g(x) = x+1 add 1 to our original function = f(x) + 1 substitute in x = f(x) The net result is that we have shifted the graph of f(x) upward by 1 unit. If we wanted to move the graph of f(x) downward by 1 unit, then logic would suggest that we subtract 1 from f(x): h(x) = f(x) – 1 subtract 1 from f(x) = x–1 substitute in f(x) = x This behavior holds true even if we have more complicated functions, such as f(x) = x3 – 3x2 + 4x – 1. In order to move the graph of this function up 1 unit, we add 1 to the overall function: g(x) = f(x) + 1 add 1 to f(x) = (x3 – 3x2 + 4x – 1) + 1 substitute in f(x) = x3 – 3x2 + 4x – 1 = x3 – 3x2 + 4x simplify Now, what if we want to shift the graph to the right or to the left? How do we go about doing that? Suppose we have k(x) = x2. This is just a parabola centered on the origin. To move the graph to the right by 1 unit, we replace x with (x – 1) to obtain a new function: j(x) Functions = k(x – 1) substitute in x – 1 for x = (x – 1)2 graph of k shifted 1 unit to the right Vertical and Horizontal Shifts – 231 In order to move the graph of k(x) = x2 to the left by 1 unit, we reverse the sign, replacing x with (x + 1): q(x) = k(x + 1) substitute in x + 1 for x = (x + 1)2 graph of k shifted 1 unit to the left We have a few rules we can use to tell us how the graph of a function f is shifted. Vertical and Horizontal Shifts Let c be a positive real number. Vertical and horizontal shifts in the graph of y = f(x) are represented as follows: 1.) Vertical shift c units upward: h(x) = f(x) + c 2.) Vertical shift c units downward: h(x) = f(x) – c 3.) Horizontal shift c units to the right: h(x) = f(x – c) 4.) Horizontal shift c units to the left: h(x) = f(x + c) We can combine these guidelines to move graphs of functions in both the horizontal and vertical directions. Sketch on the same set of coordinate axes a graph of f for c = –2, 0, 2. f(x) = (x – c)3 + c solution Notice that f contains the terms (x – c), which means that we shift the graphs 3 units to the right or left (depending on the value of c). Also notice that we add c to this term, which means we move the graph up or down, again depending on the value of c. Thus, we have three functions: f1(x) f2(x) f3(x) = (x – (–2))3 + (–2) substitute in c = –2 = (x + 2)3 – 2 simplify function is shifted 2 units left and 2 units down = (x – 0)3 + 0 substitute in c = 0 = x3 simplify function is not shifted at all, centered on the origin = (x – 2)3 + 2 substitute in c = 2 function is shifted 2 units right and 2 units up The graphs of these functions look like: 232 – Vertical and Horizontal Shifts Functions answer Notice that while we move the function around the Cartesian plane, we do not change its overall shape. The group of function represented on the graph above is called a family of functions. You will study families of functions in much more detail when you study integration in calculus. Functions Vertical and Horizontal Shifts – 233 234 – Vertical and Horizontal Shifts Functions Advanced To define inverse functions. inverse function – the functions f and g are inverses of each other if f(g(x)) = x in the domain of g and g(f(x)) = x for all x in the domain of f. George Berkeley (1685-1753)—Irish philosopher, Anglican cleric, and mathematician—is known for his rejection of contemporary mathematics for being unclear and logically deficient. He is most famous for attacking Isaac Newton's infinitesimal calculus. He accepted the fact that Newton's calculations provided true results, but he believed the reasoning upon which those calculations were based was flawed. This critique inspired other mathematicians to develop a logical clarification of calculus. Functions → Inverse Functions Recall that one way we can represent a function is as a set of ordered pairs. Suppose we have the function f(x) = x + 3 for values of x from the set A = {1, 2, 3, 4}. This function maps onto the set B = {4, 5, 6, 7}. We can write this as follows: f(x) = x+3 {(1, 4), (2, 5), (3, 6), (4, 7)} Now, if we interchange the first and second coordinate of each of the ordered pairs, we form the inverse function of f, which we denote by f –1 (read as "f-inverse"; this should not be confused as being an exponent). In essence we have a function mapped from the set B to the set A, which we write as follows: f –1 = x–3 {(4, 1), (5, 2), (6, 3), (7, 4)} Notice that the domain of f (the first coordinate) is equal to the range for f –1, and vice versa, as shown below: Also notice that the functions f and f –1 have the effect of "undoing" each other. That is, we can use f(x) to "undo" f –1(x) and vice versa. Suppose we form the composition of f and f –1. Then we have: f(f–1(x)) Functions = f(x – 3) composition of functions = (x – 3) + 3 substitute in (x – 3) for x in the function f(x) = x simplify Inverse Functions – 235 Similarly, if we form the composition of f –1 and f, we obtain: f –1(f(x)) = f –1(x + 3) composition of functions = (x + 3) – 3 substitute in (x + 3) for x in the function f –1(x) = x simplify In general, for two functions to be inverses, both compositions of the functions must return a value of x. More specifically, we can define an inverse function as follows: Definition of the Inverse of a Function Let f and g be two functions such that f(g(x)) = x for every x in the domain of g g(f(x)) = x for every x in the domain of f and Under these conditions, the function g is the inverse of the function f. The function g is denoted by f –1 (read "f-inverse"). Thus: f(f –1(x)) = x f –1(f(x)) = x and The domain of f must be equal to the range of f –1 and the range of f must be equal to the domain of f –1. Note: Do not be confused by the use of –1 to denote the inverse of the function f. Whenever we write f –1, we are always referring to the inverse function, not the reciprocal of f(x). Show that f and g are inverses of each other. f(x) = 1 – x3 g(x) = 3 1−x solution To demonstrate that they are inverses, we form the compositions f(g(x)) and g(f(x)) and see if they both equal the identity function x: f(g(x)) ( 3 1 − x) composition of f with g = f = 1− = 1 – (1 – x) simplify radical/exponent = 1–1+x Distributive Property = x check 236 – Inverse Functions ( 3 1 − x) 3 substitute in for x Functions Now we form the composition of g with f: g(f(x)) = g(1 – x3) = 3 = 3 = 3 = x ( composition of g with f 1− 1−x 1−1+ x x 3 3 3 ) substitute in for x Distributive Property simplify under the radical sign simplify check Since we obtain the identity function x when we form both compositions, we conclude that these functions are inverses of each other. answer Functions Inverse Functions – 237 238 – Inverse Functions Functions Advanced To find the inverse of a function. No definitions on this page. The curve known as the "Witch of Agnesi" is named after Marie Gaëtann Agnesi (1718-1798), an Italian algebraist, geometer, linguist, and philosopher. The "Witch" part of the name derives from a confusion of terms. The Italian word versiera (to turn) somehow became associated with the Italian word avversiera, which means "devil's wife", or "witch". The curve itself is the graph of the Cartesian equation y(x2 + a2) = a3, where a is a constant. Oddly, the graph resembles the point hat of a witch, possibly another reason for its name Functions → Inverse Functions → Finding Inverse Functions In the previous thought, we introduced you to the idea that some functions have an inverse. We also showed you how to demonstrate that two functions are inverses of each other. However, we did not give you any means for finding the inverse of a function if it has one. For simple functions, we can find inverses by inspection. For instance, if we have f(x) = x + 3, then the obvious inverse of this function is f –1(x) = x – 3. Below we list some guidelines for finding the inverse of a given function. The key step in these guidelines is Step 2—interchanging the roles of x and y. The reason why we do this is because of the fact that inverse functions have ordered pairs with the coordinates reversed—i.e. the domain of a function f is the range of the function f –1. Guidelines for Finding the Inverse of a Function Note: 1.) In the equation for f(x), replace f(x) with y. 2.) Interchange the roles of x and y. 3.) If the new equation does not represent y as a function of x, the function f does not have an inverse function. If the new equation does represent y as a function of x, solve the new equation for y. 4.) Replace y with f –1(x). 5.) Verify that f and f –1 are inverses of each other by showing that the domain of f is equal to the range of f –1, the range of f is equal to the domain of f –1, and f(f –1(x)) = x = f –1(f(x)). In Step 3, above, it is possible that a function has no inverse. For instance, the function f(x) = x2 has no inverse function for all values of x in the domain of f. However, if we restrict the domain of f to only positive values of x, then the function f(x) = x2 does have an inverse. Knowing the domain and ranges of our given functions is extremely important in determining if they have inverses or not. Find the inverse. f(x) Functions = 2x + 3 7 Finding the Inverse – 239 solution We use the guidelines given above. 1.) 2.) First, we replace f(x) with y: f(x) = y = 2x + 3 2y + 3 = interchange x and y 7 This new equation does represent y as a function of x, the function f does have an inverse function, so we solve the equation in Step 2 for y: 2y + 3 x = 7x = 2y + 3 multiply both sides by 7 7x – 3 = 2y subtract 3 from both sides = y divide both sides by 2 2 equation from Step 2 7 Now we replace y with f –1(x). f –1(x) 5.) replace f(x) with y 7 7x − 3 4.) given equation 7 Next, we interchange x and y: x 3.) 2x + 3 7x − 3 = replace y with f –1(x) answer 2 Finally, we notice that both f and f –1 have domains and ranges that consist of the entire set of real numbers. That is, the range of f is the set of all real numbers, and the domain of f –1 is the set of all real numbers. We also need to double check that f(f –1(x)) = x = f –1(f(x)). f(f –1(x)) = f ⎛⎜ 7x − 3 ⎞ 2⎛⎜ 7x − 3 ⎞ ⎝ ⎝ = ⎟ ⎠ 2 ⎟+3 ⎠ 2 composition of f with f –1 evaluate f(f –1(x)) 7 ( 7x − 3) + 3 = 7 7x = = 7 x 240 – Finding the Inverse cancel common factors combine like terms in numerators cancel common factors check Functions We also find f –1(f(x)): f –1(f(x)) f = − 1⎛ ⎜ ⎝ 7⎛⎜ ⎝ = 2x + 3 ⎞ ⎟ ⎠ 7 7 2x + 3 ⎞ ⎟−3 ⎠ 7 composition of f –1 with f evaluate f –1(f(x)) 2 ( 2x + 3) − 3 = 2 2x = = cancel common factors combine like terms in numerators 2 cancel common factors check x Since both compositions give us the identity function x, we feel confident that we have indeed found the inverse of the function f. Find the inverse of the function g. g(x) = x3 + 2 solution Here we will move a bit more quickly than on the last example. First, we substitute in y for g(x), then we switch the variables x and y. We solve the equation for y, and then substitute in g–1(x) for y. We will leave the process of double-checking the inverse up to you to do on your own. g(x) = x3 + 2 given equation y = x3 + 2 substitute in y = g(x) x = y3 + 2 switch the variables x and y x–2 = y3 subtract 2 from both sides 3 x−2 = y take cube root of both sides 3 x−2 = g–1(x) replace y with g–1(x) answer Notice that the domain of g(x) is the range of g–1(x), namely both are the set of all real numbers. Similarly, the range of the function g(x) is the domain of g–1(x) and both are again the set of all real numbers. Functions Finding the Inverse – 241 242 – Finding the Inverse Functions Advanced To determine if a function has an inverse by applying the Horizontal Line Test. No definitions on this page. al-Battani (858?-929), also known by his Latinized names of Albatenius, Albategnius, and Albategni, was a prominent Arab astronomer and trigonometer. His innovative use of trigonometric methods to perform astronomical calculations led to much greater accuracy in his measurements. He determined the length of the solar year to be 365 days, 5 hours, 48 minutes, and 24 seconds, very close to the value we use today. He also applied trigonometric techniques to astrology, as well as astronomy. Although scientists today have little or no use for astrology, to the ancient Arabs, astrology was an integral part of everyday life. They firmly believed the positions of the stars and planets had a direct effect on people's lives. Functions → Inverse Functions → Horizontal Line Test The graphs of inverse functions are related to one another. This should not come as too big a surprise. Remember that one definition of inverse functions involves swapping the order of their coordinate points. For example, if the function f contains the point (a, b), then its inverse f –1 contains the point (b, a) and vice versa. As a result, the graph of f –1 is a reflection of the graph of f about the line y = x, as indicated by the diagram below: In the thought Finding the Inverse, we told you that some functions do not have an inverse. The classic example of a function that does not have an inverse is the function f(x) = x2. The reason why this function has no inverse has to do with the basic definition of a function. If we let x be any real number, then we will have elements from the set of the domain mapping onto the same element in the set of the range. For example, if we let x1 = 2 and x2 = –2, then f(x1) = 4 and f(x2) = 4. In order to have an inverse, we reverse the order of the coordinates. But then we have one element of the domain of f –1 pointing to two different elements in its range, which is not allowed according to the basic definition of a function. However, if we restrict the domain of f to include only nonnegative values of x, then the function does have an inverse. We can use a geometric means to find the inverse of a function. The reflective property of the graphs of inverse functions gives us a nice test we can utilize to determine whether or not a function has an inverse. We call this remarkably simple test the Horizontal Line Test. Functions Horizontal Line Test – 243 Horizontal Line Test A function f has an inverse function if and only if no horizontal line intersects the graph of f at more than one point. One of the disadvantages of this test is that there is no way for you do determine what the inverse actually is. It is an existence theorem because it tells us only if an inverse function exists or not. Use a graphing utility to graph the function and use the Horizontal Line Test to determine whether the function has an inverse. g(x) = −2x 16 − x 2 solution We use MathCAD 2000® to draw our graphs, but you are free to use whatever tool you prefer. The graph of g(x) is shown below. The blue line represents the application of the Horizontal Line Test. Since the line clearly intersects the graph at more than one point, we conclude that this function fails the Horizontal Line Test and hence has no inverse. answer 244 – Horizontal Line Test Functions Unit 4 Quadratics Quadratics – 245 246 – Quadratics General Information about Quadratic Equations Besides linear equations, the other type of equation that shows up most often in algebra are the quadratic equations. In most general terms, a quadratic equation in x is one that takes the form ax2 + bx + c = 0 quadratic equation in x where a, b, and c are all real numbers and a ≠ 0. If a = 0, then the first term goes to zero and we are left with a linear equation. This equation is also known as a second-degree polynomial equation in x. While we have stated that the general form has one side equal to zero. If we allowed one side to be another variable, such as y, then we have what is known as a quadratic function in x because the value for y is entirely dependent on the value for x: y = ax2 + bx + c quadratic function On a graph, this function can be described by a parabolic curve. Note that only the variable x is raised to the second power. We will study quadratic equations in detail, learning the four basic methods of solving a quadratic equation: 1. Factoring 2. Completing the Square 3. Extracting Square Roots 4. The Quadratic Formula (the granddaddy of all algebraic formulas) Generally, we try to factor first. If that fails, then we try to complete the square. If we can neither factor nor complete the square, then we apply the Quadratic Formula. Extracting square roots is only reserved for special occasions. Conic Sections If we raise the variable y to the second power as well as x, then we end up with what is called a conic section. A conic section is what happens when we intersect a cone with a plane. We will examine these in intense detail in the appropriate section of the Algebra Brain. There are four basic conic sections: 1. Parabola 2. Ellipse 3. Circle (yes, that's right, a circle) 4. Hyperbola (a what?) The parabola is a fairly common shape. Throw a baseball across a field and it will travel in a parabolic path. An ellipse is also a fairly common shape. The planets all travel in elliptical orbits. The circle is by far one of the most recognizable shapes. Coins are circular, compact discs are circular, pens can be circular in crosssection, pipes, wires, and all kinds of other things are made to be circular. By far the most obscure conic section is the hyperbola. However, it does play an important role in the motions of comets and asteroids, as well as in long-distance communication. Quadratics – 247 248 – Quadratics Advanced To learn the basic definition of a quadratic equation and some applications of how they are used in the real world. No definitions on this page. The ancient Chinese had a simple process for finding the square root of a number. It was derived from the geometrical division of a square into smaller areas. The offshoot of this method was a solution to the quadratic equation of the form x2 + bx = c, when b and c are positive. This is all explained in detail in the Chinese work Jin zhang shuan shu (Nine Chapters on the Mathematical Art). Quadratics → Quadratic Equations Next to linear equations, quadratic equations are among the major emphasis areas of algebra. Indeed, much of algebra was developed specifically to help solve problems dealing with quadratic equations. In particular, quadratic equations have a wide range of applications in physics, architecture, business, finance, and even warfare (although usually that involves physics). We can define a quadratic equation in standard form as follows: Definition of a Quadratic Equation A quadratic equation in x is an equation that can be written in the standard form ax2 + bx + c = 0 quadratic equation in standard form where a, b, and c are all real numbers and a ≠ 0. This is also known as a second-degree polynomial equation in x. Under this area of the Algebra Brain, we will look at four main methods for solving quadratic equations. Each method only works if we have a quadratic equation in standard form! If we have a quadratic equation that is not in standard form, then we need to somehow transform it using equivalent equations so that it becomes a quadratic equation in standard form. For example, if we have the equation 4x2 – 5x + 3 = –2x example of quadratic equation not in standard form then we would add 2x to both sides, thus the right hand side becomes equal to 0 and all the terms are then on the left hand side: 4x2 – 5x + 3 = –2x 4x2 – 5x + 3 + 2x = –2x + 2x add 2x to both sides 4x2 – 3x + 3 = 0 combine like terms quadratic equation in standard form Once we have the equation in standard form, we can then apply a variety of techniques to solve the equation for x. In this particular example, we would probably head straight for the Quadratic Formula as this equation does not factor easily, nor does it really lend itself to completing the square, nor is it in an acceptable form for extracting square roots. As it turns out, this equation does not contain any real roots. We can solve it using the method of completing the square, but this method is actually more tedious than the Quadratic Formula in this instance. Thus, after applying the Quadratic Formula, we discover that the two roots of this equation are: Quadratics Quadratic Equations – 249 x = x = 3 8 3 8 + ⎛ 39 ⎞ i ⎜ ⎟ ⎝ 8 ⎠ and − ⎛ 39 ⎞ ⎜ ⎟i ⎝ 8 ⎠ solutions to the quadratic equation One of the most important skills required for solving quadratic equations is the ability to recognize which technique to use. As with any mathematical skill, it only comes through practice, practice, and more practice. Let's look at a real life example of how quadratic equations work. On a particular university campus, there is a square courtyard covered in lush green grass. Despite the fact that there are "DON'T WALK ON THE GRASS" signs posted all around the grass, students still insist on walking on the grass in order to cut down the time it takes to get to their classes. The path they generally follow across the grass covers a distance of 47 feet. The path they travel is shown in the diagram below: How many feet does a student save by walking across the lawn instead of walking on the sidewalk? solution Based on the figure above, we let x be the length of the shorter path around the grass and then we use a ruler on our diagram to discover that the longer path is twice the length of the shorter path, which we can represent with 2x. Since we have a right triangle, we apply the Pythagorean Theorem to get: x2 + (2x)2 = 472 Pythagorean Theorem x2 + 4x2 = 2209 simplify 5x2 = 2209 combine like terms x2 = 441.8 divide both sides by 5 x = x ≈ 250 – Quadratic Equations 441.8 21.02 extract positive square root (see thought on Extracting Square Roots) approximate value of x to nearest hundreth Quadratics The total distance on the sidewalk is: x + 2x = 3x total length of path around grass = 3( 441.8) substitute in for x ≈ 63.06 feet simplify, rounding to nearest hundreth Cutting across the grass saves the student having to walk about 63.06 – 47 = 16.06 feet. answer Note: Eventually the university became so tired of students traipsing across their nice green grass that they finally put a sidewalk down where the students were actually walking, instead of where the university wanted the students to walk. Quadratics Quadratic Equations – 251 252 – Quadratic Equations Quadratics Advanced To solve quadratic equations by factoring. No definitions on this page. The Englishman Thomas Harriot (1560-1621) is credited with developing much of the early work in solving quadratic equations by factoring and applying the Zero Product Property, although others may have applied the method before him. Harriot applied his method to polynomials of higher degree. By the time Carl Gauss developed his Fundamental Theorem of Algebra two hundred years later, Harriot's ideas were already wellestablished in the mathematical community. He was also prominent in astronomy, being one of the first to discover sunspots. Quadratics → Quadratic Equations → 1. Factoring Quadratic equations can sometimes be factored into the product of two linear expressions. If we are able to manage this, then solving a quadratic equation becomes very simple. First of all, we need to write the quadratic equation in standard form: ax2 + bx + c = 0 standard form of quadratic equation If a quadratic equation in standard form can be factored, we make use of the Zero-Factor Property to find the solutions to the quadratic equation. Recall that the Zero-Factor Property states that if the product of two factors equals zero, then either one of the factors is zero or both of the factors is zero. Therefore, we set each of the linear factors of the quadratic equation equal to zero and solve for x. The values that we find for x are the solutions to the quadratic equation. It is easier to see this in practice than it is to explain it. Solve 2x2 + 9x + 7 = 3 for x. solution First of all, we notice that this is not in standard form, so we have to tweak the equation a bit: 2x2 + 9x + 7 = 3 given equation 2x2 + 9x + 4 = 0 subtract 3 from both sides to get it into standard form Now that it is in standard form, we want to factor the equation: 2x2 + 9x + 4 = 0 standard form (2x + 1)(x + 4) = 0 factored form Once we have this equation in factored form, we note that the only way the left hand side can be equal to the right hand side is if one or the other of the factors is equal to zero. We set each of the factors equal to zero and solve for x. Then each solution is one of two possible solutions for this equation. Quadratics Factoring – 253 2x + 1 = 0 set 1st factor equal to zero 2x = –1 subtract 1 from both sides x = − x+4 = 0 set 2nd factor equal to zero x = –4 subtract 4 from both sides the other solution 1 2 divide both sides by two one solution Thus we have two possible solutions: x = –1/2 and x = –4. answer Check these solutions in the original equation. 254 – Factoring Quadratics Advanced To solve quadratic equations by extracting the square roots. extracting square roots – the process of solving a quadratic equation by isolating the x2 term on one side, constant terms on the other side, and then finding the square root of both sides. Robert Recorde (1510-1558) did more than simply invent the equals sign, "=". He established the English school of mathematical writers. He was the first English writer on arithmetic, geometry, and astronomy. Many of Recorde's writings were in poetic form to help students remember the rules of operation. Because he published his work in English, he had only limited circulation in the rest of Europe, which used Latin and/or Greek as the standard language for mathematics. Finally, in his work, The Whetstone of Witte, he explains how to extract the square root. Quadratics → Quadratic Equations → 2. Extracting Square Roots Sometimes we find we have a very simple quadratic equation of the form u2 = d where u is an algebraic expression and d > 0. We can use factoring to solve equations of this type. First get the equation into standard form: u2 = d quadratic equation u2 – d = 0 subtract d from both sides standard form While it may not be immediately obvious, the expression u2 – d is actually the difference of two perfect squares. Thus, we can quickly factor this expression as: u2 – d = 0 standard form ( u − d)( u + d) = 0 factor difference of two squares Then we set each factor equal to zero and solve for u: u− d = u = u+ d = 0 set 2nd factor equal to zero u = − d solve for u 0 set 1st factor equal to zero d solve for u Since the two solutions only differ in sign, we can compact the solutions into one expression, using a "plus or minus sign" ( ): u Quadratics = u is equal to plus or minus the square root of d Extracting Square Roots – 255 In practice, we can avoid going through all the steps above and skip to the end. Solving an equation of the form u2 = d without going through the process of factoring is extracting square roots . Extracting Square Roots The equation u2 = d, where d > 0, has exactly two solutions: u = and u = The solutions can also be written as: u = Solve the following equations by extracting the square roots: a.) 5x2 = 15 b.) (x – 12)2 = 18 solution a.) b.) Note: While we could go through the business of factoring, it is simpler to try and extract the square roots. First, we need to isolate x2 by itself. Then we will take the square root of both sides to obtain our solutions for x: 5x2 = 15 given equation x2 = 3 divide both sides by 5 x = take square root of both sides answer Here we have u = (x – 12)2. However, the process is exactly identical, except that we have to go through one extra step to solve this equation for x. (x – 12)2 = x – 12 = take square root of both sides x = add 12 to both sides answer 18 given equation While we can simplify 18 to 3 2, there is no real good reason to do so in the context of this problem. 256 – Extracting Square Roots Quadratics Advanced To solve quadratic equations by completing the square. No definitions on this page. The Hindu-Arabic number system was introduced into Europe in the 12th century via Latin translations of Arabic texts written by al-Khwarizmi. Among the contents of these works are a "new" method of calculation that requires repeated erasure and shifting of numerals as the calculation is performed. It was called "dust-board" calculation because the easiest way to do it was with a piece of slate and a lump of chalk - tools still used in classrooms today in the form we know as the chalkboard. Quadratics → Quadratic Equations → 3. Completing the Square Every so often, we find ourselves confronted with a quadratic equation that is neither factorable nor are we able to extract any square roots from it. Or can we? Consider the equation x2 – 6x + 2 = 0 We cannot factor this equation at all. Nor can we take the square root of both sides. How can we possibly find the solutions to such an equation? The answer lies in transforming the equation into an equivalent equation that we can solve. First, let's subtract 2 from both sides and see what happens: x2 – 6x + 2 = 0 original equation x2 – 6x = –2 subtract 2 from both sides What we want to do is add some number to both sides such that the left hand side will become a perfect square trinomial. In other words, we want the left hand side to look like (x – a)2 where a is some real number. Remember that we have to add the same number to both sides of the equation to maintain equality. What if we add the number 9 to both sides? x2 – 6x + 9 = –2 + 9 add 9 to both sides (x – 3)2 = 7 factor left hand side simplify right hand side Once we have the equation in the form just given, we can now apply the method of Extracting Square Roots to find the solutions to this equation: x = solutions to equation How did we know to add 9 to both sides of the equation? We looked at the coefficient of the x-term. We know that when we square a linear factor of the form (x – a), the middle term of the resulting trinomial is given by 2a and the last term becomes a2. In our example above, we took the number 6 (coefficient of x), divided it by 2 to get 3 and then squared the 3 to get 9. This process is formally known as Completing the Square. Quadratics Completing the Square – 257 Completing the Square To complete the square for the expression x2 + bx add (b/2)2, which is the square of half the coefficient of x. Consequently = Note: completing the square If you add a quantity to one side of an equation in order to complete the square, remember to add the same quantity to the other side of the equation. Solve the quadratic equation by completing the square. x2 + 8x + 8 = –6 solution First, group all the constant terms onto one side of the equation. Then add the square of half the coefficient of b, which in this case means we divide 8 (coefficient of x-term) by 2 and square it to get 16. Then we convert the polynomial to factored form and extract the square roots. Note: x2 + 8x + 8 = –6 original equation x2 + 8x = –14 group all constants on right hand side of equation x2 + 8x + (4)2 = –14 + (4)2 add (half of 8)2 (x + 4)2 = –14 + 16 perfect square trinomial on left hand side (x + 4)2 = 2 simplify x+4 = take square root of both sides x = subtract 4 from both sides answer In this example, the leading coefficient is 1. In the eventuality that the leading coefficient is not 1, you must divide both sides of the equation by the leading coefficient before completing the square. For instance, to complete the square for 5x2 – 2x + 7 = 0, first divide each of term by the leading coefficient 5: 2 x − 2 5 x+ 7 5 = 0 divide each term by leading coefficient 5 and then proceed as in the example above. 258 – Completing the Square Quadratics Advanced To use the Quadratic Formula to solve quadratic equations. Quadratic Formula - an important formula that can be used to solve any quadratic equation for the independent variable. imaginary number - a number that is the square root of a negative real number. Girolamo Cardano was an Italian physician, mathematician, and astrologer. His book, Ars Magna (published in 1545), is considered a fundamental work in the history of algebra. He was also one of the first mathematicians to work extensively with imaginary numbers, which he called 'fictitious numbers'. Quadratics → Quadratic Equations → 4. Quadratic Formula One of the most important formulas used in algebra is the infamous Quadratic Formula. With this little handy, dandy formula, we can solve any and all quadratic (second-degree) polynomial equations. This shortcut for solving quadratics results from another technique used to solve quadratics, namely Completing the Square. However, instead of using the Completing the Square technique on an individual quadratic equation, we can use it once in a general setting, thus allowing us to apply the result to any quadratic equation. Let's see how this works. Consider the following general second-degree polynomial: ax2 + bx + c = 0 general second-degree polynomial Now we wish to solve this equation for x. From the Fundamental Theorem of Algebra and the Linear Factorization Theorem, we know that we will end up with two roots. ax2 + bx + c = 0 given equation ax2 + bx = –c subtract c from both sides = − = − 2 x + 2 x + b a x+ b a x ⎛ b⎞ ⎜ ⎟ ⎝ 2a ⎠ 2 = c + a ⎛ b⎞ ⎜ ⎟ ⎝ 2a ⎠ b − 4ac 4a 2 2 complete the square, remembering to add the same quantity to both sides convert the left-hand side to its factored form simplify the right-hand side 2 b = 2a b − 4ac 4a x Quadratics divide both sides by a a 2 2 ⎛x + b ⎞ ⎜ ⎟ 2a ⎠ ⎝ x+ c = 2 take the square root of both sides isolate the x-term Quadratic Formula – 259 x = simplify to find the solutions for x x = since the quantity 2 | a | is the same as 2a, we can omit the absolute value sign and simplify Quadratic Formula The Quadratic Formula The solutions of a quadratic equation in standard form ax2 + bx + c = 0 a≠0 are given by the Quadratic Formula: x Note: = Quadratic Formula It is sometimes helpful to remember how to say this formula in plain English: "Negative b, plus or minus the square root of b squared minus 4ac, all divided by 2a." Solutions of a Quadratic Equation The solutions of a quadratic equation ax2 + bx + c = 0, a ≠ 0, can be classified as follows: Note: 1. If the discriminant b2 – 4ac is positive, then the quadratic equation has exactly two distinct real solutions. 2. If the discriminant b2 – 4ac is zero, then the quadratic equation has exactly one repeated solution. 3. If the discriminant b2 – 4ac is negative, then the quadratic equation has no real solution. If the discriminant is negative (as in classification 3 above), then its square root is imaginary and the Quadratic Formula yields two complex solutions. 260 – Quadratic Formula Quadratics The ancient Greeks first discovered conic sections over 2300 years ago. However, they were mostly interested in the geometric properties of conics. It wasn't until Newton and his chums rediscovered conics in the seventeenth century that conic sections began making their appearance in calculus. There are three main ways of defining a conic section. The first, the geometric interpretation known to the Greeks, is a plane intersecting a double-napped cone. There are actually seven conic sections, but three of them are known as degenerate conics because they don't really resemble a conic. The degenerate conics, namely a point, a line and two intersecting lines, are formed when the intersecting plane passes squarely through the vertex of the cone. The other four, namely the familiar conics of the parabola, the circle, the ellipse, and the hyperbola, are all formed by an intersecting plane that does not pass through the vertex. Geometric Interpretation of Conic Sections A second way of defining conic sections is the algebraic method of representing a conic section by a seconddegree polynomial of the form: Ax2 + Bxy + Cy2 + Dx + Ey + F = 0 general second-degree polynomial equation Finally, the third and potentially most useful definition is by defining each conic section by a locus of points satisfying some geometric property. For example, a circle is the locus, or collection, of all points (x, y) equidistant from some fixed point (h, k) in a plane. Using this locus definition, we obtain the standard equation for a circle: (x – h)2 + (y – k)2 = r2 standard equation of circle We use similar definitions for the other conic sections. A parabola is defined as the collection of points that are equidistant from both the focus and a fixed line. We will discuss each definition in much more detail under the relevant thought. Conic sections have a wide range of applications, particularly in physics and engineering. For example, a satellite antenna used to pick up television signals is in the shape of a parabola because a parabolic shape reflects all incoming (parallel) signals so that they pass through the focus of the parabola. Thus, an amplifier located at the focus can take all the incoming signals and "boost" the signal so that you can watch television. A flashlight works on the same principle in reverse. The bulb emits beams which are reflected by the parabolic reflector to travel parallel outwards, thus the flashlight is able to shine more light on a surface than if it had no reflector. Ellipses are the normal shape of one object orbiting another in space (actually all the conic sections are possible, but only an ellipse will form a true orbit). Ellipses are also used in architecture, such as bridges and other arched objects. Hyperbolas are often used in long-range detections systems such as radar. Quadratics Conic Sections – 261 262 – Conic Sections Quadratics Advanced To learn the algebraic definition of a circle. circle – a plane curve consisting of all points at a given distance from a fixed point in the plane. center – the point in the plane from which all other points on the circle are defined. radius – the distance between the center and a point on the circle. While it is relatively simple to show that x2 + y2 = r2, mathematicians have long debated whether or not a solution can be found if '2' is replaced by any larger natural number. Pierre de Fermat (1601-1665) stated emphatically that there are no positive whole numbers that solve xn + yn = zn when n is a natural number greater than 2. However, he left no proof for his statement. With the advent of computers, we have been able to show that Fermat's Last Theorem holds for at least the first 4 million natural numbers. Quadratics → Conic Sections → Circles Standard Equation of a Circle This is the first sort of conic section usually taught in algebra. Most people are already familiar with the shape of a circle. It is a perfectly round shape that can be described as a polygon with only one side or infinitely many sides, depending on how you look at it. Now we want to look at a circle in algebraic terms. Algebraically speaking, a circle is defined as the set of all points in a plane that lie an equal distance from a fixed point elsewhere in the plane. The fixed point is called the center of the circle. The distance from the center to the set of equidistant points is called the radius. Consider the diagram of a circle shown below. A point (x, y) is on the circle if and only if its distance from the center (h, k) is some fixed quantity r. If we move the point (x, y) along the circle, its distance from the center will always be r. The Distance Formula, which defines the distance between any two points in the plane, gives us the quantity r: = Quadratics r Distance Formula Circles – 263 Now, we don't like looking at radical signs, so we will get rid of the radical sign by squaring both sides of the Distance Formula. The result is the standard form of the equation of a circle: Standard Form of the Equation of a Circle The point (x, y) lies on the circle of radius r and center (h, k) if and only if (x – h)2 + (y – k)2 = r2 standard form of the equation of a circle For a circle centered on the origin, the point (h, k) = (0, 0). The standard form of a circle whose center is at the origin is: x2 + y2 Note: = r2 standard form of the equation of a circle whose center is the origin If we have the expression (x + h)2, then this is equivalent to (x – (–h))2, which indicates that the xcoordinate of the center lies to the left of the y-axis. A similar fact holds for the expression (y + k)2 = (y – (–k))2, thus the y-coordinate of the center is below the x-axis. As always, an example right about now might clarify what we have explained so far. One of the major advantages of describing a circle with an equation is that we can easily graph the equation. Similarly, if we know some basic facts about a particular circle, we can quickly determine the equation describing that circle. Graph the circle centered at (3, 2) and which has a radius of 2. solution In this problem, we are given the center of the circle, along with its radius. From the standard form of the equation of a circle, we now know three of the five terms. The x- and y-terms are completely arbitrary. Let's write down what we know: h = 3 x-coordinate of center of circle k = 2 y-coordinate of center of circle r = 2 radius of circle Thus our equation has the following form: (x – 3)2 + (y – 2)2 = 4 standard form of an equation of a circle To graph this on the Cartesian Plane, you need to solve for y. Note that we will actually obtain two possible solutions for y. This stems from the fact that this equation is quadratic (i.e. the y-term is raised to the second power), thus we will end up with two solutions from the Fundamental Theorem of Algebra. (y – 2)2 = y–2 = take square root of both sides y = add 2 to both sides to obtain y = rearrange terms slightly 264 – Circles 4 – (x – 3)2 subtract (x – 3)2 from both sides Quadratics This can be split into two different solutions: y 2 one possible solution 2 other possible solution = 2 + 4 − ( x − 3) = 2 − 4 − ( x − 3) or y When we graph each one of these possible solutions, we end up with only half a circle. This is because a circle is not a function (it fails the Vertical Line Test), so we have to graph each half of the circle separately. Using an automatic graphing utility, we discover that our circle looks like: answer Recognizing a Circle Equation Circles can be hidden in seemingly unrelated equations. Suppose we had the following equation. x2 + 4x + y2 + 6y – 12 = 0 funky equation At first glance, this seems totally unrelated to the equation of a circle in standard form, right? Wrong! It is actually the same equation in disguise. Let's examine why. Suppose we add "12" to both sides in order to get the constant on one side. x2 + 4x + y2 + 6y = 12 add 12 to both sides Now here is where we have to get creative. Let's use one of the methods used for solving quadratic equations, namely completing the square. However, we use this method twice, once for the terms involving x and once for the terms involving y. (x2 + 4x + ?) + (y2 + 6y + ?) = 12 start completing the square (x2 + 4x + 4) + (y2 + 6y + 9) = 12 + 4 + 9 add the same quantities to both sides (x + 2)2 + (y + 3)2 = 25 complete the square The last step above is, in fact, the equation of a circle in standard form. Here we have a circle with radius 5, and center located at (–2, –3). Quadratics Circles – 265 266 – Circles Quadratics Advanced To define a parabola using algebraic equations. parabola – the set of all points in the plane of a line l and a point F not on l whose distance from F equals its distance from l. directrix – a line associated with a parabola such that the distance from it to any point on the parabola is equal to the distance from that point to the focus. focus (parabola) – the point along with the directrix from which a point on a parabola is equidistant. vertex – the intersection of a parabola with its axis of symmetry. axis of symmetry – the line passing through both the focus and the vertex. The Greek mathematician Archimedes (287-212 B.C.) is among the first to work with parabolas. In his work Quadrature of the parabola, he finds the area of a segment of a parabola cut off by any chord. There are various stories related about his death. One rumor has it that he was killed in 212 B.C. when the Romans sacked the city of Syracuse (his hometown). Apparently he was so absorbed in a math problem that he refused to accompany a Roman soldier until he had finished the problem. Angered, the Roman soldier ran Archimedes through with his sword. Quadratics → Conic Sections → Parabolas If we graph the function f(x) = ax2 + bx + c on the Cartesian plane, we find that the graph is a distinctive shape known as a parabola. However, this equation is only good for parabolas that open upwards or downwards. We can also define "sideways" parabolas, by which we mean a parabola that opens to the left or right. What we would like to have is a general, all-purpose definition of a parabola that is independent of orientation or location in the plane. Recall that we defined a circle as the locus of points equidistant from a fixed point. Now we will define a parabola in a similar manner, using some sort of distance relation as the quantity that truly defines a parabola. The general definition is as follows: Geometric Definition of a Parabola A parabola is the set of all points (x, y) equidistant from a fixed line called the directrix and a fixed point called the focus not on the line. Quadratics Parabolas – 267 The midpoint between the focus and the directrix is called the vertex. We indicate the distance between the focus and the vertex by the letter p. The line passing through both the focus and the vertex is called the axis of the parabola. A parabola is symmetric with respect to its axis. The definition above gives us the general idea of what a parabola looks like, but we are actually more interested in specific parabolas. To get specific, we need some equations. Standard Equation of a Parabola The standard form of the equation of a parabola with vertex (0, 0) and directrix y = –p is x2 = 4py p≠0 vertical axis For directrix x = –p, the equation becomes y2 = 4px p≠0 horizontal axis The focus is on the axis p units (directed distance) from the vertex. If the vertex is located at the origin, then (h, k) = 0, and the equations above reduce to: (x – 0)2 = 4p(y – 0) vertical axis, vertex located at origin (0, 0) x2 = 4py simplify (y – 0)2 = 4p(x – 0) horizontal axis, vertex located at origin (0, 0) y2 = 4px simplify and The four basic orientations of a parabola are: up, down, right or left. Each orientation is determined by the direction it "opens." The parabola above, for example, opens upward. The factor that determines whether it opens up/down or right/left is the sign of p. If the parabola is on the vertical axis and p > 0, then the parabola opens up. If p < 0, then it opens downward. When the parabola is on the horizontal axis and p > 0, then the parabola opens to the right. When p < 0, the parabola opens to the left. Note: A parabola that opens to the right or left is not a function because it fails the Vertical Line Test, while a parabola that opens up or down is a function. Proof: Since the two cases of a parabola are similar, we will only prove the vertical axis case. First, we suppose that the directrix (y = –p) is parallel to the x-axis. We make the assumption that p > 0. Since p is the directed distance from the vertex to the focus, the focus must lie above the directrix. Now, according to the Geometric Definition of a Parabola, the point (x, y) on the parabola must be equidistant from (0, p) and the line y = –p. We can therefore use the Distance Formula to show that they are the same: 2 2 = y+p Distance Formula x2 + (y – p)2 = (y + p)2 square both sides x2 + y2 – 2py + p2 = y2 + 2py + p2 expand binomials x2 = y2 – y2 + 2py + 2py + p2 – p2 move all terms except x2 over to right hand side x2 = 4py simplify ( x − 0) + ( y − p ) 268 – Parabolas proof Quadratics Find the vertex and focus of the parabola and sketch its graph. x2 + 8y = 0 solution First, we need to rewrite the equation so that it is in standard form. All we need to do is move the 8y onto the other side of the equation: x2 + 8y = 0 given equation x2 = –8y subtract 8y from both sides (x – 0)2 = (4)(–2)(y – 0) equation of parabola in standard form Based on the equation of a parabola in standard form, we can see that the vertex (h, k) = (0, 0). Furthermore, the distance between the vertex and the focus is –2 units. Since this parabola has a vertical axis, this means it opens downward and the focus is located –2 units below the vertex. In other words, we have: vertex = (0, 0) and focus = (0, –2) answer To graph this parabola, we plot a few points on either side of the axis and then draw a line connecting them: x –8 –4 –2 0 2 4 8 y –8 –2 –0.5 0 –0.5 –2 –8 answer Applications of Parabolas Parabolic motion is one of the most common applications of parabolas. What do we mean by "parabolic motion"? As it turns out, if you throw an object across the room, the path it takes will be a parabola. This is even true if you simply roll a ball off a table. In that case, the path of the ball will be one half of a parabola that is vertically oriented and opens downward. One of the most useful properties of parabolas is the property of reflectivity. In physics, a surface is called reflective if the tangent line at any point on the surface makes equal angles with an incoming ray (such as light) and the resulting outgoing ray. The incoming angle is known as the angle of incidence and the outgoing angle is the angle of reflection. A flat mirror is one common example of a reflective surface. If a parabola is revolved about its axis, then we have another reflective surface. The key property of parabolic reflectors is that all incoming rays parallel to the axis are reflected to pass through the focus of the parabola. This is the operating Quadratics Parabolas – 269 principle behind radio telescopes and satellite dishes, which gather all incoming radio waves to one place: the focus. If the light source is located at the focus, as in a flashlight, then all outgoing rays are parallel. Finally, objects traveling through the solar system sometimes travel in a parabolic orbit. This may seem contradictory since most people think of an "orbit" as being a circular or elliptical path. An elliptical or circular orbit means that a celestial object, such as the Moon, is trapped and cannot escape the planet's gravitational field. A parabolic orbit, such as that traveled by many comets, means that the comet passes very close to some body (like the Earth), and then flies back out into space, never to be seen again. When the Pioneer spacecraft was launched, it was given a parabolic orbit so that it could escape Earth's gravitational field and leave the solar system. As it passed by the outer planets, we made sure that each time it was given a new parabolic orbit so that it would not become trapped in orbit around one of the four gas giants. 270 – Parabolas Quadratics Advanced To learn the relationship between the center, foci, and intercepts of an ellipse and the equation of an ellipse. ellipse – the set of points P in a plane which satisfy PF1 + PF2 = d, where F1 and F2 (its foci) are any two fixed points and d (its focal constant) is a constant with d > F1F2. focus (plural foci) – fixed points from which the sum of distances to a point on the conic is a constant. vertices (ellipse) – endpoints of the major axis of an ellipse. major axis – the line segment joining the two foci of an ellipse and has endpoints at the vertices of the ellipse. minor axis – line segment perpendicular to the major axis, passing through the center of the ellipse. semi-major axis – line segment that is half the length of the major axis. Its endpoints are the center of the ellipse and one of the vertices. semi-minor axis – a line segment half the length of the minor axis, perpendicular to the semi-major axis. eccentricity – a measure of the ovalness of an ellipse. A branch of mathematics known as elliptic geometry does not, in fact, derive its name from the ellipse. Instead, the name comes from the Greek word elleiptis, meaning to lack or fall short. Like hyperbolic geometry, the name elliptic geometry was coined by the geometer Felix Klein, creator of the Klein bottle. In elliptical geometry, there are no lines that pass through the point P and are parallel to the line L, unlike Euclidean geometry, where only 1 line passes through P and is parallel to L. Quadratics → Conic Sections → Ellipses Back around 1500 A.D., Nicholas Copernicus came up with his bizarre (at the time) notion that the Earth traveled around the Sun. In his day, it was believed that the Earth traveled in a perfectly circular orbit. Unfortunately, the mathematics at the time was unable to accurately describe the observed motions of the heavens. All sorts of wild attempts were made to fit the math to the observations, such as the idea of "epicycles", where the celestial objects traveled in a circular path on a circular path. For theological reasons, it was incomprehensible that the motions of the planets should be in any shape other than a circle. However, around 1660 A.D., a bright young astronomer from Germany by the name of Johannes Kepler finally arrived at the correct assumption: the planets travel in elliptical orbits. The Sun is at one of the foci of the ellipse. He is also responsible for Kepler's Three Laws of Motion, all dealing with ellipses and every one fitting the observed motions of the planets. As with all the other major conic sections, we can define an ellipse in geometric terms. We can also use algebraic equations to describe a particular ellipse. Geometric Definition of an Ellipse An ellipse is the set of all points (x, y) the sum of whose distances from two fixed points (foci) is constant. The line through the foci intercepts the ellipse as two points called the vertices (vertex is singular). The chord joining the vertices is called the major axis. The chord perpendicular to the major axis that passes through the center of the ellipse is called the minor axis. The length of half of the major axis is known as the semi-major axis (denoted by a) and the length of half of the minor axis is known as the semi-minor axis (denoted by b). Quadratics Ellipses – 271 Now that we have defined an ellipse geometrically, it is time to apply some algebra. The distance between one vertex and the center of the ellipse can be represented by the letter a. In which case, the entire length of the major axis then has length 2a. Similarly, we designate the length of the minor axis as 2b, where b is the distance between the center of the ellipse and the edge of the ellipse perpendicular to the vertex. Using these two quantities, we can develop a standard equation of an ellipse. Standard Equation of an Ellipse (Center at the Origin) The standard equation of an ellipse, with center at the origin (0, 0) and major and minor axes of lengths 2a and 2b, respectively, where a > b, is: = 1 major axis is horizontal minor axis is vertical = 1 major axis is vertical minor axis is horizontal The foci lie on the major axis, c units from the center, where c2 = a2 – b2. Find an equation of the ellipse with center at the origin. Vertices: ( 5, 0) Foci: ( 2, 0) solution We are given that the vertices are at (5, 0) and (–5, 0), which means that the major axis of this ellipse is horizontal. We are also given that the foci are at (2, 0) and (–2, 0). Using this information we already know that: a = 5 distance from vertex to center c = 2 distance from focus to center 272 – Ellipses Quadratics In order to find b, we use the relationship c2 = a2 – b2: c2 = a2 – b2 22 = 52 – b2 substitute in a = 5 and c = 2 4 = 25 – b2 simplify b2 = 21 subtract 4 from both sides and add b2 to both sides b = take square root of both sides Now in our standard form of the equation of an ellipse, we have: x 2 a x 2 2 2 ( 2 2 2 2 y 2 + = 1 standard form of equation of ellipse = 1 substitute in for a and b = 1 simplify answer 2 21) 2 4 y b y + x + 21 Eccentricity Now that we have defined ellipses, how can we tell one ellipse from another without long involved calculations? One way is by looking at just how "flattened" the ellipse is. Consider the orbits of the planets. The reason why early astronomers had difficulty determining that the shape of the orbits was elliptical was because the foci of the planets' orbits are relatively close to the center of the solar system, thus the orbits appear to be nearly circular. To measure the ovalness of an ellipse, we use the concept of eccentricity. Definition of Eccentricity The eccentricity e of an ellipse is given by: e = eccentricity Furthermore, for an ellipse, 1 > e > 0. Since the foci of an ellipse are located on the major axis between the center and the vertices, it follows that 0 < c < a If c = 0, then the foci are identical to the center, and we have a circle. Recall that c2 = a2 – b2, so if c = 0, then we have: 02 = a2 – b2 let c = 0 b2 = a2 axes are equal to each other Quadratics Ellipses – 273 which gives us, in the standard equation of an ellipse: x 2 a 2 + y a 2 2 x2 + y2 = 1 ellipse where a2 = b2 = a2 multiply both sides by a2 This is just the standard equation of a circle with r = a. As it turns out, eccentricity is a property shared by all conic sections, and, in fact, can be used to define a conic section. The area of an ellipse is given (exactly) by: A Note: = πab If a = b, then we just have the area of a circle: A = r2. 274 – Ellipses Quadratics Advanced To learn the geometric definition and standard equations associated with hyperbolas. foci (hyperbola) – the two points from which the difference of distances to a point on the hyperbola is constant. focal radius – a line connecting a point and a focus. difference of focal radii – the absolute value of the difference of the distances from a point on a hyperbola to the two foci of the hyperbola. transverse axis – the line passing through both the vertices and the foci of a hyperbola. asymptotes (hyperbola) – two lines that are approached by the points on the branches of a hyperbola as the points get farther from the foci. The hyperbola never touches the asymptotes. conjugate axis – a line perpendicular to the transverse axis and passing through the center of the hyperbola. eccentricity (hyperbola) – a measure of the flatness or roundedness of the branches of a hyperbola. A branch of mathematics known as hyperbolic geometry does not, in fact, derive its name from the hyperbola. Instead, the name comes from the Greek word hyperbole (note spelling), which means excessive. In hyperbolic geometry it can be proved that there exists more than one line passing through a point P and parallel to a line L, whereas in typical Euclidean geometry (the geometry most of us are familiar with), only 1 line can pass through P and be parallel to a given line L. Felix Klein, creator of the topological oddity of the Klein Bottle, is credited with coining the term hyperbolic geometry. Quadratics → Conic Sections → Hyperbolas Remember that we defined an ellipse as the sum of the distances between the foci and a point on the ellipse. A hyperbola, on the other hand, is like an inside-out ellipse. We define a hyperbola in terms of the difference of the distances between the foci and a point on the hyperbola. Geometric Definition of a Hyperbola A hyperbola is the set of all points P = (x, y) in the plane such that the difference of the distances from P to two fixed points F1 and F2 (the foci ) is a given constant. The line connecting P and a focus is known as a focal radius. Thus the given constant is sometimes called the difference of focal radii . Like ellipses, the line through the two foci intersects the hyperbola at two points, which we will again call the vertices. The line segment connecting the vertices is called the transverse axis. Quadratics Hyperbolas – 275 Just like every other conic section, we have a standard form for the equation of a hyperbola based on its orientation (vertical or horizontal). Standard Equation of a Hyperbola (Center at the Origin) The standard form of the equation of a hyperbola with center at (h, k) is: = 1 transverse axis is horizontal = 1 transverse axis is vertical The vertices are a units from the center and the foci are c units from the center. Also, b2 = c2 – a2. When attempting to draw a hyperbola, useful aids to sketching are the asymptotes of the hyperbola. Every hyperbola has two asymptotes that intersect at the center of the hyperbola. These asymptotes form the diagonals of a rectangle of dimensions 2a by 2b, with center at the origin (0, 0). Remember that 2a is the length of the line segment connecting the vertices. 2b is the length of the line segment connecting (0, b) and (0, –b) [horizontal orientation] and is called the conjugate axis. If the hyperbola is oriented vertically, then the conjugate axis connects (b, 0) and (–b, 0). In either case, the conjugate axis is perpendicular to the transverse axis and passes through the center of the hyperbola. Asymptotes of a Hyperbola (Center at the Origin) For a horizontal transverse axis, the equations of the asymptotes are y = and y = horizontal transverse axis For a vertical transverse axis, the equations of the asymptotes are y 276 – Hyperbolas = and y = vertical transverse axis Quadratics Graphically, we represent the asymptotes as follows: Note the rectangular box shape formed by the asymptotes. Remember that one of the defining characteristics of an asymptote is that the graph of the hyperbola never, ever crosses the asymptote. In fact, the hyperbola never even touches an asymptote. An asymptote is a line that comes arbitrarily close to the hyperbola as the branches of the hyperbola extend out to infinity. Usually when we sketch a hyperbola, we first plot the vertices, then we plot the endpoints of the conjugate axis, draw a rectangle the connects all four points, draw the asymptotes, and last we sketch the branches of the hyperbola outward from the vertices such that they approach the asymptotes (but never touch or cross the asymptotes!). a.) Find an equation of the specified hyperbola with center at the origin. Vertices: (0, 4) b.) Foci: (0, 5) Use the asymptotes to help sketch the hyperbola. solution a.) The vertices are located at (0, 4) and (0, –4), so the distance between the vertices and the center is a = 4. The foci are located at (0, 5) and (0, –5) which gives us c = 5. To find b, we use the relationship: b2 c2 – a2 = foci relationship Substituting in a = 4 and c = 5, we get: b2 = 52 – 42 substitute in a = 4 and c = 5 = 25 – 16 simplify = 9 We notice that the vertices are located along the y-axis, so we have a vertical transverse axis and our equation in standard form for this hyperbola is: y 2 16 Quadratics − x 2 25 = 1 transverse axis vertical answer Hyperbolas – 277 b.) Now we need to find the asymptotes in order to graph this hyperbola. Since we have a vertical transverse axis, we use the relation: y = a b x and y = a − x b In this case, a = 4 and b = 5, so we have: y = 4 5 x and y = 4 − x 5 Using these asymptotes, we can sketch the hyperbola as shown below. The hyperbola is represented by the red lines. answer Eccentricity of a Hyperbola Like ellipses, there is a special relationship between the distance from the center to a focus and the distance from the center to a vertex. This relationship is known as eccentricity and is defined for hyperbolas in exactly the same way we define eccentricity for ellipses. The difference is that whereas eccentricity for an ellipse is defined as 1 > e > 0, eccentricity for a hyperbola is defined such that e > 1. Definition of Eccentricity of a Hyperbola The eccentricity of a hyperbola is given by the ratio e Note: = This is the same definition for the eccentricity of an ellipse. However, in the case of a hyperbola, c > a, so it follows that e > 1. If the eccentricity is large, then the branches of the hyperbola are nearly flat. As e gets closer and closer to 1, the branches of the hyperbola are more pointed. 278 – Hyperbolas Quadratics Applications of Hyperbolas There are a variety of applications involving hyperbolas. As with any conic section, they can be used to trace the path of celestial objects. Many comets have hyperbolic orbits. Comets with hyperbolic (or parabolic) orbits can only be seen once when they pass by the Earth, but comets with elliptical (or circular) orbits, such as the famous Halley's comet can be seen periodically. In the case of Halley's comet, we see it every 76 years or so (its next visit is in 2061). The sun serves as one focus of the orbit. The type of orbit for a comet or other celestial body can be determined as follows: 1. Ellipse v < 2. Parabola v = 3. Hyperbola v > 2GM p 2GM p 2GM p In all three cases, p is the distance between the vertex and the focus of the comet's orbit (in meters), v is the velocity of the comet at the vertex (in meters per second), M is the mass of the sun (M 1.991 x 1030 kg), and G is Newton's Gravitational Constant (G ≈ 6.67 x 10−11 m3/(kg s2)). Note that the velocity is actually the escape speed necessary to escape the pull of the Sun's gravity. If the speed is too little, then the comet is trapped in an elliptical orbit, never to leave. If the speed is greater than or equal to the escape velocity, then the comet will eventually leave the solar system, never to return. One very useful application of using escape velocities is in planning missions to other planets. In order to escape Earth's gravity, a satellite or probe circles the Earth a few times to gain speed, when its speed is great enough, it eventually leaves on its journey to another planet (such as Mars). When it reaches Mars, it uses the gravitational pull of Mars in order to slow itself down to a velocity less than the escape velocity so it can remain in orbit. Using the gravity of a planet to gain speed is often called a "slingshot". Note: Due to conservation of momentum, every time a satellite travels around the Earth in order to gain speed, the rotation of the Earth slows down just a tiny bit. Similarly, when the satellite slows down around Mars, the rotation of Mars speeds up just a fraction. Mother Nature never gives away energy for free, although modern science is trying to find a way around that. Quadratics Hyperbolas – 279 280 – Hyperbolas Quadratics Advanced To summarize the equations we use to translate conic sections around in the Cartesian plane. No definitions on this page. Archimedes of Syracuse (287-212 B.C.) was one of the greatest thinkers of the ancient world. He established principles for plane and solid geometry, discovered the concept of specific gravity, and incidentally discovered the area of a circle is equal to π times its radius squared, that is, A = π r2. He also used a sophisticated technique to find the area under a curve, which anticipated the development of integral calculus by about 2000 years. Quadratics → Conic Sections → Translation of Conics In all the other thoughts under the thought Conic Sections, we look at various shapes such as circles, parabolas, ellipses and hyperbolas. In order to introduce you to the fundamental concepts, we leave each shape centered at the origin. Now it is time to start monkeying around with the equations and moving the conic sections around in the Cartesian plane. The basic form of the equations remains virtually unchanged, except that everywhere we see an x, we plug in (x – h) and everywhere we see a y, we plug in (y – k). In general, the coordinate point (h, k) represents the "new" center of the conic section. When (h, k) = (0, 0), the conic section is centered on the origin. If h = 0 and k 0, then the conic section is shifted along the vertical axis. If h 0 and k = 0, then the conic section is shifted along the horizontal axis. Anytime you are confronted with a conic section that is not centered at the origin, this is the page you want to come back to for reference. Quadratics Translation of Conics – 281 Standard Forms of Equations of Conics Circle: Center = (h, k) Radius = r (x – h)2 + (y – k)2 = r2 standard equation of a circle Parabola: Vertex = (h, k) Directed distance from vertex to focus = p (x – h)2 = 4p(y – k) parabola opens up or down (y – k)2 = 4p(x – h) parabola opens left or right Ellipse: Center = (h, k) Major axis length = 2a Minor axis length = 2b ( x − h) b 2 2 + ( y − k) a 2 = 1 major axis horizontal = 1 major axis vertical 2 Hyperbola: Center = (h, k) Transverse axis length = 2a Conjugate axis length = 2b Note: = 1 transverse axis horizontal = 1 transverse axis vertical If h < 0, then the conic is shifted to the left. If h > 0, then the conic is shifted to the right. If k < 0, the conic is shifted downward. If k > 0, then the conic is shifted upward. Rather than give an extended example of how each equation can be used on this page, we have subdivided examples of each translation into separate thoughts in the Algebra Brain. Translations of conics have important applications in the real world. When we trace the trajectory of a projectile, it travels in a parabolic path. The vertex of the parabola is often far from the origin. Usually, we assign the origin to be the starting point of the projectile and then figure out the rest of the problem from there. Another real world example involves celestial mechanics. Orbits of planets are elliptical with the sun at one focus. If we place the origin so that it coincides with the sun, then problems involving orbital mechanics become slightly easier to solve. For more in-depth examples of how to move conic sections around the Cartesian plane, check out the following thoughts: Circle Translation Parabola Translation Ellipse Translation Hyperbola Translation 282 – Translations of Conics Quadratics Advanced To move circles around in the Cartesian plane. No definitions on this page. Apollonius of Perga (c.262-170 B.C.) is perhaps the greatest Greek geometer since Euclid. His work, Conics, is considered to be the greatest achievement of Greek geometry. In Conics, Apollonius set forth new ideas on how to subdivide the cone to produce circles. He even improved on Archimedes calculation of π. Quadratics → Conic Sections → Translation of Conics → Circle Translation In the previous thought, we introduced the notion of moving a conic section around in the plane. Now we are going to focus on a specific example, namely the circle. For your convenience, we will once again provide you with the standard equation of a circle: Standard Equation of a Circle The point (x, y) lies on the circle of radius r and center (h, k) if and only if (x – h)2 + (y – k)2 = r2 standard form of the equation of a circle For a circle centered on the origin, the point (h, k) = (0, 0). Find the standard form of the equation of the specified circle: Center: (3, –2) Solution Point: (–1, 1) Graph the circle. solution We are given two pieces of data about this circle. We know its center and we know a given point located on that circle. All we need to do is find the radius of that circle. To do that we apply the distance formula, since the distance between the center and the solution point will be equal to the radius. We let C = (h, k) = (3, –2) and P = (x1, y1) = (–1, 1). Quadratics Circle Translation – 283 ( x1 − h ) 2 + ( y1 − k ) 2 = r Distance Formula 2 = r substitute in (h, k) = (3, –2) and (x1, y1) = (–1, 1) 2 = r simplify 16 + 9 = r simplify 25 = r simplify 5 = r take positive square root 2 ( −1 − 3) + ( 1 + 2) 2 ( −4) + ( 3) Thus we now know that the radius of the circle is 5. Using our standard equation of a circle, we can thus define this circle as: (x – h)2 + (y – k)2 = r2 standard equation of circle (x – 3)2 + (y + 2)2 = 25 substitute in (h, k) = (3, –2) and r = 5 answer The graph of this circle looks like: answer 284 – Circle Translation Quadratics Advanced To move parabolas around in the Cartesian plane. No definitions on this page. The eighth volume of Apollonius's work, Conics has long since vanished from the world, along with a number of other writings. Scholars are now trying to piece them back together. Among his writings, he mentions "burning mirrors", wherein he disproved the notion that parallel rays of light could be focused by a spherical mirror. Quadratics → Conic Sections → Translation of Conics → Parabola Translation Moving a parabola around the Cartesian plane is very similar to moving a circle around. The defining point of a circle is its center, whereas for a parabola, we define it in terms of its vertex. The basic equation for a parabola translated about the Cartesian plane is very similar to the equation of a parabola with its vertex at the origin. The only difference is that we replace x with (x – h) and y with (y – k). Now we will show an example or two on how to work with parabolas moved around the plane. For convenience, we once again provide you with the standard equation of a parabola. Standard Equation of a Parabola The standard form of the equation of a parabola with vertex (h, k) and directrix y = –p is (x – h)2 = 4p(y – k) p≠0 vertical axis For directrix x = –p, the equation becomes (y – k)2 = 4p(x – h) p≠0 horizontal axis The focus is on the axis p units (directed distance) from the vertex. Find the vertex, focus, and directrix of the parabola and sketch its graph. ⎛x + ⎜ ⎝ 1⎞ ⎟ 2⎠ 2 = 4(y – 3) given parabola solution Since our parabola equation is in standard form, we can immediately find the vertex, which is located at: ⎛ − 1 , 3⎞ ⎜ 2 ⎟ ⎝ ⎠ coordinates of the vertex answer Our equation also tells us that p = 1. Since x is the variable that is squared, we know that this has a vertical axis. Therefore, the focus is located p units above the vertex (because p is positive). The focus lies at: ⎛ − 1 , 4⎞ ⎜ 2 ⎟ ⎝ ⎠ Quadratics coordinates of the focus answer Parabola Translation – 285 Finally, by definition the directrix lies p units below the vertex, so we know that the directrix is given by the line: y = equation of directrix answer 2 Using these three quantities, we can now sketch the graph of the parabola: answer Find an equation of the parabola with the given parameters: Vertex: (3, 2) Focus: (–1, 2) Graph the parabola. solution Note that the y-coordinates of both the vertex and the focus are the same. That tells us that this parabola has a horizontal axis of symmetry. We are given that the vertex (h, k) is (3, 2). Now we need to find p in order to find an equation. We can find p by taking the difference of the x-coordinates of the vertex and the focus. In this case, the difference is 4. Since the focus is located to the left of the vertex on the Cartesian plane, we know that this parabola opens up to the left. Therefore, p = –4. Using the vertex and p, we can now find our equation: (y – k)2 = 4p(x – h) standard equation of parabola opens to the right or left (y – 2)2 = 4(–4)(x – 3) substitute in (h, k) = (3, 2) and p = –4 (y – 2)2 = –16(x – 3) simplify answer Now we use this equation to help us graph the parabola: 286 – Parabola Translation Quadratics answer Quadratics Parabola Translation – 287 288 – Parabola Translation Quadratics Advanced To move ellipses around in the Cartesian plane. No definitions on this page. Apollonius of Perga's (c.262-170 B.C.) most important contribution to the study of conics was the fact that he used a single cone to produce all conic sections. He split the cone along two fixed lines called the latus transversum and the latus erectum. These eventually became a coordinate system and frame of reference. Thus geometry could do the work that algebra does now. Quadratics → Conic Sections → Translation of Conics → Ellipse Translation Translation of ellipses is very similar to translations of other conic sections, particularly circles, since ellipses and circles are directly related. Once again, we replace all instances of x with (x – h) and all instances of y with (y – k), where (h, k) is the location of the center of the ellipse. For convenience, we give you the standard equations of an ellipse again, this time not centered at the origin. Standard Equation of an Ellipse The standard equation of an ellipse, with center (h, k) and major and minor axes of lengths 2a and 2b, respectively, where a > b, is: = 1 major axis is horizontal minor axis is vertical = 1 major axis is vertical minor axis is horizontal The foci lie on the major axis, c units from the center, where c2 = a2 – b2. Find the center, foci, and vertices of the ellipse, and sketch its graph. ( x − 1) 9 2 + ( y − 5) 25 2 = 1 solution First, we notice that the number under the y-term is larger than the number under the x-term. Recall that for an ellipse, a > b, therefore if we compare our given equation to the standard form of the equation, we realize that this ellipse has a vertical major axis and a horizontal minor axis. The second thing we can find just by looking at our equation is the center. Again, we compare our given equation to the standard equation. The center is located at (h, k). In our given equation, h = 1 and k = 5, so the center is located at: (1, 5) Quadratics center answer Ellipse Translation – 289 To find the vertices, we need to know a. We already stated that a2 is the larger of the denominators, by definition, so we can conclude that: a = = 25 5 The vertices of the ellipse are located a units away from the center. Our ellipse has a vertical major axis, so the vertices are located 5 units above the center and 5 units below the center, at: (1, 5 – 5) = (1, 0) one vertex (1, 5 + 5) = (1, 10) other vertex answer Finally, we need to find the foci of the ellipse. According to the algebraic definition of an ellipse given above, the foci lie c units from the center (in this case, above and below the center because our major axis is vertical), where c2 = a2 – b2 foci equation c2 = 25 – 9 substitute in a2 = 25 and b2 = 9 c2 = 16 simplify c = 4 take square root of 16 Thus the foci are located at (1, 5 + 4) = (1, 9) one focus (1, 5 – 4) = (1, 1) other focus answer Once we have determined the center, vertices, and foci of this ellipse, we can now graph it: answer 290 – Ellipse Translation Quadratics Find an equation for the ellipse with the given features: Vertices: (0, 2) and (4, 2) Minor axis length of 2 Graph the ellipse. solution We are given very little information about this ellipse, but believe it or not, we can find everything out about this ellipse we need just from these two pieces of information. First, let's deal with the vertices. By definition, the center of the ellipse is located at the midpoint of the line segment connecting the two vertices (the major axis). Therefore, if we use the Midpoint Formula, we can easily find the center of the ellipse: ⎛ x1 + x2 y1 + y2 ⎞ , ⎜ ⎟ 2 ⎝ 2 ⎠ = ⎛ 0 + 4, 2 + 2⎞ ⎜ 2 ⎟ 2 ⎠ ⎝ Midpoint Formula = ⎛ 4, 4⎞ ⎜ 2 2⎟ ⎝ ⎠ simplify = (2, 2) simplify center of the ellipse We are given that the length of the minor axis is 2, which is equal to 2b, where b is the length of the line segment connecting the center and the endpoint of the minor axis. This implies that b = 1. The length of the major axis is defined as 2a, so a is the distance between the center and the vertices. In this case, the length of the major axis is 4, so then a = 2. One last thing we have to determine is which way the major axis is oriented (vertical or horizontal). Since the vertices both have the same y-coordinate, and by definition are located on the major axis, we now know that the major axis is aligned horizontally. Thus, we use the first standard equation given in the definition above: ( x − h) a 2 + 2 ( x − 2) b 2 + 2 2 ( x − 2) 4 ( y − k) 2 2 ( y − 2) = 1 standard equation of ellipse major axis vertical = 1 substitute in (h, k) = (2, 2), a = 2, b = 1 = 1 simplify answer 2 2 1 2 + ( y − 2) 1 2 The graph of this ellipse looks is shown below: Quadratics Ellipse Translation – 291 answer 292 – Ellipse Translation Quadratics Advanced To move hyperbolas around in the Cartesian plane. No definitions on this page. Among the 400 theorems in Apollonius's Conics are ideas for cataloging "new" kinds of closed conics called parabolas, ellipses, and hyperbolas. He also sets aside the Pythagorean distaste for infinities, infinitesimals, and infinite sets. This new way of thinking would influence later mathematicians and pave the way for infinitesimal calculus. Finally, his notions of epicircles, epicycles and eccentrics would influence Ptolemaic cosmology until Johannes Kepler (1571-1630) discovered the correct motions of the planets: elliptical orbits. Quadratics → Conic Sections → Translation of Conics → Hyperbola Translation When we first introduced the notion of a hyperbola, we confined our discussion to a hyperbola whose center is located at the origin. However, not all hyperbolas can be written so that their center is at the origin and oftentimes is serves our purposes better to locate the center of the hyperbola somewhere else. Like translations of circles, parabola, and ellipses, translation of hyperbolas in the Cartesian plane does not involve a really radical (not in the mathematical sense) change to our original hyperbola equation. Again, we simply replace all instances of x with (x – h) and we replace all instances of y with (y – k). Note that this applies to both the standard equations of the hyperbola and the equations of the asymptotes. Since we shift the center of the hyperbola to the point (h, k), we also shift the intersection of the asymptotes, thus the asymptotes themselves are shifted by exactly the same amount. This will hopefully become clearer when we work an example. We now provide the equations for a hyperbola whose center is located at (h, k). If (h, k) = (0, 0), then the center is located at the origin, and we obtain the same hyperbola equations as we did before. Standard Equation of a Hyperbola The standard form of the equation of a hyperbola with center at (h, k) is: = 1 = 1 transverse axis is horizontal transverse axis is vertical The vertices are a units from the center and the foci are c units from the center. Also, b2 = c2 – a2. The equations for the asymptotes are then as follows: Quadratics Hyperbola Translation – 293 Asymptotes of a Hyperbola For a horizontal transverse axis, the equations of the asymptotes are (y – k) = and (y – k) = For a vertical transverse axis, the equations of the asymptotes are (y – k) = and (y – k) = Find the center, vertices, and foci of the hyperbola and sketch its graph. Sketch the asymptotes as an aid in obtaining the graph of the hyperbola. ( x + 1) 4 2 − ( y − 2) 2 = 1 1 solution We can find the center of the hyperbola just by looking at the given equation. If we compare the given hyperbola equation to the standard equation for a hyperbola, we can see that the center is located at (–1, 2). (Remember that x + h = x – (–h)): Center: (–1, 2) center of the hyperbola answer Since the x-term is first in the hyperbola equation, we know that this hyperbola has a horizontal transverse axis. We need to know this in order to find the vertices and foci. To find the vertices, we need a, which is the square root of the denominator under the x-term: a2 = a = 4 2 take square root of both sides The number a tells us how many units from the center the vertices are located. In this case, the vertices are located 2 units to the left and right of the center (because it is aligned horizontally). Thus, the vertices are located at: (–1 – 2, 2) = (–3, 2) one vertex (–1 + 2, 2) = (1, 2) the other vertex answer Once we have found the vertices, we can concentrate on the foci, which are each located c units away from the center of the hyperbola. The number c is found from the formula: c2 = a2 + b2 foci equation = 4+1 substitute in a2 = 4 and b2 = 1 c2 = 5 simplify c = 5 294 – Hyperbola Translation take square roots of both sides Quadratics Note that the square root of 5 is larger than 2, or a, which is as it should be. For a hyperbola, c > a. Thus, the foci are located at: ( −1 + 5 , 2) ≈ (1.236, 2) one focus ( −1 − 5 , 2) ≈ (–3.326, 2) other focus answer We are almost ready to sketch our hyperbola. All we need now are the equations for our asymptotes to help us sketch the branches of the hyperbola. According to our asymptote equations from the top of the page, they should take the form: (y – k) b = a ( x − h) and (y – k) = b − ( x − h) a because the hyperbola has a horizontal transverse axis. If we substitute in (h, k) = (–1, 2), a = 2 and b = 1, we obtain: (y – 2) = y = 1 2 1 2 ( x + 1) x+ 5 2 substitute in (h, k) = (–1, 2), a = 2 and b = 1 rewrite equation in Slope-Intercept Form one asymptote and (y – 2) = 1 − ( x + 1) 2 substitute in (h, k) = (–1, 2), a = 2 and b = 1 y = 1 3 − x+ 2 2 rewrite equation in Slope-Intercept Form the other asymptote When we finally sketch the graph of this hyperbola, we end up with: answer Although we have cluttered up our hyperbola considerably, labeling all points of interest, it is not always necessary to do so. If we remove all the extraneous information, our hyperbola looks like: Quadratics Hyperbola Translation – 295 where the red lines represent the branches of the hyperbola and the blue lines represent the asymptotes. 296 – Hyperbola Translation Quadratics Unit 5 Matrices and Determinants Matrices and Determinants – 297 298 – Matrices and Determinants Advanced To learn the basic definition of a matrix. matrix – a rectangular array of elements used to facilitate the study of problems in which the relation between these elements is fundamental. entry (matrix) – the object in a particular row and column of a matrix. rows – horizontal lines of numbers and variables in a matrix. columns – vertical lines of numbers or variables in a matrix. main diagonal entries – matrix entries that form a diagonal line from the top left corner to the lower right. Arthur Cayley (1821-1895) was one of the most prolific mathematicians of the 19th century. Besides being a mathematician, he also practiced law for 14 years, while at the same time writing over 300 mathematical papers. He is most famous for his invention of the algebra of matrices. This is partly a result of attempting to solve problems involving "n-dimensional" geometry, a term he coined. He saw that increasing dimensionality is the same as adding another variable to a linear equation. Solving such equations is child's play with matrix algebra, but considerably more difficult with traditional techniques such as the Method of Substitution. Matrices In everyday life, we tend to arrange objects in an orderly fashion. In mathematics, we do the same thing. An orderly arrangement of numbers and/or expressions is known as a matrix. In this section of the Algebra Brain, you will learn about the properties of matrices (plural of "matrix"). You will also learn how to manipulate matrices using properties of addition and multiplication. Finally, you will learn how to use matrices to solve actual problems that occur in real life. Before we describe matrices in detail, we need a working definition of what a matrix actually is: Definition of a Matrix If m and n are positive integers, an m x n matrix (read “m by n”) is a rectangular array in which each entry, aij, of the matrix is a number. An m x n matrix has m rows (horizontal lines) and n columns (vertical lines). A matrix having m rows and n columns is said to be of order m x n. If m = n, the matrix is square and of order n. For a square matrix, the entries a11, a22, . . ., ann are the main diagonal entries. Matrices and Determinants Matrices – 299 a. c. Order: 2 x 1 b. Order: 2 x 3 2 3 3 0 2 2 Order: 3 x 2 4 5 4 d. 5 0 Order: 4 x 4 (square matrix) 3 1 1 2 2 1 3 3 4 4 2 5 5 6 6 1 7 7 8 8 Matrices have a wide range of applications. Among the most common use of matrices is to store data. This is how computers store and process information. All information is stored in an array. Then if the computer needs a piece of information for some process, it just needs to look at a particular section of the hard drive to find it. This speeds up processing time considerably. Another extremely common and important use is to solve many equations with many unknowns simultaneously. Matrices can greatly simplify the process, although we usually turn to computers to give us a hand with this as well, as they can do it many times faster than a human. 300 – Matrices Matrices and Determinants Advanced To introduce operations involving matrices and to see how two matrices are equal to one another. No definitions on this page. William Rowan Hamilton (1805-1865) was an Irish mathematician and physicist. He showed promise of genius at an early age, mastering the languages of Greek, Latin, Hebrew, Persian, Arabic, and Sanskrit by the time he was ten. In mathematics, Hamilton worked to justify the use of negative and imaginary quantities. He used his ideas about negative numbers to construct the set of rational numbers. His work with imaginary numbers led to the discovery of the quaternion, or fourth-dimensional equation, in 1843. Matrices → Matrix Operations Matrices have a wide range of applications besides solving systems of linear equations. There is a very rich and diverse branch of mathematics that deals exclusively with matrices. The British mathematician Arthur Cayley first invented matrices in 1858. He was a Cambridge University graduate and a lawyer by profession. Cayley and two American mathematicians, Benjamin Peirce (1809 – 1880) and his son Charles S. Peirce (1839 – 1914) are credited with developing "matrix algebra". In this section of the Algebra Brain, we will look at some of the fundamental properties and operations performed with matrices. First, we need to understand how two matrices relate to each other. In other words, we want to know how to represent matrices and when matrices are equal to each other. Representation of a Matrix Matrices can be represented in three standard ways: 1.) A matrix can be denoted by an uppercase letter such as A, B, C, etc. 2.) A matrix can be denoted by a representative element enclosed in brackets, such as [aij], [bij], etc. 3.) A matrix can be denoted by a rectangular array of numbers, such as: A Note: = [aij] = Two matrices are equal if and only if corresponding entries are equal. That is, they must have the same order (m x n) and aij = bij for 1 ≤ i ≤ m and 1 ≤ j ≤ n. Matrices and Determinants Matrix Operations – 301 Solve for a11, a12, a21, and a22 in the following matrix equation: ⎛ a11 a12 ⎞ ⎜ ⎟ ⎝ a21 a22 ⎠ = ⎛ 4 −7 ⎞ ⎜ ⎟ ⎝3 0 ⎠ solution Since two matrices are only equal if their corresponding entries are equal, we can conclude that: a11 = 4 and a12 = –7 and a21 = 3 and a22 = 0 answer 302 – Matrix Operations Matrices and Determinants Advanced To learn the definition of the identity matrix. identity matrix – a matrix that consists of 1’s on its main diagonal and 0’s elsewhere. James Joseph Sylvester (1814-1897) used the term matrix to denote "'an oblong arrangement of terms consisting, suppose, of m lines and n columns" because "out of that arrangement, we may form various systems of determinants." Although Sylvester coined the term, it was his friend Arthur Cayley who put the terminology to use in his papers of 1855 and 1858. Matrices → Matrix Operations → Identity Matrix One of the most useful types of matrices is the identity matrix, defined as follows: Definition of Identity Matrix The n x n matrix that consists of 1’s on its main diagonal and 0’s elsewhere is called the identity matrix of order n and is denoted by In = 1 0 0 0 0 1 0 0 0 0 1 0 0 0 0 1 identity matrix Note than an identity matrix must be square. That is, it has the same number of rows and columns. When the order is understood to be n, we can simply denote the identity matrix In as I. Also, if A is an n x n matrix, then the identity matrix has the property that AIn = A and InA = A. For example: ⎛4 3 0 ⎞ ⎛1 0 0⎞ ⎜ 2 −1 −3 ⎟ ⋅ ⎜ 0 1 0 ⎟ ⎜ ⎟⎜ ⎟ ⎝7 0 4 ⎠ ⎝0 0 1⎠ = ⎛4 3 0 ⎞ ⎜ 2 −1 −3 ⎟ ⎜ ⎟ ⎝7 0 4 ⎠ AI3 = A = ⎛4 3 0 ⎞ ⎜ 2 −1 −3 ⎟ ⎜ ⎟ ⎝7 0 4 ⎠ I3A = A and ⎛1 0 0⎞ ⎛4 3 0 ⎞ ⎜ 0 1 0 ⎟ ⋅ ⎜ 2 −1 −3 ⎟ ⎜ ⎟⎜ ⎟ ⎝0 0 1⎠ ⎝7 0 4 ⎠ Matrices and Determinants Identity Matrix – 303 304 – Identity Matrix Matrices and Determinants Advanced To add/subtract matrices together. zero matrix – a matrix in which all the elements are zero. Johannes Kepler (1571-1630) is credited with discovering the correct shape of the orbits of the planets (elliptical), but he was actually trying to discover the mathematical rules God used to create the world. In his work Mysterium Cosmographicum, he attempts to explain why there are only six planets. According to Kepler, this is because "God is always a geometer." Kepler's theory was that each planet was separated from its neighbors by a regular Euclidean solid. For instance, inside the orbit of Saturn, one could inscribe a cube, which in turn circumscribed the orbit of Jupiter. Matrices → Matrix Operations → Matrix Addition One of the most basic operations of real numbers is to add two real numbers together. Similarly, one of the most basic operations of matrices is to add two matrices together. However, we have to note a few ground rules. First, we can only add matrices of the same order. For example, if we have a 2 x 3 matrix, we can only add it to another 2 x 3 matrix. Second, to add two matrices of the same order, we simply add their corresponding entries. That is all there is to it. The process is summarized below: Definition of Matrix Addition If A = [aij] and B = [bij] are matrices of order m x n, their sum is the m x n matrix given by A+B = [aij + bij] The sum of two matrices of different orders is undefined. Let's look at a couple of examples: Add the following matrices: Note: a.) ⎛1 1 ⎞ ⎛2 3⎞ ⎜ ⎟+⎜ ⎟ ⎝ 3 −4 ⎠ ⎝ 3 5 ⎠ ⎛1+ 2 1+ 3 ⎞ ⎜ ⎟ ⎝ 3 + 3 −4 + 5 ⎠ b.) ⎛ 3 2⎞ ⎛ 0 3⎞ ⎜ −5 7 ⎟ + ⎜ −2 9 ⎟ ⎜ ⎟ ⎜ ⎟ ⎝ −1 0 ⎠ ⎝ −1 4 ⎠ c.) ( −3 2 −1 ) + ( 3 −2 1 ) = = = ⎡ 3 + 0 2 + 3⎤ ⎢ −5 + ( −2) 7 + 9⎥ ⎢ ⎥ ⎣ −1 + ( −1) 0 + 4⎦ = ⎛3 4⎞ ⎜ ⎟ ⎝6 1⎠ = answer ⎛3 5⎞ ⎜ −7 16 ⎟ ⎜ ⎟ ⎝ −2 4 ⎠ [ −3 + 3 2 + ( −2) −1 + 1 ] = answer (0 0 0) answer The solution to (c) is called a zero matrix because all its elements are zero. Matrices and Determinants Matrix Addition – 305 Matrix subtraction is defined in similar manner to matrix addition except that we take the difference between corresponding entries. Again, both matrices must have the same exact dimensions in order for matrix subtraction to work. 306 – Matrix Addition Matrices and Determinants Advanced To multiply a matrix by a real number. scalar – a real number. scalar multiplication – the product of a real number and a matrix. The English word "sine" comes from a series of mistranslations of the Sanskrit word jya-ardha, often abbreviated as simply jya or jiva. The Arabs later translated this into an otherwise meaningless Arabic word jiba. Since Arabic is written without vowels, later writers interpreted jb as jaib, which means "bosom" or "breast". When Arabic trigonometry was translated into Latin in the 12th century, the translator use the Latin equivalent sinus, which also means "bosom", and by extension, "fold" (as in a toga over a breast) or a bay or gulf. This Latin word has now become our English "sine". Matrices → Matrix Operations → Scalar Multiplication When we deal with variables and real numbers, we often multiply the two together. The number in front of the variable is known as a coefficient. However, we can also call constant multiples of real numbers scalars because they "scale" the number. For instance, if we have the expression 2x, the number 2 "scales" the variable x. No matter what x is, the result of evaluating the expression is a number that is twice as large as the number we started with. Now we are going to do the same thing to matrices. Given a matrix A, if we multiply the entirety of the matrix by some real number constant, we end up "scaling" all the elements within the matrix. Thus if we have the expression 3A, what we actually mean is that we multiply each and every element within the matrix A by 3. We call this process scalar multiplication and define it as follows: Definition of Scalar Multiplication If A = [aij] is an m x n matrix and c is a scalar (any real number), the scalar multiple of A by c is the m x n matrix given by cA = [caij] scalar multiplication of a matrix Scalar multiplication is often combined with matrix addition to create matrix expression similar to algebraic expressions with real numbers. We can take advantage of these expressions to solve matrix equations in much the same way we solve normal algebraic equations. Let's look at a few examples of simplifying matrix expressions using scalar multiplication and matrix addition. For the given matrices find: a.) 4A + B b.) –2B c.) A – 3B A = ⎛ 2 −2 1 ⎞ ⎜4 3 7⎟ ⎜ ⎟ ⎝ 0 −3 8 ⎠ Matrices and Determinants and B = ⎛0 0 3⎞ ⎜ 0 −2 1 ⎟ ⎜ ⎟ ⎝ 7 −6 5 ⎠ Scalar Multiplication – 307 solution Matrix operations tend to follow the same order of operations as normal real numbers. First we perform any scalar multiplications. Then we perform the addition and subtraction operations in order from left to right. a.) b.) c.) 4A + B –2B A – 3B = ⎛ 2 −2 1 ⎞ ⎛ 0 0 3 ⎞ 4⎜ 4 3 7 ⎟ + ⎜ 0 −2 1 ⎟ ⎜ ⎟ ⎜ ⎟ ⎝ 0 −3 8 ⎠ ⎝ 7 −6 5 ⎠ multiply each element in A by the scalar 4 = ⎛ 8 −8 4 ⎞ ⎛ 0 0 3 ⎞ ⎜ 16 12 28 ⎟ + ⎜ 0 −2 1 ⎟ ⎜ ⎟ ⎜ ⎟ ⎝ 0 −12 32 ⎠ ⎝ 7 −6 5 ⎠ add 4A to B = ⎛ 8 8 −7 ⎞ ⎜ 16 10 29 ⎟ ⎜ ⎟ ⎝ 7 −18 37 ⎠ answer = ⎛0 0 3⎞ −2⋅ ⎜ 0 −2 1 ⎟ ⎜ ⎟ ⎝ 7 −6 5 ⎠ multiply each element in B by the scalar –2 = ⎛ 0 0 −6 ⎞ ⎜ 0 4 −2 ⎟ ⎜ ⎟ ⎝ −14 12 −10 ⎠ answer = ⎛ 2 −2 1 ⎞ ⎛0 0 3⎞ ⎜ 4 3 7 ⎟ − 3⋅ ⎜ 0 −2 1 ⎟ ⎜ ⎟ ⎜ ⎟ ⎝ 0 −3 8 ⎠ ⎝ 7 −6 5 ⎠ multiply each element in B by the scalar 3 = ⎛ 2 −2 1 ⎞ ⎛ 0 0 9 ⎞ ⎜ 4 3 7 ⎟ − ⎜ 0 −6 3 ⎟ ⎜ ⎟ ⎜ ⎟ ⎝ 0 −3 8 ⎠ ⎝ 21 −18 15 ⎠ subtract 3B from A = ⎛ 2 −2 −8 ⎞ ⎜ 4 9 4⎟ ⎜ ⎟ ⎝ −21 15 −7 ⎠ answer 308 – Scalar Multiplication Matrices and Determinants Advanced To multiply two matrices together. No definitions on this page. In ancient China, the civil service required its employees to undertake examinations for advancement. These examinations were largely based on Chinese literature, but some administrative services - surveying, taxation, calendar making - required knowledge of mathematics. Instead of attempting to find creative solutions to problems, applicants for offices were encouraged to study "classical" mathematics, tried and true methods for finding solutions. The examination system often required recitation of passages from mathematical texts as well as the solving of problems in the same manner as described in those texts. Matrices → Matrix Operations → Matrix Multiplication Matrix multiplication is by far the most convoluted of the basic matrix operations. Unlike matrix addition, where we simply add corresponding entries, matrix multiplication is a combination of multiplication and addition, and not of corresponding entries, either. First let's define matrix multiplication. Then we will explain the definition and give a few examples of how it works. Definition of Matrix Multiplication If A = [aij] is an m x n matrix and B = [bij] is an n x p matrix, the product AB is an m x p matrix AB = [cij] matrix multiplication where cij = ai1b1j + ai2b2j + ai3b3j + . . . + ainbnj. In order for AB to be defined, the number of columns of the first matrix must equal the number of rows of the second matrix. The resulting matrix has the order given by the number of rows of the first matrix and the number of columns of the second matrix. That is, if A is an m x n matrix, B must be an n x p matrix in order for matrix multiplication to work. The result will be a matrix of order m x p. Note: The definition of matrix multiplication indicates a row-by-column multiplication. That is, we multiply the entries in the first row of A by the corresponding entries in the first column of B and add up the results. The final tally goes into the first entry of the resultant matrix. We do the same thing with the first row of A and the second column of B. That result goes into the first row, second column of resultant matrix C. And so on. Although matrix multiplication can quickly become tedious, there are good reasons for using it. In today's modern world, we often use computers to perform matrix operations. However, we still need to know how to do it by hand just in case we do not have a computer handy. Let's find the product of two matrices to see how this process works. Find the product AB, where A = ⎛1 2⎞ ⎜ ⎟ ⎝4 2⎠ Matrices and Determinants and B = ⎛ 2 −1 ⎞ ⎜ ⎟ ⎝ −1 8 ⎠ Matrix Multiplication – 309 solution Before we do anything else, we need to double check the order of the matrices to make sure that we can multiply them together. Matrix A is 2 x 2 and matrix B is 2 x 2. Since the number of columns of A equals the number of rows of B, we conclude that we will be able to multiply the matrices together. Furthermore, we will obtain a 2 x 2 matrix as the result, in the form: ⎛ 1 2 ⎞ ⋅ ⎛ 2 −1 ⎞ ⎜ ⎟⎜ ⎟ ⎝ 4 2 ⎠ ⎝ −1 8 ⎠ = ⎛ c11 c12 ⎞ ⎜ ⎟ ⎝ c21 c22 ⎠ form of resultant matrix To find the entries of the product matrix C, multiply each row of A by each column of B as follows: AB = ⎛ 1 2 ⎞ ⎛ 2 −1 ⎞ ⎜ ⎟ ⋅⎜ ⎟ ⎝ 4 2 ⎠ ⎝ −1 8 ⎠ multiply two matrices = ⎡ ( 1) ( 2) + ( 2) ( −1) ( 1) ( −1) + ( 2) ( 8) ⎤ ⎢ ⎥ ⎣ ( 4) ( 2) + ( 2) ( −1) ( 4) ( −1) + ( 2) ( 8) ⎦ add rows of A to columns of B = ⎛ 2 − 2 −1 + 16 ⎞ ⎜ ⎟ ⎝ 8 − 2 −4 + 16 ⎠ simplify = ⎛ 0 15 ⎞ ⎜ ⎟ ⎝ 6 12 ⎠ simplify answer Be sure that you understand that the number of columns of A must equal the number of rows of B in order for this to work. If they are not equal, then we end up multiplying a number by something that does not exist, and the process fails. In other words, the middle two indices must be the same. The order of the resultant matrix is given by the outer indices, as indicated below: Application of Matrix Multiplication One of the most important applications of this process is solving systems of linear equations. If we have a system of linear equations that looks like: a11x1 + a12x2 + a13x3 = b1 a21x1 + a22x2 + a23x3 = b2 a31x1 + a32x2 + a33x3 = b3 we can represent this system as: AX = B linear system of equations written as matrix multiplication where the matrix A represents all the coefficients of the unknowns, matrix X represents all the unknowns in the system, and B represents the constant terms. In this case, A is called the coefficient matrix of the system, and X and B are column matrices: 310 – Matrix Multiplication Matrices and Determinants ⎛⎜ a11 a12 a13 ⎞⎟ ⎛⎜ x1 ⎞⎟ ⎜ a21 a22 a23 ⎟ ⋅ ⎜ x2 ⎟ ⎜ a31 a32 a33 ⎟ ⎜ x3 ⎟ ⎝ ⎠⎝ ⎠ = x X A ⎛⎜ b1 ⎞⎟ ⎜ b2 ⎟ ⎜ b3 ⎟ ⎝ ⎠ = B Find matrices A, X, and B such that the following system of linear equations can be represented by the matrix equation AX = B. Solve the system of equations. x – 2y + 3z = –x + 3y – 9 z = –6 2x – 5y + 5z = 17 solution We represent the matrix A with the coefficients of all the unknowns in the system. We represent matrix X with the unknowns in the system and matrix B is the constant terms: ⎛ 1 −2 3 ⎞ ⎛ x ⎞ ⎜ −1 3 −1 ⎟ ⋅ ⎜ y ⎟ ⎜ ⎟⎜ ⎟ ⎝ 2 −5 5 ⎠ ⎝ z ⎠ = ⎛9⎞ ⎜ −6 ⎟ ⎜ ⎟ ⎝ 17 ⎠ representation of linear system with matrix equation Now in order to solve this system of equations, we form the augmented matrix as follows: ⎛ 1 −2 3 ⎜ −1 3 −1 ⎜ ⎝ 2 −5 5 9 ⎞ −6 ⎟ ⎟ 17 ⎠ augmented matrix formed from coefficient matrix and constant matrix We can now apply either Gaussian elimination with back-substitution or Gauss-Jordan elimination in order to solve this system. Let's start doing some elementary row operations to convert the matrix to row-echelon form. We will start by adding Row 2 to Row 1 and putting the result in Row 2: R1 + R2 ⎛ 1 −2 3 ⎜0 1 2 ⎜ ⎝ 2 −5 5 R3 – 2R1 ⎛ 1 −2 3 ⎜0 1 2 ⎜ ⎝ 0 −1 −1 R3 + R2 ⎛ 1 −2 3 ⎜0 1 2 ⎜ ⎝0 0 1 ⎞ ⎟ ⎟ 17 ⎠ 9 3 ⎞ ⎟ ⎟ −1 ⎠ 9 3 9⎞ 3⎟ ⎟ 2⎠ add first row to second row and put result in second row subtract 2 times first row from third row and put result in third row add third row to second row and put result in third row Now we have transformed the augmented matrix into row-echelon form. At this point, we can either proceed with back-substitution or continue performing row operations until we obtain a matrix in reduced row-echelon form. Let's continue with the latter: Matrices and Determinants Matrix Multiplication – 311 R2 – 2R3 R1 + 2R2 R1 – 3R3 ⎛ 1 −2 3 ⎜0 1 0 ⎜ ⎝0 0 1 9 ⎞ −1 ⎟ ⎟ 2 ⎠ subtract 2 times the third row from the second row and put the result in the third row ⎛1 0 3 ⎜0 1 0 ⎜ ⎝0 0 1 ⎞ −1 ⎟ ⎟ 2 ⎠ add 2 times the second row to the first row and put the result in the first row ⎛1 0 0 ⎜0 1 0 ⎜ ⎝0 0 1 ⎞ −1 ⎟ ⎟ 2 ⎠ subtract 3 times the third row from the first row and put the result in the first row 7 1 Now we can simply read our solution from the reduced row-echelon form of the augmented matrix: x = 1 and y = –1 and z = 2 answer We can double check our solutions using matrix multiplication: ⎛ 1 −2 3 ⎞ ⎛ x ⎞ ⎜ −1 3 −1 ⎟ ⋅ ⎜ y ⎟ ⎜ ⎟⎜ ⎟ ⎝ 2 −5 5 ⎠ ⎝ z ⎠ = ⎛ 1 −2 3 ⎞ ⎛ 1 ⎞ ⎜ −1 3 −1 ⎟ ⋅ ⎜ −1 ⎟ ⎜ ⎟⎜ ⎟ ⎝ 2 −5 5 ⎠ ⎝ 2 ⎠ substitute in x = 1, y = –1, and z = 2 into our matrix equation = ⎡ 1( 1) + ( −2) ( −1) + ( 3) ( 2) ⎤ ⎢ ( −1) ( 1) + ( 3) ( −1) + ( −1) ( 2) ⎥ ⎢ ⎥ ⎣ ( 2) ( 1) + ( −5) ( −1) + ( 5) ( 2) ⎦ matrix multiplication = ⎛ 1+ 2+ 6 ⎞ ⎜ −1 − 3 − 2 ⎟ ⎜ ⎟ ⎝ 2 + 5 + 10 ⎠ simplify = ⎛9⎞ ⎜ −6 ⎟ ⎜ ⎟ ⎝ 17 ⎠ simplify check The product of A and X is indeed B, as we were given at the beginning of this problem, so we can conclude that we have found the correct solution to the system of linear equations. 312 – Matrix Multiplication Matrices and Determinants Advanced To identify properties of matrix operations. zero matrix – a matrix consisting entirely of zeros. additive identity – a matrix O such that A + O = A, provided the matrices O and A have the same dimensions. Indian mathematics originated in northern India around 600 B.C. The brahmin, or priestly caste, were the bearers of religious traditions that grew into Hinduism. Lower castes were considered unworthy of learning, so the brahmin passed on their traditions to each other in oral form, as opposed to written form. To aid in memorization of important ideas, many works were put into poetic stanzas. Such works included the basics of mathematics and astronomy, along with literature and religion. Matrices → Matrix Operations → Properties of Matrix Operations Properties of Matrix Addition and Scalar Multiplication Matrix addition obeys many of the same properties obeyed by addition of real numbers. Similarly, scalar multiplication of a matrix also obeys properties obeyed by scalar multiplication of real numbers. Properties of Matrix Addition and Scalar Multiplication Let A, B, and C be m x n matrices and let c and d be scalars. 1.) A+B=B+A Commutative Property of Matrix Addition 2.) A + (B + C) = (A + B) + C Associative Property of Matrix Addition 3.) (cd)A = c(dA) Associative Property of Scalar Multiplication 4.) 1A = A Scalar Identity 5.) c(A + B) = cA + cB Distributive Property 6.) (c + d)A = cA + dA Distributive Property Remember that we can only add two or more matrices together if they are of the same order. Note also that if A and B are of the same order, then A – B represents the sum of A and (–1)B. That is, A–B = A + (–1)B subtraction of two matrices One very important property of real numbers is that we can add the number 0 to any real number and we obtain the number we started with as a result. In other words, zero is the additive identity, where c + 0 = c if c is any real number. A similar property holds true for matrices. If we have the m x n matrix A and we add another m x n matrix O that consists entirely of zeros (and is therefore called a zero matrix), then A + O = A. The matrix O is the additive identity for the set of all m x n matrices. For example, the following matrix is the additive identity for the set of all 2 x 3 matrices: O = ⎛0 0 0⎞ ⎜ ⎟ ⎝0 0 0⎠ Matrices and Determinants zero 2 x 3 matrix Properties of Matrix Operations – 313 We can use the properties of matrix addition and scalar multiplication in order to solve matrix equations. The process is fairly similar to that used to solve equations with real numbers. Compare the following methods of solution: Real Numbers (solve for x) x+a = b m x n Matrices (solve for X) X+A = B given equation X + A + (–A) = B + (–A) subtract same quantity from both sides b–a X+O = B–A simplify b–a X = B–A simplify x + a + (–a) = b + (–a) x+0 = x = Let's look at an example of how these properties can be used to help us solve a matrix equation. Solve the following equation for X given: A ⎛ −2 −1 ⎞ ⎜1 0⎟ ⎜ ⎟ ⎝ 3 −4 ⎠ = 2X + 3A = B and B = ⎛0 3⎞ ⎜2 0⎟ ⎜ ⎟ ⎝ −4 −1 ⎠ given equation solution Before we start substituting in matrices, we want to use properties of matrix addition and scalar multiplication to isolate the matrix X by itself, just as we normally isolate the variable x when working with real numbers. We begin by subtracting 3A from both sides of the equation. Then we multiply both sides of the equation by 1/2 (the same as dividing both sides by 2) to get the matrix X by itself. We know that the matrix X will be a 3 x 2 matrix because both A and B are also 3 x 2 matrices. 2X + 3A = B given equation 2X = B – 3A subtract 3A from both sides X = 1 2 ( B − 3 A) multiply both sides by 1/2 At this point, we can now replace A and B with their rectangular array forms: 314 – Properties of Matrix Opeations Matrices and Determinants X ⎡⎛ 0 3 ⎞ ⎛ −2 −1 ⎞ ⎤ ⎟ 2 0 − 3⋅ ⎜ 1 0 ⎟ ⎥ ⎟ ⎜ ⎟⎥ 2⎢⎜ ⎣ ⎝ −4 −1 ⎠ ⎝ 3 −4 ⎠ ⎦ replace A and B with rectangular array forms ⎡⎛ 0 3 ⎞ ⎛ −6 −3 ⎞⎤ 2 0 ⎟ − ⎜ 3 0 ⎟⎥ ⎟ ⎜ ⎟⎥ 2 ⎢⎜ ⎣⎝ −4 −1 ⎠ ⎝ 9 −12 ⎠⎦ multiply second matrix by scalar multiple 3 = 1⎢⎜ = 1 ⎢⎜ = 1⎜ ⎛ 6 X = ⎞ −1 0 ⎟ ⎜ ⎟ 2 ⎝ −13 11 ⎠ subtract matrices ⎛ 3 ⎜ ⎜ −1 ⎜ 2 ⎜ 13 ⎜− ⎝ 2 multiply by scalar multiple 1/2 answer 6 ⎞ ⎟ 0 ⎟ ⎟ 11 ⎟ ⎟ 2 ⎠ 3 Properties of Matrix Multiplication Matrix multiplication obeys rules of its own, although these properties are similar in nature to those obeyed by multiplication of real numbers. However, we must always be aware that matrix multiplication only works if our matrices have the right orders. That is, if we have two matrices A and B, and we want the product AB, then the number of columns of A must be equal to the number of rows of B, otherwise we cannot multiply them together. For more details on how matrix multiplication actually works, we refer you to the Matrix Multiplication section of the Algebra Brain. Properties of Matrix Multiplication Let A, B, and C be matrices and let c be a scalar. Note: 1.) A(BC) = (AB)C Associative Property of Matrix Multiplication 2.) A(B + C) = AB + AC Distributive Property 3.) (A + B)C = AC + BC Distributive Property 4.) c(AB) = (cA)B = A(cB) There is no Commutative Property of Matrix Multiplication. That is, AB ≠ BA, except in certain rare instances. Matrices and Determinants Properties of Matrix Operations – 315 316 – Properties of Matrix Opeations Matrices and Determinants Advanced To represent a system of linear equations with a matrix. augmented matrix – a matrix derived from a system of linear equations, where the last column of the matrix represents the constant terms of the system. coefficient matrix – the matrix derived from the coefficients of the variables in a linear system. It does not include any of the constant terms of the system. The Chinese mathematical work Jiu zhang suan shu (Nine Chapters on the Mathematical Art), compiled around 200 B.C., contains two methods of solving systems of linear equations. The first method is known as the method of surplus and deficiency. It is effectively a method for solving two equations with two unknowns. The second method is virtually identical to Gaussian elimination and is even presented in matrix form. Matrices → Linear Systems in a Matrix A matrix can be derived from a system of linear equations (each written in standard form with the constant term on the right of the equals sign). This matrix is called the augmented matrix of the linear system. The matrix derived from just the coefficients of the linear system is called the coefficient matrix. System of Linear Equations x – 4y + 3z = 5 –x + 3y – z = –3 2x – 4z = 6 Augmented Matrix 5 ⎞ ⎛ 1 −4 3 ⎜ −1 3 −1 ⎜ ⎝ 2 0 −4 Note: −3 ⎟ 6 ⎟ ⎠ The last column of the matrix corresponds to the right hand side of the linear system. Coefficient Matrix ⎛ 1 −4 3 ⎞ ⎜ −1 3 −1 ⎟ ⎜ ⎟ ⎝ 2 0 −4 ⎠ This matrix just contains the coefficients of the linear system. Note that a 0 is placed for the missing y-variable in the third equation. Whenever forming the coefficient or the augmented matrix, begin by vertically aligning the variables in the equation and then using zeros for the missing variables: Matrices and Determinants Linear Systems in a Matrix – 317 System Line up variables x + 3y = 9 –y + 4z = –2 x – 5z = 0 318 – Linear Systems in a Matrix Form Augmented Matrix x + 3y = 9 – y + 4z = –2 x – 5z = 0 ⎛1 3 0 ⎜ 0 −1 4 ⎜ ⎝ 1 0 −5 9 ⎞ 0 ⎟ ⎠ −2 ⎟ Matrices and Determinants Advanced To solve linear systems of equations by applying elementary row operations to the matrix representation of a given linear system. elementary row operations – interchanging, multiplying, and adding rows of a matrix to find solutions for each of the variables in the linear system represented by the matrix. row-equivalent – two matrices are said to be row-equivalent when one matrix can be obtained from the other matrix by applying elementary row operations. row-echelon form – a matrix that has a stair-step pattern with leading coefficients of 1 along the main diagonal and zeros below the main diagonal. leading 1 – the first non-zero entry of a row that does not consist entirely of zeros. reduced row-echelon form – an augmented matrix that has 1's along the main diagonal of the coefficient matrix and zeros in every entry above and below the main diagonal of the coefficient matrix. Although Arthur Cayley is credited with developing the algebra of matrices, the word "matrix" was actually coined in 1850 by James Joseph Sylvester, a friend of Cayley's and a fellow mathematician. Matrices → Linear Systems in a Matrix → Elementary Row Operations When you attempt to solve a system of linear equations, you often have to transform one or more equations into an equivalent equation in order to eliminate one or more variables. This results in an equivalent system of linear equations. One way to transform a linear equation is to multiply the entire equation by some real number. Another way is to interchange two equations. Finally, we can add a multiple of one equation to another equation. We can adopt these three procedures to work with matrices. In particular, we can represent a system of linear equations in a matrix. Then we apply the three operations given above to find solutions for each of the variables in the linear system. We call these basic procedures elementary row operations. Elementary Row Operations There are three basic row operations that may be performed on systems of linear equations: 1. Interchange two rows. 2. Multiply a row by a nonzero constant. 3. Add a multiple of one row to another row. These same operations may be performed on linear systems that are represented as an augmented matrix. Two matrices are row-equivalent if one can be obtained from the other by a series of elementary row operations. Elementary row operations are simple to perform, but they can require considerable amounts of pure arithmetic. It is important to understand what is transpiring in each step. Therefore, it is helpful to indicate what is happening to each row of the matrix. In the examples of the elementary row operations given below, we use R to designate a Row and a subscript (1, 2, etc.) to indicate one particular Row. The operation "3R1 →", for Matrices and Determinants Elementary Row Operations – 319 instance, tells us that we multiply each element in the first row by 3. The arrow is pointing to the first row, so we know that we are supposed to place the results of our multiplication in the first row. a.) Interchange two rows: Original Matrix Operation New Row Equivalent Matrix ⎛ 4 3 −1 5 ⎞ ⎛0 3 1 0⎞ ⎜ 4 3 −1 5 ⎟ ⎜ ⎟ ⎝1 8 3 2⎠ ⎜0 3 1 0⎟ ⎜ ⎟ ⎝1 8 3 2⎠ b.) Multiply the first row by 3: Original Matrix ⎛ 4 3 −1 5 ⎞ ⎜0 3 1 0⎟ ⎜ ⎟ ⎝1 8 3 2⎠ c.) Operation 3R1 → New Row Equivalent Matrix ⎛ 12 9 −3 15 ⎞ ⎜0 3 1 0⎟ ⎜ ⎟ ⎝1 8 3 2⎠ Add –2 times the third row to the second row: Original Matrix Operation ⎛ 4 3 −1 5 ⎞ ⎜0 3 1 0⎟ ⎜ ⎟ ⎝1 8 3 2⎠ –2R3 + R2 → New Row Equivalent Matrix ⎛ 4 3 −1 5 ⎞ ⎜ −2 −13 −5 −4 ⎟ ⎜ ⎟ ⎝1 8 3 2 ⎠ Here we are multiplying each element in the second row by –2 and adding to the result the corresponding element in Row 3. The final result is then placed into its corresponding position in Row 2. Note: The operation is written next to the row that is being changed. Although we have given very general examples of elementary row operations, when you begin to study Gaussian elimination, you will see that this is an extremely powerful tool for quickly solving systems of linear equations by hand. Consider the following example of solving a system of linear equations using elementary row operations: Solve the system of linear equations using elementary row operations Linear System x + 2y = 7 2x + y = 8 solution To demonstrate the parallel between elementary row operations and normal procedures on how to solve a linear system without a matrix, we are going to simultaneously solve the system both without a matrix and using an associated augmented matrix. 320 – Elementary Row Operations Matrices and Determinants Linear System x + 2y = 7 2x + y = 8 Associated Augmented Matrix ⎛1 2 7⎞ ⎜ ⎝2 1 ⎟ 8⎠ Linear System multiply the first equation by 2 2x + 4y = 14 2x + Associated Augmented Matrix multiply the first row by 2 2R1 2 4 14 ⎛ ⎜ ⎝2 1 y = 8 ⎞ ⎟ 8 ⎠ Linear System subtract the second equation from the first equation 2x + 4y = 14 Associated Augmented Matrix subtract Row 2 from Row 1 and put the result in Row 2 0x + 3y = 6 R2 – R1→ ⎛2 4 ⎜ ⎝0 3 14 ⎞ 6 ⎟ ⎠ Linear System divide the second equation by 3 to solve for y 2x + 4y = 14 Associated Augmented Matrix divide Row 2 by 3 and put the result in Row 2 0x + 1 y = 2 Linear System divide the first equation by 2 x + 2y = 7 0x + y = 2 /3R2 → ⎛2 4 ⎜ ⎝0 1 14 ⎞ 2 ⎟ ⎠ Associated Augmented Matrix divide Row 1 by 2 and put the result in Row 1 1 /2R1→ 1 2 7 ⎛ ⎜ ⎝0 1 ⎞ ⎟ 2⎠ At this point, we can back-substitute in y = 2 into the first equation and then solve the first equation for x to find out that x = 3. Therefore, the solution to this system of linear equations is: x = 3 and y = 2 answer The last step in the example above illustrates one reason why we use elementary row operations. Notice that in order to find the solution for y, all we had to do was read across the bottom row because there is only a 0 before the y. This last matrix is said to be in row-echelon form. The term echelon refers to the stair-step pattern formed by non-zero elements in the matrix. In this form, a matrix must have the following properties: Row-Echelon Form and Reduced Row-Echelon Form A matrix in row-echelon form has the following properties: 1. All rows consisting entirely of zeros occur at the bottom of the matrix. 2. For each row that does not consist entirely of zeros, the first non-zero entry is a 1 (called a leading 1). 3. For two consecutive (nonzero) rows, the leading 1 in the higher row is farther to the left than the leading 1 in the lower row. A matrix in row-echelon form is in reduced row-echelon form if every column that has a leading 1 has zeros in every position above and below its leading 1. Matrices and Determinants Elementary Row Operations – 321 Words fail to adequately describe just what row-echelon form is all about, so let's take a look at a few examples: The following matrices are in row-echelon form: a.) ⎛1 0 0 0⎞ ⎜0 1 1 5⎟ ⎜ ⎟ ⎝0 0 1 7⎠ b.) c.) ⎛1 0 2 1 1 ⎞ ⎜0 0 1 4 7 ⎟ ⎜ ⎟ ⎝ 0 0 0 1 −9 ⎠ d.) ⎛⎜ 1 ⎜0 ⎜0 ⎜0 ⎝ 0 0 9⎞ ⎟ 1 0 1⎟ 0 1 5⎟ ⎟ 0 0 0⎠ The matrix in (d) also happens to be in reduced row-echelon form. The following matrices are not in rowechelon form (reduced or otherwise). e.) ⎛ 1 4 3 12 ⎞ ⎜0 2 8 6 ⎟ ⎜ ⎟ ⎝ 0 0 1 −5 ⎠ f.) ⎛ 1 2 −1 2 ⎞ ⎜0 0 0 0 ⎟ ⎜ ⎟ ⎝ 0 1 2 −4 ⎠ We can easily transform (e) into row-echelon form by multiplying R2 by 1/2. We can transform (f) into rowechelon form by interchanging R2 and R3. 322 – Elementary Row Operations Matrices and Determinants Advanced To solve systems of linear equations using Gaussian elimination. row-echelon form (matrices) – a matrix that has a stair-step pattern with 1's along the main diagonal, and all zeros below the main diagonal. inconsistent – a system of linear equations that has no solution. In 1801, Karl Friedrich Gauss (1777-1855) rediscovered the "lost" asteroid Ceres. Ceres was originally discovered by Giuseppe Piazzi of Palermo, Italy earlier in the year. Unfortunately, he was unable to keep track of the strange object in the sky and so Ceres eventually disappeared. He believed that he had found a comet, but Gauss examined Piazzi's calculations and this examination allowed another astronomer, Baron Francis Xavier von Zach, to rediscover Ceres at the end of the year. Shortly thereafter, a fair number of asteroids were subsequently discovered. Now we know of thousands, if not millions, of asteroids in orbit between the planets Jupiter and Mars. Matrices → Linear Systems in a Matrix → Elementary Row Operations → Gaussian Elimination - Matrices In the previous thought, we looked at transforming matrices by manipulating the rows of matrices. Now we are going to examine in detail just why we do this. Recall that we can write a system of linear equations in an augmented matrix. We use a process called Gaussian Elimination with Back-Substitution to transform the matrix into a form where we can begin solving the system of linear equations. In effect, we reduce the matrix to row-echelon form. If the system of linear equations does have a solution set, then the last row of the matrix will contain only two nonzero elements. The next to last element in the row represents the final variable in the linear system. The last element in the row represents the value of that variable. It is much easier to see the process than it is to explain it. First we will give you the general rules for Gaussian Elimination with Back-Substitution. Then we will demonstrate this process on a system of linear equations. Gaussian Elimination with Back-Substitution 1.) Write the augmented matrix of the system of linear equations 2.) Use elementary row operations to rewrite the augmented matrix in row-echelon form. 3.) Write the system of linear equations corresponding to the matrix in row-echelon form and use back-substitution to find the solution set. Now that we have given the rules for the procedure, it is time to get our hands dirty and work an example. When applying this procedure, it is important to keep track of the order in which the operations are being performed. In general, it is best to start with the first column, using elementary row operations to obtain zeros below the leading 1. Then we move on to the second column, again trying to obtain zeros below the leading 1, and so on. Matrices and Determinants Gaussian Elimination – 323 Use Gaussian elimination with back-substitution to solve the following system of linear equations. x 3x + – 3z = –2 y – 2z = 2x + 2y + z = 5 4 solution According to the rules for Gaussian elimination, we first need to represent this with a matrix. We use the coefficients of the variables as the elements in our matrix. The last column of the matrix represents the constant terms on the right hand side of the equations. We end up with the following matrix representation of our linear system: ⎛ 1 0 −3 ⎜ 3 1 −2 ⎜ ⎝2 2 1 −2 ⎞ ⎟ ⎟ 4 ⎠ 5 matrix representation of our linear system First of all, notice that the top row has a "1" as our first element. That suits us just fine, so we are not going to mess with this row. We will start by transforming the "3" in the second row into a "0". To do that, we subtract R2 from 3R1 and put the result in the second row: 3R1 – R2 ⎛ 1 0 −3 ⎜ 0 −1 −7 ⎜ ⎝2 2 1 −2 ⎞ −11 ⎟ ⎟ 4 ⎠ subtract the second row from 3 times the first row Before we go any further, we should remind you TO BE EXTREMELY CAREFUL ABOUT YOUR SIGNS! Sign errors are all too frequent when working with elementary row operations, so it is a very, very good idea to double check each step. This is one reason why we like to use computers to solve this sort of problem. Okay, back to the problem. Our next step is to get a "0" as the first element in R3. We can do that if we subtract R3 from 2R1 and put the result in the third row: 2R1 – R3 ⎛ 1 0 −3 ⎜ 0 −1 −7 ⎜ ⎝ 0 −2 −7 −2 ⎞ −11 ⎟ ⎟ −8 ⎠ subtract the third row from 2 times the first row Our next step is to somehow transform the "–2" in the last row into another "0". We do that by subtracting R3 from 2R2 and putting the result in the last row. Again, we need to be VERY careful about our signs. 2R2 – R3 ⎛ 1 0 −3 ⎜ 0 −1 −7 ⎜ ⎝ 0 0 −7 −2 ⎞ −11 ⎟ ⎟ −14 ⎠ subtract the third row from 2 times the second row In order to get the third element in row 3 to be a "1", we multiply R3 by –1/7: ⎛ 1 0 −3 ⎜ 0 −1 −7 ⎜ ⎝0 0 1 324 – Gaussian Elimination −2 ⎞ −11 ⎟ ⎟ 2 ⎠ multiply the third row by –1/7 Matrices and Determinants Although at first glance, it may seem like we have reduced the matrix to row-echelon form, we are not quite done yet. Remember that the diagonal of a matrix in row-echelon form consists only of "1's". We have a "–1" in the second row. Getting rid of it is simply a matter of multiplying R2 by –1: ⎛ 1 0 −3 ⎜0 1 7 ⎜ ⎝0 0 1 –1R2 −2 ⎞ 11 ⎟ ⎟ 2 ⎠ multiply the second row by –1 Now that we have converted our matrix into row-echelon form, all we need to do is look at the third row to determine that: z = 2 third linear equation If we rewrite our matrix in row echelon form as a system of linear equations, we have: x – 3z = –2 y + 7z = 11 z = 2 Using back-substitution, we can easily determine that: x – 3z = –2 first linear equation x – 3(2) = –2 substitute in z = 2 x–6 = –2 simplify x = 4 solve for x y + 7z = 11 second linear equation y + 7(2) = 11 substitute in z = 2 y + 14 = 11 simplify y = –3 solve for y and that Therefore, our solution set is: x = 4 and y = –3 and z = 2 answer If we had obtained a row that was zeros except for the last entry, then that would mean our system of equations has no solution. At that point, we do not need to continue any further. Such a system is called inconsistent. Matrices and Determinants Gaussian Elimination – 325 326 – Gaussian Elimination Matrices and Determinants Advanced To solve systems of linear equations using Gauss-Jordan elimination. reduced row-echelon form – a matrix which has 1's (or zeros) along the main diagonal and zeros above and below the main diagonal. Karl Friedrich Gauss (1777-1855) was extremely fond of newspapers and magazines, still a novelty item in the early 19th century. Students at the Unversity of Göttingen in Germany referred to Gauss as the "newspaper tiger" for his habit of staring down anyone who dared to take a newspaper he wanted first. Matrices → Linear Systems in a Matrix → Elementary Row Operations → Gaussian Elimination – Matrices → Gauss-Jordan Elimination In the previous thought, we used Gaussian elimination with back-substitution to transform a matrix representing a system of linear equations into a matrix in row-echelon form. Now we will discuss a second method of solving systems of linear equations that is virtually identical to Gaussian elimination. The difference is that we are now going to transform a matrix into reduced row-echelon form. Recall that a matrix in reduced rowechelon form has every column with a leading one also having a zero above and below the leading one. This is really just an extension of Gaussian elimination. We simply keep applying elementary row operations until we arrive at a matrix in reduced row-echelon form. Then the solutions of the system of linear equations are given explicitly without resorting to back-substitution. Let's use the example from the previous thought to illustrate this concept. In the previous thought, we started with the following system of linear equations: x 3x + – 3z = –2 y – 2z = 2x + 2y + z = 5 4 After apply elementary row operations, we transformed the matrix representing this system into: ⎛ 1 0 −3 ⎜0 1 7 ⎜ ⎝0 0 1 −2 ⎞ 11 ⎟ ⎟ 2 ⎠ row-echelon matrix representing system of linear equations In that example, we used back-substitution to find the solution to the system. Now use Gauss-Jordan elimination to find the solution. solution Remember that we want the coefficient part of the augmented matrix to consist of only "1's" along the diagonal and zeros everywhere else. In order to do that, we are going to continue performing elementary row operations. We will start be getting rid of the "–3" in the top row. We can do that by adding 3R3 to R1 and placing the result in R1: Matrices and Determinants Gauss-Jordan Elimination – 327 R1 + 3R3 ⎛1 0 0 ⎜0 1 7 ⎜ ⎝0 0 1 4 ⎞ 11 ⎟ ⎟ 2 ⎠ add 3 times the third row to the first row That takes care of the first row. Now we need to turn that pesky "7" in the second row into a "0". Again, we will make use of the third row, subtracting 7R3 from R2 and placing the result in the second row: ⎛1 0 0 ⎜0 1 0 ⎜ ⎝0 0 1 R2 – 7R3 ⎞ −3 ⎟ ⎟ 2 ⎠ 4 subtract 7 times the third row from the second row At this point, in order to find the solution to the system of linear equations, all we have to do is read it directly from the matrix. If we convert the matrix back to a system of linear equations, we get: x = 4 and y = –3 and z = 2 answer Compare the solutions found by this method to normal Gaussian elimination with back-substitution. Although we tend to use integers in our elementary row operations, there is no reason why we cannot use fractions if we need to. Fractions tend to be rather messy, so we usually try to find a way to avoid them by judiciously choosing elementary row operations. 328 – Gauss-Jordan Elimination Matrices and Determinants Advanced To define the inverse of a square matrix. invertible – a matrix with an inverse. nonsingular – a matrix with an inverse. This is synonymous with invertible. singular – a matrix without an inverse. Edmond Halley (1656-1742), after whom Halley's Comet is named, was a noted English astronomer and geophysicist. He theorized that comets travel in elliptical paths and are therefore subject to Kepler's Three Laws of Planetary Motion. He was absolutely correct in this and proved it by predicting that the comet of 1682 would once again be visible in 1758. It was this comet, which did indeed return close to Halley's prediction, that is now known as Halley's Comet. Matrices → Inverse of a Square Matrix Before we talk about the inverse of a matrix, let's first review the inverse of a real number. Consider the equation ax = b. In order to solve this equation for x, we divide both sides by a, provided that a ≠ 0: ax = x = b given equation b divide both sides by a, if a ≠ 0 a However, we can rewrite the division by a as multiplying b by the quantity a–1. In other words, we actually multiply both sides by the quantity a–1: ax = b given equation (a–1a)x = a–1b multiply both sides by "a–1" (read "a inverse") (1)x = a–1b recall that a(a–1) = a/a = 1 x = a–1b The number a–1 is called the multiplicative inverse of a because a–1a = 1. The definition of the multiplicative inverse of a matrix is similar: Definition of the Inverse of a Square Matrix Let A be an n x n matrix. If there exists matrix A–1 such that AA–1 = In = A–1A then A–1 is called the inverse of A. Note: Recall that it is not always true that AB = BA, even if both products are defined. However, if A and B are both square matrices and AB = In, then it can be shown that BA = In and hence B would be the inverse of A. Matrices and Determinants Inverse of a Square Matrix – 329 Show that B is the inverse of A. A ⎛2 1⎞ ⎜ ⎟ ⎝5 3⎠ = and B = ⎛ 3 −1 ⎞ ⎜ ⎟ ⎝ −5 2 ⎠ solution In order to demonstrate that these two matrices are inverses of each other, we multiply them together. First we take the product AB. Then we take the product BA. If we obtain I2 both times, then we know that B is indeed the inverse of A. AB = ⎛ 2 1 ⎞ ⎛ 3 −1 ⎞ ⎜ ⎟ ⋅⎜ ⎟ ⎝ 5 3 ⎠ ⎝ −5 2 ⎠ multiply matrices = ⎡ 2( 3) + 1( −5) 2( −1) + 1( 2) ⎤ ⎢ ⎥ ⎣ 5( 3) + 3( −5) 5( −1) + 3( 2) ⎦ multiply rows of A by columns of B = ⎛ 6 − 5 −2 + ⎜ ⎝ 15 − 15 −5 + simplify = ⎛1 0⎞ ⎜ ⎟ ⎝0 1⎠ 2⎞ ⎟ 6⎠ simplify Now we find the product BA: BA = ⎛ 3 −1 ⎞ ⎛ 2 1 ⎞ ⎜ ⎟ ⋅⎜ ⎟ ⎝ −5 2 ⎠ ⎝ 5 3 ⎠ multiply matrices = ⎡ 3( 2) + ( −1) ( 5) 3( 1) + ( −1) ( 3) ⎤ ⎢ ⎥ ⎣ −5( 2) + 2( 5) −5( 1) + 2( 3) ⎦ multiply rows of B by columns of A = ⎛ 6−5 3−3 ⎞ ⎜ ⎟ ⎝ −10 + 10 −5 + 6 ⎠ simplify = ⎛1 0⎞ ⎜ ⎟ ⎝0 1⎠ simplify Since AB = In = BA, we can conclude that B is indeed the matrix inverse of A. answer If a matrix A does have an inverse, then it is called invertible (or nonsingular). If it does not have an inverse, then it is called singular. A nonsquare matrix cannot have an inverse. To see why, consider the situation where matrix A is of order m x n and where matrix B is of order n x m, where m ≠ n. The product AB will then be of order m x m and the product BA is of order n x n. Since they are of different orders, there is no way they can produce the same product Im (or n). Most of the time we are not given the inverse of a matrix, so we have to apply some sort of technique to find the inverse. One way to do this is by a matrix equation. 330 – Inverse of a Square Matrix Matrices and Determinants Find the inverse of A = ⎛1 2⎞ ⎜ ⎟ ⎝3 7⎠ solution In order to find the inverse of A, we will attempt to solve the matrix equation AX = I. If we can find X such that AX = I, we know that X must be the inverse of A. We can check our result using matrix multiplication. First, we need to set up our matrix equation, using the rectangular array form of the matrices: AX = I original matrix equation ⎛ 1 2 ⎞ ⎛ x11 x12 ⎞ ⎜ ⎟ ⋅⎜ ⎟ ⎝ 3 7 ⎠ ⎝ x21 x22 ⎠ = ⎛1 0⎞ ⎜ ⎟ ⎝0 1⎠ rectangular array form of matrices ⎛ 1 x11 + 2 x21 1 x12 + 2 x22 ⎞ ⎜ ⎟ ⎝ 3 x11 + 7 x21 3 x12 + 7 x22 ⎠ = ⎛1 0⎞ ⎜ ⎟ ⎝0 1⎠ matrix multiplication Now we can equate corresponding elements in the matrices. This gives us the following system of linear equations: x11 + 2x21 = 1 and x12 + 2x22 = 0 3x11 + 7x21 = 0 and 3x12 + 7x22 = 1 corresponding system of linear equations Since we have four equations with four unknowns, we can easily apply standard back-substitution techniques to solve this system. From the first system, we have: x11 = 7 x21 = –3 and From the second system, we find through Gaussian elimination that: x12 = –2 x22 = 1 and Therefore, our inverse matrix X is: X = ⎛ 7 −2 ⎞ ⎜ ⎟ ⎝ −3 1 ⎠ answer If we want to check our answer, then we find the products AX and XA. If they both equal the product matrix I2, then we can conclude that X = A–1. Matrices and Determinants Inverse of a Square Matrix – 331 AX XA = ⎛ 1 2 ⎞ ⋅ ⎛ 7 −2 ⎞ ⎜ ⎟⎜ ⎟ ⎝ 3 7 ⎠ ⎝ −3 1 ⎠ = ⎡ 1( 7) + 2( −3) 1( −2) + 2( 1) ⎤ ⎢ ⎥ ⎣ 3( 7) + 7( −3) 3( −2) + 7( 1) ⎦ = ⎛1 0⎞ ⎜ ⎟ ⎝0 1⎠ = ⎛ 7 −2 ⎞ ⎛ 1 2 ⎞ ⎜ ⎟ ⋅⎜ ⎟ ⎝ −3 1 ⎠ ⎝ 3 7 ⎠ = ⎡ 7( 1) + ( −2) ( 3) 7( 2) + ( −2) ( 7) ⎤ ⎢ ⎥ ⎣ −3( 1) + 1( 3) −3( 2) + 1( 7) ⎦ = ⎛1 0⎞ ⎜ ⎟ ⎝0 1⎠ check check Any system of linear equations can have one solution, no solution, or infinitely many solutions. If the coefficient matrix A of a square system (i.e. a system that has the same number of equations as unknowns) is invertible, then the system has a unique solution, defined as follows: A System of Linear Equations with a Unique Solution If A is an invertible matrix, the system of linear equations represented by the matrix equation AX = B has a unique solution given by: X = A–1B 332 – Inverse of a Square Matrix solution of matrix equation AX = B Matrices and Determinants Advanced To find the inverse of a square matrix. adjoining – creating a larger matrix out of two smaller matrices. Edmond Halley (1656-1742) did more than watch the skies for comets. He calculated the focal length of certain optical lenses, the trajectory of artillery shells, and prepared actuarial tables for use by insurance companies to determine life and annuity premiums and values. This actuarial work is an early form of social statistics. Halley also published papers on trade winds, monsoons, tides, and terrestrial magnetism. These papers lay the groundwork for scientific geophysics. Finally, he discovered that the stars are in motion relative to each other. Halley only studied the motion of three stars - Arcturus, Procyon, and Sirius - but theorized that smaller, fainter stars also move. This could not be proved until 150 years later, when more advanced instruments became available. Matrices → Inverse of a Square Matrix → Finding Matrix Inverse In the previous thought, we used the matrix equation AX = I to determine the inverse of A. Our result was two systems of linear equations. We can represent those systems using augmented matrices as follows: ⎛1 2 ⎜ ⎝3 7 1⎞ ⎟ 0⎠ and ⎛1 2 ⎜ ⎝3 7 0⎞ ⎟ 1⎠ augmented matrices representing systems of linear equations Note that these matrices both have the same coefficient matrix. Rather than solving these systems independently, we can solve them simultaneously by adjoining the identity matrix to the coefficient matrix, as shown below: ⎛1 2 ⎜ ⎝3 7 1 0⎞ ⎟ 0 1⎠ adjoin identity matrix to coefficient matrix Now, in order to solve both systems simultaneously, we apply Gauss-Jordan elimination on the adjoined matrices. The objective is to transform the coefficient matrix (the left-half) into an identity matrix. Then the right half of the adjoined matrix will be the solutions to the systems of linear equations. 3R1 – R2 –R2 R1 – 2R2 ⎛1 2 ⎜ ⎝3 7 1 0⎞ ⎛1 2 ⎜ ⎝ 0 −1 1 0⎞ ⎛1 2 ⎜ ⎝0 1 1 0⎞ ⎛1 0 ⎜ ⎝0 1 ⎟ given matrix 0 1⎠ ⎟ 3 1⎠ ⎟ −3 1 ⎠ 7 −2 ⎞ −3 1 ⎟ ⎠ subtract second row from 3 times row 1 multiply second row by –1 subtract 2 times second row from first row Thus, we go from the "doubly augmented" matrix [ A : I ] to the matrix [ I : A–1]. In other words: Matrices and Determinants Finding Matrix Inverse – 333 A–1 = ⎛ 7 −2 ⎞ ⎜ ⎟ ⎝ −3 1 ⎠ inverse of original matrix A We already demonstrated in the previous thought that this is indeed the inverse of the matrix A. Based upon this example, we can develop a procedure (or algorithm) to find the inverse of a matrix (if it has one). Finding an Inverse Matrix Let A be a square matrix of order n. 1.) Write the n x 2n matrix that consists of the given matrix A on the left and the n x n identity matrix I on the right to obtain [ A : I ]. Note that we separate the matrices A and I by a dotted line. We call this process adjoining the matrices A and I. 2.) If possible, row reduce A to I using elementary row operations on the entire matrix [ A : I ]. The result will be the matrix [ I : A–1 ]. If this is not possible, then A is not invertible. 3.) Check your work by multiplying to see that AA–1 = I = A–1A. Let's see another example of how this works. Find the inverse of the matrix, if it exists. ⎛1 1 1⎞ ⎜3 5 4⎟ ⎜ ⎟ ⎝3 6 5⎠ solution First, we setup a 3 x 3 matrix that consists of our given matrix above adjoined to a 3 x 3 identity matrix, as shown below: ⎛1 1 1 ⎜3 5 4 ⎜ ⎝3 6 5 1 0 0⎞ 0 1 0⎟ ⎟ 0 0 1⎠ adjoin given matrix with 3 x 3 identity matrix Next, we apply elementary row operations to transform the left hand side of the matrix into an identity matrix. Then the right hand side of the matrix will be miraculously transformed into the inverse matrix. Start be eliminating the "3's" in the second and third rows: R2 – 3R1→ R3 – 3R1→ 2R2 – R3→ 2R3 – 3R2→ R1 – R2 – R3→ 334 – Finding Matrix Inverse ⎛1 1 1 ⎜0 2 1 ⎜ ⎝0 3 2 1 0 0⎞ −3 1 0 ⎟ ⎟ −3 0 1 ⎠ ⎛1 1 1 ⎜0 1 0 ⎜ ⎝0 0 1 1 0 ⎞ −3 2 −1 ⎟ ⎟ 3 −3 2 ⎠ ⎛1 0 0 ⎜0 1 0 ⎜ ⎝0 0 1 1 1 −1 ⎞ 0 −3 2 −1 ⎟ ⎟ 3 −3 2 ⎠ subtract 3 times first row from second row subtract 3 times first row from third row subtract third row from 2 times second row subtract 3 times second row from 2 times third row subtract second and third rows from first row Matrices and Determinants Now that we have transformed the left hand side of the equation to an identity matrix, we can see that our inverse matrix is: −1 ⎛1 1 1⎞ ⎜3 5 4⎟ ⎜ ⎟ ⎝3 6 5⎠ = ⎛ 1 1 −1 ⎞ ⎜ −3 2 −1 ⎟ ⎜ ⎟ ⎝ 3 −3 2 ⎠ answer Naturally, we want to check to make sure that this is indeed the correct solution, so if we label our original matrix as A and our inverse matrix as A–1, we multiply them together and see if we obtain I3: –1 AA = ⎛ 1 1 1 ⎞ ⎛ 1 1 −1 ⎞ ⎜ 3 5 4 ⎟ ⋅ ⎜ −3 2 −1 ⎟ ⎜ ⎟⎜ ⎟ ⎝ 3 6 5 ⎠ ⎝ 3 −3 2 ⎠ multiply matrices = ⎡ [ 1⋅ ( 1) + 1⋅ ( −3) + 1⋅ ( 3) ] [ 1⋅ ( 1) + 1⋅ ( 2) + 1⋅ ( −3) ] [ 1⋅ ( −1) + 1⋅ ( −1) + 1⋅ ( 2) ] ⎤ ⎢ [ 3⋅ ( 1) + 5⋅ ( −3) + 4⋅ ( 3) ] [ 3⋅ ( 1) + 5⋅ ( 2) + 4⋅ ( −3) ] [ 3⋅ ( −1) + 5⋅ ( −1) + 4⋅ ( 2) ] ⎥ ⎢ ⎥ ⎣ [ 3⋅ ( 1) + 6⋅ ( −3) + 5⋅ ( 3) ] [ 3⋅ ( 1) + 6⋅ ( 2) + 5⋅ ( −3) ] [ 3⋅ ( −1) + 6⋅ ( −1) + 5⋅ ( 2) ] ⎦ multiply rows of A by columns of A–1 = ⎛ 1 − 3 + 3 1 + 2 − 3 −1 − 1 + 2 ⎞ ⎜ 3 − 15 + 12 3 + 10 − 12 −3 − 5 + 8 ⎟ ⎜ ⎟ ⎝ 3 − 18 + 15 3 + 12 − 15 −3 − 6 + 10 ⎠ simplify = ⎛1 0 0⎞ ⎜0 1 0⎟ ⎜ ⎟ ⎝0 0 1⎠ simplify Matrices and Determinants check Finding Matrix Inverse – 335 336 – Finding Matrix Inverse Matrices and Determinants Advanced To use a formula to find the determinant of a 2 x 2 matrix. determinant – associated with each square matrix is a real number called the determinant of the matrix. The number of elements in any row or column of the square matrix is called the order of the determinant. David Hilbert (1862-1943) was a German number theorist and the most famous mathematician of his day. In 1900, he posed a series of 23 questions to the mathematical community at a conference in Paris, France. His seventh question dealt with the irrationality of certain numbers, among which are the transcendental numbers e and π. Hilbert was not the first to prove these two numbers are transcendental, but his genius was such that he simplified the proofs considerably. Matrices → Inverse of a Square Matrix → Inverse of 2 x 2 Matrix In the previous thought, we looked at the definition of the inverse of a matrix. In a related thought, we learned how to find the inverse of a matrix of any order n x n. Now we are going to show a shortcut method for finding the inverse of a 2 x 2 matrix. Here we can apply a quick and convenient formula that allows us to determine not only the inverse of a matrix, but even if the matrix has an inverse. Basically, the formula works as follows. The Inverse of a 2 x 2 Matrix If A is a 2 x 2 matrix given by: A = 2 x 2 matrix then A is invertible (has an inverse) if and only if ad – bc ≠ 0. Moreover, if ad – bc ≠ 0, then the inverse is given by: A–1 = inverse of matrix A This only works with 2 x 2 matrices! Note: The denominator ad – bc is called the determinant of the 2 x 2 matrix A. You will study determinants and their applications in another section of the Algebra Brain. Let's try out this formula with a couple of examples. Find the inverse of the matrix, if it exists. A = ⎛2 4⎞ ⎜ ⎟ ⎝4 8⎠ Matrices and Determinants Inverse of 2 x 2 Matrix – 337 solution First, we check the determinant ad – bc. If it doesn't equal zero, then we can go ahead and compute the inverse for this matrix: ad – bc = (2)(8) – (4)(4) find determinant = 16 – 16 simplify = 0 simplify Since ad – bc = 0, we can conclude that this matrix does not have an inverse. answer Find the inverse of the matrix, if it exists. B ⎛ 3 7⎞ ⎜ ⎟ ⎝ −1 2 ⎠ = solution Again, we first find the determinant ad – bc. If it equals zero, then we do not need to continue any further with the problem. ad – bc = 3(2) – 7(–1) find determinant = 6+7 simplify = 13 simplify Since ad – bc ≠ 0, we can conclude that this matrix does have an inverse. Moreover, its inverse is given by: B–1 = ⎛ d −b ⎞ ⎟ ad − bc ⎝ −c a ⎠ inverse of 2 x 2 matrix formula = ⎛ 2 −7 ⎞ ⎟ 13 ⎝ 1 3 ⎠ substitute in ad – bc = 13 = ⎛ ⎜ ⎜ ⎜ ⎝ 1 1 ⋅⎜ ⋅⎜ −7 ⎞ 13 13 ⎟ ⎟ 1 3 ⎟ 13 13 ⎠ 2 scalar multiplication answer Let's just double check to make sure that we have found the inverse. If we multiply B by its inverse B–1, we should end up with the identity matrix I2. 338 – Inverse of 2 x 2 Matrix Matrices and Determinants BB–1 −7 = ⎛ ⎛ 3 7 ⎞ ⋅⎜ ⎜ ⎟⎜ ⎝ −1 2 ⎠ ⎜ ⎝ ⎞ ⎟ ⎟ 1 3 ⎟ 13 13 ⎠ = ⎡ 3⎛ 2 ⎞ + 7⎛ 1 ⎞ 3⎛ −7 ⎞ + 7⎛ 3 ⎞ ⎤ ⎢ ⎜⎝ 13 ⎟⎠ ⎜⎝ 13 ⎟⎠ ⎜⎝ 13 ⎟⎠ ⎜⎝ 13 ⎟⎠ ⎥ ⎢ ⎥ ⎢ −1⎛⎜ 2 ⎞⎟ + 2⎛⎜ 1 ⎞⎟ −1⎛⎜ −7 ⎞⎟ + 2⎛⎜ 3 ⎞⎟ ⎥ ⎣ ⎝ 13 ⎠ ⎝ 13 ⎠ ⎝ 13 ⎠ ⎝ 13 ⎠ ⎦ multiply rows of B by columns of B–1 = ⎛ 13 0 ⎞ ⎜ 13 ⎟ ⎜ ⎟ ⎜ 0 13 ⎟ 13 ⎠ ⎝ simplify = ⎛1 0⎞ ⎜ ⎟ ⎝0 1⎠ simplify check 2 13 13 matrix multiplication A similar process can be used to show that B–1B also equals I2. Therefore, B–1 is indeed the inverse of B. Matrices and Determinants Inverse of 2 x 2 Matrix – 339 340 – Inverse of 2 x 2 Matrix Matrices and Determinants Advanced To find the determinant of a square matrix. determinant – associated with each square matrix is a real number called the determinant of the matrix. The number of elements in any row or column of the square matrix is called the order of the determinant. expanding by cofactors – calculating the determinant of a matrix by taking the sum of the entries in any row and multiplying by the cofactors associated with that row. Seki Gowa (1642-1708) is often seen as the Isaac Newton of Japan. He brought Japanese problem-solving to the modern age, conquering many of the same problems Newton was working on in England. Seki Gowa and Newton pursued parallel lives. Whereas Newton was in charge of the Mint in England, Seki Gowa held a similar position in Japan. Newton was knighted while Seki Gowa became a samurai. Seki Gowa is credited with the invention of determinants at the same time that they were being introduced to Europe. His students referred to him as the "Arithmetical Sage", which is inscribed on his tomb in the Buddhist cemetery at Ushigome in Tokyo, Japan. Matrices → Determinants Every square matrix (i.e. one that has the same number of rows and columns) is associated with a real number called a determinant. Determinants have many uses and we will explore these under the thought labeled Applications of Determinants. Where are determinants and where do they come from? Consider the following system of linear equations: a1x + b1y = c1 a2x + b2y = c2 system of linear equations With a little bit of algebraic mumbo-jumbo, we can find the solution to this system of equations. It turns out that when we solve this system for x and y, we obtain the following solution set: x = c1 b 2 − c2 b 1 a1 b2 − a2 b1 and y a 1 c2 − a 2 c1 = a1 b2 − a2 b1 provided that a1b2 – a2b1 ≠ 0, of course. Note that we have the same denominator in both solutions. This denominator is called the determinant of the coefficient matrix of the linear system. Coefficient Matrix ⎛ a1 a2 ⎞ A = ⎜ ⎟ ⎝ b1 b2 ⎠ Determinant det (A) = a1b2 – a2b1 We can also denote the determinant of a matrix with vertical bars on either side of the matrix, as indicated in the definition below. This should not be confused with finding the absolute value of the matrix! Matrices and Determinants Determinants – 341 Definition of the Determinant of a 2 x 2 Matrix The determinant of the matrix A = is given by det (A) = |A| = = a1b2 – a2b1 determinant of 2 x 2 matrix A quick and easy method for remembering the formula for finding the determinant of a 2 x 2 matrix is to simply take the difference of the products of the diagonal terms, as indicated in the diagram below: This works very well for 2 x 2 matrices, but does not work at all for higher order matrices. In order to find the determinant for higher order matrices, we have to make use of minors and cofactors. Minors and Cofactors In the previous section we looked at a convenient method for finding the determinant of 2 x 2 matrices. Now we need a more general method for finding determinants that applies to all matrices. This process is not nearly so simple or trivial. The basic procedure is to break a 3 x 3 or larger matrix into bite-size chunks that we can work with. We will still need to refer to the formula for finding the determinant of a 2 x 2 matrix. Essentially, we find several of these smaller "mini-determinants". The total determinant of the matrix is then the sum of these mini-determinants. For higher order matrices, it is convenient to use cofactors and minors to define the determinant. Minors and Cofactors of a Square Matrix If A is a square matrix, the minor Mij of the entry aij is the determinant of the square matrix obtained by deleting the ith row and jth column of A. The cofactor Cij of the entry aij is given by Cij = (–1)i + 1Mij This is a somewhat confusing definition. Let's look at an example on how to find the minors and cofactors of a 3 x 3 matrix. 342 – Determinants Matrices and Determinants Find all the minors and the cofactors of the given matrix. A ⎛ 1 2 3⎞ ⎜ −4 5 8 ⎟ ⎜ ⎟ ⎝ −1 −2 1 ⎠ = solution To find the minor M11, delete the first row and first column of A and find the determinant of the remaining matrix: M11 = (5)(1) – (–2)(8) = 21 Similarly, to find M12, we delete the first row and second column of A and find the determinant of the remaining matrix: M12 = (–4)(1) – (–1)(8) = 4 We perform the process 7 more times. The final minor M33 is found by deleting the third row and third column of A and finding the determinant of the remaining matrix: M33 = (1)(5) – (–4)(2) = 13 Through this process of eliminating rows and columns and finding determinates, we will find the following minors of matrix A: M11 = –11 M12 = 4 M13 = 13 M21 = 8 M22 = 4 M23 = 0 M31 = 1 M32 = 20 M33 = 13 Now in order to find the cofactors, we multiply each of the minors above by (–1)i+j where i is the row and j is the column of the matrix A: C11 = (–1)2(–11) = 21 C12 = (–1)3(4) = –4 C13 = (–1)4(13) = 13 C21 = (–1)3(8) = –8 C22 = (–1)4(4) = 4 C23 = (–1)5(0) = 0 C31 = (–1)4(1) = 1 C32 = (–1)5(20) = –20 C33 = (–1)6(13) = 13 Note: In the sign pattern above, even positions (where i + j is even) keep the same sign as the minor, but odd positions (where i + j is odd) change the sign of the minor. Matrices and Determinants Determinants – 343 Determinant of a Square Matrix Once we have defined minors and cofactors, we can now define the determinant of an n x n square matrix. Determinant of a Square Matrix If A is a square matrix (of order 2 x 2 or greater), the determinant of A is the sum of the entries in any row (or column) of A multiplied by their respective cofactors. For instance, expanding along the first row yields |A| = a11C11 + a12C12 + a13C13 + . . . + a1nC1n Applying this definition to find a determinant is called expanding by cofactors. Find the determinant of the given matrix. A ⎛ 1 2 3⎞ ⎜ −4 5 8 ⎟ ⎜ ⎟ ⎝ −1 −2 1 ⎠ = solution First note that this is the exact same matrix that we used in the previous example. The cofactors of the elements in the first row of the matrix were: C11 = –11 and C12 = –4 and C13 = 13 By using the definition above to find the determinant, we expand the first row as follows: |A| = a11C11 + a12C12 + a13C13 first-row expansion = (1)(21) + (2)(–4) + (3)(13) substitute in numbers = 21 – 8 + 39 simplify = 52 simplify answer Although we choose to do a first row expansion, we could just as easily have down a second row expansion or even a third column expansion. If we did that, then we choose the relevant cofactors, as indicated below: |A| = a21C21 + a22C22 + a23C23 second-row expansion = (–8)(–4) + (5)(4) + (8)(0) substitute in numbers = 52 simplify = a13C13 + a23C23 + a33C33 third-column expansion = (3)(13) + (8)(0) + (1)(13) substitute in numbers = 52 simplify or |A| As long as we multiply our cofactors by the right row or column in the matrix, we will obtain the same number each time. This leads to a useful strategy when finding determinants. Look for the row or column with the 344 – Determinants Matrices and Determinants most number of zeros. Then we only have to find a limited number of cofactors. We do not need to find cofactors of zero entries because zero times its cofactor is still zero. Find the determinant of the following matrix. B ⎛ 2 −1 3 ⎞ ⎜ 4 0 −5 ⎟ ⎜ ⎟ ⎝1 0 2 ⎠ = solution Since we have two zeros in the second column, that would be the column we want to use cofactor expansion with. If we use the definition for finding determinants, expanding around column 2, we have: |B| = b12C12 + b22C22 + b32C32 second-column expansion = (–1)C12 + (0)C22 + (0)C32 substitute in numbers from B = –C12 simplify Now all we need to do is determine C12. To do that, we delete the first row, second column of B and evaluate the determinant of the remaining matrix: C12 3 ⎛ 4 −5 ⎞ ⎜ ⎟ ⎝1 2 ⎠ = ( −1) ⋅ = (–1)(8 – (–5)) evaluate determinant of 2 x 2 matrix = –13 simplify formula to find cofactor Our determinant for our given 3 x 3 matrix is therefore: |B| = –C12 = –(–13) substitute in for C12 = 13 answer Matrices and Determinants Determinants – 345 346 – Determinants Matrices and Determinants Advanced To review some of the properties associated with the determinant of a matrix. No definitions on this page. Alan Turing (1912-1954) was highly instrumental in cracking German codes during World War II. The famous Enigma machine was already well-understood by the English. The reason why Enigma worked so well was that the number of possible letter combinations used to produce a code was astronomically huge. The trick to successful decryption of codes is to run through the most combinations in the least amount of time. Turing did this by building a special machine known as Colossus. It reduced decryption time from hours to minutes, which gave the Allies a huge tactical advantage. Matrices → Determinants → Properties of Determinants Taking the determinant of a matrix is a mathematical operation. As we should already know, many mathematical operations have properties associated with them that allow us to greatly simplify our calculations. For example, by this time we are familiar with expanding a binomial square, such as (4 – 3x)2. We have a theorem that tells us that when we expand this binomial expression we will get (16 – 6x + 9x2). However, in order to derive this result, we need to use the Distributive Property of Multiplication. In the same way, if we know about the properties of determinants outlined below, we can immediately start simplifying the determinant without resorting to tedious calculations. Property 1 If all the elements in a particular row or a particular column are zero, then the determinant of the matrix is also zero. ⎛ 3 −2 4 ⎞ ⎜0 0 0⎟ ⎜ ⎟ ⎝1 5 6⎠ = 0 = 0 ⎛ −2 4 ⎞ + 0⋅ ⎛ 3 4 ⎞ + 0⋅ ⎛ 3 −2 ⎞ ⎜ ⎟ ⎜ ⎟ ⎜ ⎟ ⎝ 5 6⎠ ⎝1 6⎠ ⎝1 5 ⎠ cofactor expansion along second row simplify by multiplying each determinant by zero Property 2 If any two rows or columns of a matrix are interchanged, then the determinant of the resulting matrix is the opposite in sign of the determinant of the original matrix. Matrices and Determinants Properties of Determinants – 347 ⎛7 0⎞ ⎛ −2 4 ⎞ ⎛ −2 4 ⎞ ⎜ ⎟ − 8⋅ ⎜ ⎟ +1 ⎜ ⎟ ⎝5 6⎠ ⎝ 5 6⎠ ⎝ 7 0⎠ cofactor expansion along first column = 3⋅ = 3(42) – 8(–32) + 1(–28) use formula for determinants of 2 x 2 matrices = 354 simplify Now switch the first and second columns and see what we get: ⎛7 0⎞ ⎛ −2 4 ⎞ ⎛ −2 4 ⎞ ⎜ ⎟ + 8⋅ ⎜ ⎟ − 1⋅ ⎜ ⎟ ⎝5 6⎠ ⎝ 5 6⎠ ⎝ 7 0⎠ cofactor expansion along second column = −3⋅ = –3(42) + 8(–32) – 1(–28) use formula for determinants of 2 x 2 matrices = –354 simplify Property 3 If two rows or columns of a matrix have corresponding elements that are equal, then the determinant of that matrix is zero. ⎛ 4 3 7⎞ ⎜ −1 2 1 ⎟ ⎜ ⎟ ⎝ −1 2 1 ⎠ ⎛ 2 1 ⎞ − 3 ⎛ −1 1 ⎞ + 7⋅ ⎛ −1 2 ⎞ ⎜ ⎟ ⎜ ⎟ ⎜ ⎟ ⎝2 1⎠ ⎝ −1 1 ⎠ ⎝ −1 2 ⎠ = 4⋅ = 4(0) – 3(0) + 7(0) use formula for determinants of 2 x 2 matrices = 0 simplify cofactor expansion along first row Property 4 If each element in one row or column is multiplied by some real number constant k, then the determinant of that matrix is also multiplied by k. ⎛ 5 −1 3 ⎞ ⎜2 1 4⎟ ⎜ ⎟ ⎝ −7 3 −4 ⎠ ⎛1 4 ⎞ ⎛2 4⎞ ⎛ 2 1⎞ ⎜ ⎟ − ( −1) ⋅ ⎜ ⎟ + 3⋅ ⎜ ⎟ ⎝ 3 −4 ⎠ ⎝ −7 −4 ⎠ ⎝ −7 3 ⎠ = 5⋅ = –21 cofactor expansion along first row simplify Now multiply every element in the first row by 3 and see what happens to the determinant: 348 – Properties of Determinants Matrices and Determinants = 3⋅ 5⋅ = 3⎢ 5⋅ ⎡ ⎣ ⎛1 4 ⎞ ⎛2 4⎞ ⎛ 2 1⎞ ⎜ ⎟ − 3( −1) ⋅ ⎜ ⎟ + 3⋅ 3⋅ ⎜ ⎟ ⎝ 3 −4 ⎠ ⎝ −7 −4 ⎠ ⎝ −7 3 ⎠ ⎛ 1 4 ⎞ − ( −1) ⋅ ⎜ ⎟ ⎝ 3 −4 ⎠ ⎛ 2 4 ⎞ + 3⋅ ⎛ 2 1 ⎞ ⎤ ⎜ ⎟ ⎜ ⎟ ⎥ ⎝ −7 −4 ⎠ ⎝ −7 3 ⎠ ⎦ cofactor expansion along first row, multiplying each element in first row by 3 factor out a 3 from each term = 3(–21) simplify expression inside parentheses from above = –63 this is 3 times original determinant Property 5 If each and every element of one row or one column is multiplied by a real number constant k and if the resulting products are then added to the corresponding elements of another row or column, respectively, the resulting determinant equals the original determinant. This property is considerably more complicated than the other four properties, so a little more explanation is in order. In this example, we will first find the determinant normally. Then we will apply property 5 as outlined above to our original matrix. We will multiply the second column by some real number k. We will add the product of k and the first element of column 2 to the first element of column 1. We will add the product of k and the second element of column 2 to the second element of column 1. We do the same procedure once more for the product of k and the third element of column two and the third element of column 1. Again, we find the determinant of the resultant matrix. If everything goes as planned, we should obtain the same determinant as we did the first time. First, find the determinant normally: ⎛ 2 −1 7 ⎞ ⎜ 6 −2 5 ⎟ ⎜ ⎟ ⎝ −6 2 1 ⎠ ⎛ −2 5 ⎞ ⎜ ⎟ − ( −1) ⎝ 2 1⎠ ⎛ 6 5⎞ ⎛ 6 −2 ⎞ ⎜ ⎟ +7 ⎜ ⎟ ⎝ −6 1 ⎠ ⎝ −6 2 ⎠ cofactor expansion along first row = 2 = 2(–24) – (–1)(36) + 7(0) use formula for determinants of 2 x 2 matrices = 12 simplify Now we are going to mess around with the first column. First, we multiply the second column by 3. Then we add the result of this to the first column to produce a column equivalent matrix: Matrices and Determinants Properties of Determinants – 349 ⎡ 2 + 3( −1) −1 7 ⎤ ⎢ 6 + 3( −2) −2 5 ⎥ ⎢ ⎥ ⎣ −6 + 3( 2) 2 1 ⎦ = ⎛ −1 −1 7 ⎞ ⎜ 0 −2 5 ⎟ ⎜ ⎟ ⎝ 0 2 1⎠ ⎛ −2 5 ⎞ ⎜ ⎟ + 0+ 0 ⎝ 2 1⎠ add 2 times second column to first column = −1 = –1(–12) use formula for determinants of 2 x 2 matrices = 12 simplify cofactor expansion along first column From the previous example, we can see one reason why we like to have these properties around. In the final example for Property 5, we were able to convert one column of the matrix so that it contained mostly zeros. This made it much easier to perform the cofactor expansion since we then knew that we would end up with a lot of zero terms, thus reducing the number of times we have to find the determinant of a 2 x 2 matrix. Anything that reduces the workload is all right by us. The properties are summarized below for convenient reference. Properties of Determinants 1.) If all the elements in a particular row or a particular column are zero, then the determinant of the matrix is also zero. 2.) If any two rows or columns of a matrix are interchanged, then the determinant of the resulting matrix is the opposite in sign of the determinant of the original matrix. 3.) If two rows or columns of a matrix have corresponding elements that are equal, then the determinant of that matrix is zero. 4.) If each element in one row or column is multiplied by some real number constant k, then the determinant of that matrix is also multiplied by k. 5.) If each and every element of one row or one column is multiplied by a real number constant k and if the resulting products are then added to the corresponding elements of another row or column, respectively, the resulting determinant equals the original determinant. 350 – Properties of Determinants Matrices and Determinants Advanced To look at a special type of matrix. upper triangular matrix – matrix containing nonzero elements on and above the main diagonal and zero elements below the main diagonal. lower triangular matrix – matrix containing nonzero elements on and below the main diagonal and zero elements above the main diagonal. diagonal matrix – a matrix that has nonzero elements on the main diagonal and zero elements everywhere else. Thales of Miletus (c.640-546 B.C.) was a famous astronomer in addition to his vocations as mathematician and philosopher. One story reports that he once fell down a well while star-gazing. The slave girl who discovered him commented that Thales was "so interested in the heavens that he could not see what was in front of his feet." Thales's interest in astronomy may be a result of his travels to Babylonia, where he studied with the Chaldean magi. He advised navigators to rely on the constellation Ursa Minor (Little Bear) for navigation rather than Ursa Major (Great Bear). This was because Ursa Minor contains the star Polaris, which is always located due north and it is therefore easy for navigators to find their position relative to this star. Matrices → Determinants → Triangular Matrix There are three types of triangular matrices. An upper triangular matrix contains nonzero elements above the main diagonal and zero elements below the diagonal. A lower triangular matrix is just the opposite, containing nonzero elements below the main diagonal and zero elements above the main diagonal. Finally, a matrix that has zero elements above and below the main diagonal is said to simply be a diagonal matrix . Upper Triangular a 1n ⎞ ⎛ a11 a12 a13 ⎜ ⎜ 0 a22 a23 ⎜ 0 0 a 33 ⎜ ⎜ ⎜ 0 0 0 ⎝ ⎟ a 2n ⎟ a 3n ⎟ ⎟ ⎟ a mn ⎟⎠ Lower Triangular Diagonal 0 0 ⎛⎜ a11 ⎜ 0 a22 0 ⎜ 0 0 a 33 ⎜ ⎜ ⎜ 0 0 0 ⎝ ⎞⎟ 0 ⎟ 0 ⎟ ⎟ ⎟ a mn ⎟⎠ 0 One of the neat little advantages of such a triangular matrix (whether upper, lower, or diagonal) is that the determinant of such a matrix is simply the product of the entries along the main diagonal. Find the determinant of the following matrix. A = ⎛1 ⎜ −2 ⎜ ⎜ 14 ⎜ −3 ⎜ ⎝2 ⎞ ⎟ ⎟ 1 8 0 0 ⎟ 4 7 6 0 ⎟ ⎟ 2 1 6 −5 ⎠ 0 0 0 0 3 0 0 0 Matrices and Determinants Triangular Matrix – 351 solution Normally, we would cringe in terror at the thought of finding the determinant of a 5 x 5 matrix. However, we note that it is a lower triangular matrix in that all the entries above the main diagonal are zero. Thus, we know that the determinant is simply the product of the entries along the main diagonal: |A| = ⎛1 ⎜ −2 ⎜ ⎜ 14 ⎜ −3 ⎜ ⎝2 ⎞ ⎟ ⎟ 1 8 0 0 ⎟ 4 7 6 0 ⎟ ⎟ 2 1 6 −5 ⎠ 0 0 0 0 3 0 0 0 determinant of matrix A = (1)(3)(8)(6)(–5) product of main diagonal entries = –720 simplify answer We can check our answer by inputting this matrix into a computer program and asking it to find the determinant for us. Sure enough, it will return the answer we got above, –720. 352 – Triangular Matrix Matrices and Determinants Now that we know what determinants are, we want to find applications for them in the real world. In this section of the Algebra Brain, we will examine four common uses of determinants. Cramer's Rule Cramer's Rule allows us to quickly use determinants to find the solutions to systems of linear equations. At this point, we now have four ways to solve systems of linear equations: back-substitution, elimination with equations, elimination with matrices (a la Gaussian and Gauss-Jordan elimination), and finally Cramer's Rule. Use whichever method suits your fancy at the time. Cramer's Rule is ideal for use with computers because they can find determinants of matrices many times faster than a human. As long as we input the data correctly into a formula, then a computer program like MathCAD 2000® will do the number crunching and spit out the solution of the linear system. Area of a Triangle We can also use matrices and determinants to find the area of a triangle whose coordinates are given as points in a coordinate plane. Basically, we use a formula that takes the coordinate points as inputs into a matrix and spits out the area as the output of the formula. Test for Collinear Points If we are given three points in a coordinate plane, we can determine 1 of two things. Either they are not all on a line, in which case we can form a triangle with the three points whose area can be calculated using the formula above, or they are all on one line. If the three points are on one line, then by definition the area enclosed by the points must be zero. Therefore, if the determinant of the matrix used to calculate the area of a triangle equals zero, then all three points must lie on the same line in the plane. Two-Point Form of the Equation of a Line Given only two points, we can use the same 3 x 3 determinant from the previous two applications to find the equation of a line. The only difference between this application and the previous application is that instead of using six numbers in the matrix, we use four numbers (the coordinates of our two points) and the variables x and y as elements in the determinant of the matrix. Matrices and Determinants Applications of Determinants – 353 354 – Applications of Determinants Matrices and Determinants Advanced To find the area of a triangle using a 3 x 3 matrix. No definitions on this page. John Wallis (1616-1703) was an English algebraist, who, like many of his countrymen, led a varied and interesting life. He was a human calculating prodigy who could perform amazing mental feats, such as finding the square root of a 53-digit number to 17 decimal places without notation. During the English Civil War he deciphered Royalist documents for Lord Oliver Cromwell. As a reward for his services Lord Cromwell appointed Wallis the Savilian Professor of Geometry at Oxford. Matrices → Determinants → Applications of Determinants → Area of Triangle One useful application of determinants is to find the area of a triangle given its three vertices. The usual geometric formula only finds the area of a triangle if we know the base and the height (A = 1/2Bh), which is fine for many triangles, but is more difficult to use if we have a weirdly arranged triangle, like the one below. If we want to use the geometric formula, then we need to do a lot of calculation of distances and such. Very tedious and not much fun. However, there is a better method. Since we know what the three vertices of the triangle are, we can use the determinant of a 3 x 3 matrix. We set up the matrix so that the first two columns contain the coordinates of the three vertices. The third column contains all "1's". Then we find the area of this triangle by taking the determinant of this matrix, and multiplying it by 1/2. Matrices and Determinants Area of a Triangle – 355 Area of a Triangle The area of a triangle with vertices (x1, y1), (x2, y2) and (x3, y3) is given by Area = where the symbol ± indicates that the appropriate sign should be chosen to yield a positive area. Find the area of the triangle whose vertices are (0, 0), (2, 5), and (4, 1) as shown below. solution First, let's find the determinant of the matrix formed from the three vertices given above. We let (x1, y1) = (0, 0), (x2, y2) = (2, 5), and (x3, y3) = (4, 1) ⎛⎜ x1 y1 1 ⎞⎟ ⎜ x2 y2 1 ⎟ ⎜ x3 y3 1 ⎟ ⎝ ⎠ determinant of matrix formed from vertices of triangle = = = ⎛ 5 1 ⎞ + 0 ( −1) 3⋅ 0 ( −1) ⋅ ⎜ ⎟ ⎝1 1⎠ 2 ⎛2 5⎞ ⎜ ⎟ ⎝4 1⎠ ⎛ 2 1 ⎞ + 1 ( −1) 4⋅ ⎜ ⎟ ⎝4 1⎠ ⎛2 5⎞ ⎜ ⎟ ⎝4 1⎠ cofactor expansion along first row (because it has the most zeros) first two minors are multiplied by zero, so all we have left is the third minor = (2)(1) – (5)(4) determinant of 2 x 2 matrix = –18 simplify We plug this number into our formula for area. Note that we have to choose our sign so that our answer ends up being positive. In this case, we need to choose the negative sign. 356 – Area of a Triangle Matrices and Determinants Area = formula for area = use result from above = –(–9) choose negative sign to get a positive solution = 9 answer And that is all there is to finding the area of a triangle using determinants and three points. Matrices and Determinants Area of a Triangle – 357 358 – Area of a Triangle Matrices and Determinants Advanced To solve systems of linear equations using determinants and Cramer's Rule. No definitions on this page. Gabriel Cramer (1704-1752), a Swiss geometer, is not as well known as other mathematicians, but he still made important contributions to the areas of analysis, determinants, and geometry. In addition to Cramer's Rule, he also came up with Cramer's paradox, involving an apparent contradiction in a theorem put forth by Colin MacLaurin. Matrices → Determinants → Applications of Determinants → Cramer's Rule So far in algebra, we have become familiar with three basic methods of solving systems of linear equations. The first involves solving one or more of the equations independently, then back-substituting into the rest of the equations until we have found all of the solutions (assuming that the system has a solution). The second method involves some sort of elimination of a variable combined with back-substitution to find the solution. The third method uses matrices to perform the elimination process more efficiently. We can use either Gaussian elimination or Gauss-Jordan elimination. If we use Gaussian elimination, then we have to rely on backsubstitution to finish the process. If we use Gauss-Jordan elimination, we keep working on the matrix until it is in reduced row-echelon form, revealing the solution to the system. All of these methods work great under certain circumstances. However, if we have access to a computer program, there is another method that we can use to almost instantaneously find the solution to a system of linear equation. This method is called Cramer's Rule, named after Gabriel Cramer (1704–1752). This rule uses determinants to write the solution to a system of linear equations. Cramer's Rule: 2 x 2 Matrices In the previous thought about Determinants, we told you that the solution to the following linear system: a1x + b1y = c1 a2x + b2y = c2 and is given by: x = c1 b 2 − c2 b 1 a1 b2 − a2 b1 and y = a 1 c2 − a 2 c1 a1 b2 − a2 b1 as long as a1b2 – a2b1 ≠ 0. Notice that the numerators and denominators correspond to determinants of 2 x 2 matrices, as follows: Matrices and Determinants Cramer's Rule – 359 x c1 b 2 − c2 b 1 = y a1 b2 − a2 b1 ⎛ c1 ⎜ ⎝ c2 ⎛ a1 ⎜ ⎝ a2 = = a 1 c2 − a 2 c1 a1 b2 − a2 b1 ⎛ a1 ⎜ ⎝ a2 ⎛ a1 ⎜ ⎝ a2 b1 ⎞ ⎟ b2 ⎠ b1 ⎞ ⎟ b2 ⎠ = c1 ⎞ ⎟ c2 ⎠ b1 ⎞ ⎟ b2 ⎠ For both the variables x and y, we find the solution by dividing a determinant by another determinant based on the coefficient matrix. The numerator matrix is determined by the constant terms and the coefficients of the other variable. Above, the coefficients b1 and b2 correspond to the y-terms in the linear equation, so we use those numbers to help us find x. Similarly, since a1 and a2 are coefficients of the x-term, we use them to help us find y. If we denote the determinant of the coefficient matrix by D and the other two determinants in the numerators by Dx and Dy respectively, then we have the following: D = ⎛ a1 b1 ⎞ ⎜ ⎟ ⎝ a2 b2 ⎠ and Dx = ⎛ c1 b 1 ⎞ ⎜ ⎟ ⎝ c2 b 2 ⎠ and Dy = ⎛ a 1 c1 ⎞ ⎜ ⎟ ⎝ a 2 c2 ⎠ which means that we can write our solution to the linear equation as: x = = ⎛ c1 ⎜ ⎝ c2 ⎛ a1 ⎜ ⎝ a2 b1 ⎞ ⎟ b2 ⎠ b1 ⎞ ⎟ b2 ⎠ Dx y = = D ⎛ a1 ⎜ ⎝ a2 ⎛ a1 ⎜ ⎝ a2 c1 ⎞ ⎟ c2 ⎠ b1 ⎞ ⎟ b2 ⎠ Dy D Now that we have seen how Cramer's Rule works in the abstract sense, let's use a few numbers. Use Cramer's Rule to solve (if possible) the system of equations. 3x + 4y = –2 5x + 3y = 4 solution First we need to find the determinant of the coefficient matrix and make sure it does not equal to zero. Otherwise, Cramer's Rule doesn't work. 360 – Cramer's Rule Matrices and Determinants D ⎛3 4⎞ ⎜ ⎟ ⎝5 3⎠ = determinant of coefficient matrix = 9 – 20 formula for determinant of 2 x 2 matrix = –11 simplify Since D ≠ 0, we can go ahead and apply Cramer's Rule as follows: x = Dx formula to find x D ⎛ −2 4 ⎞ ⎜ ⎟ ⎝ 4 3⎠ = y find Dx and divide by D = = Dy ⎛ 3 −2 ⎞ ⎜ ⎟ ⎝5 4 ⎠ −11 = = x = formula to find y D find Dy and divide by D −11 −6 − 16 −11 −22 −11 use formula for determinant of 2 x 2 matrix = simplify = simplify answer 2 y = 12 − ( −10) −11 22 −11 –2 use formula for determinant of 2 x 2 matrix simplify simplify answer The solution is x = 2 and y = –2. Feel free to check this in the original system. Cramer's Rule: n x n Matrix Now we will expand Cramer's Rule to matrices of higher order. If we have n equations and n unknowns, then the value of each variable is given as the quotient of two determinants. The denominator is the determinant of the coefficient matrix. The numerator is the determinant of the matrix formed by replacing the column corresponding to the variable (being solved for) with the column representing the constants. For example, if we have the following system of three unknowns and three equations: a11x1 + a12x2 + a13x3 = b1 a21x1 + a22x2 + a23x3 = b2 a31x1 + a32x2 + a33x3 = b3 then the solution for x1 in the system is given by: x1 = A1 A = ⎛⎜ b1 ⎜ b2 ⎜ b3 ⎝ ⎛⎜ a11 ⎜ a21 ⎜ a31 ⎝ a 12 a 13 ⎞ ⎟ a 22 a 23 ⎟ a 32 a 33 ⎟⎠ a 12 a 13 ⎞ ⎟ a 22 a 23 ⎟ a 32 a 33 ⎟⎠ At this point, we would probably break down and use a calculator or computer to find the determinants rather than find them by hand. In general, we can write Cramer's Rule as: Matrices and Determinants Cramer's Rule – 361 Cramer's Rule If a system of n linear equations containing n unknowns has a coefficient matrix A with a nonzero determinant | A |, the solution of the system is given by: x1 = , x2 = ,..., xn = where the ith column of Ai is the column of constants in the system of equations. If the determinant of the coefficient matrix is zero, the system has either no solution or infinitely many solutions. Let's look at an example of how this works with a system of three equations and three unknowns. For simplicity, we will not show the steps needed to find the determinants. If you need to go back and review how to do that, please visit the Determinant page two thoughts up in the Algebra Brain. Use Cramer's Rule to solve (if possible) the following system of linear equations. 4x – 2y + 3z = –2 2x + 2y + 5z = 16 8x – 5y – 2z = 4 solution First we need to find the determinant of the coefficient matrix: D 2 ⎛ 2 5 ⎞ + 2 ( −1) 3⋅ ⎜ ⎟ ⎝ −5 −2 ⎠ ⎛ −2 3 ⎞ + 8 ( −1) 4⋅ ⎜ ⎟ ⎝ −5 −2 ⎠ ⎛ −2 3 ⎞ ⎜ ⎟ ⎝ 2 5⎠ = 4 ( −1) ⋅ = 4(21) – 2(19) + 8(16) find determinants of 2 x 2 matrices = 84 – 38 – 128 simplify = –82 simplify expand along first column Since the determinant is not zero, we can apply Cramer's Rule. We find the determinant of three matrices. Dx is formed by replacing the first column of D with the constant terms. Dy is formed by replacing the second column of D with the constant terms. Dz is formed by replacing the third column of D with the constant terms. 362 – Cramer's Rule Matrices and Determinants x = y = z = Dx D Dy D Dz D = = = = = = −410 −82 −656 −82 164 −82 = 5 = 8 = –2 Thus, the solution to the system of linear equations is: x = 5 and y = 8 and z = –2 answer Matrices and Determinants Cramer's Rule – 363 364 – Cramer's Rule Matrices and Determinants Advanced To use a matrix to determine if three points all lie on the same line. No definitions on this page. John Wallis (1616-1703) made numerous contributions to modern mathematics. He investigated an infinite approximation to 4 / π, negative and fractional exponents, and the center of gravity in cycloids using indivisibles. Wallis was the first to use " " to indicate infinity and introduced the mantissa in logarithms. Finally, he is the first to graphically display the complex roots of a real quadratic equation. Matrices → Determinants → Applications of Determinants → Test for Collinear Points Collinear Points Suppose we have three points that are all on the same line in a plane. By definition, the area enclosed by these three points is equal to zero. We use three points to define the area of a triangle. Therefore, we can conclude that if the area of a triangle given by three points in a plane is equal to zero, those three points must lie on the same line. Consider the three points given below: The points (0, 3), (3, 4) and (6, 5) are clearly on the same line. If we use the formula for finding the "area" enclosed by these points using determinants, we find that the area is: Matrices and Determinants Test for Collinear Points – 365 Area = = = = = 1 ⎛0 3 1⎞ ⎜3 4 1⎟ ⎟ 2 ⎜ ⎝6 5 1⎠ formula for area of triangle 1⎡ 2 ⎛4 1⎞ 3 ⎛3 1⎞ 4 ⎛3 4⎞ ⎤ ⎢ 0 ( −1) ⋅ ⎜ ⎟ + 3 ( −1) ⋅ ⎜ ⎟ + 1 ( −1) ⎜ ⎟ ⎥ 2⎣ ⎝5 1⎠ ⎝6 1⎠ ⎝6 5⎠ ⎦ cofactor expansion along first row 1 [ 0 − 3( 3 − 6) + ( 15 − 24) ] determinants of 2 x 2 matrices ( 9 − 9) simplify 2 1 2 0 simplify again We have just demonstrated that the "area" enclosed by three points on the same line is zero. This leads us to the conclusion that if the determinant in the formula above is zero, all three points must lie on the same line. Formally, we can write this as: Test for Collinear Points Three points (x1, y1), (x2, y2), and (x3, y3) are collinear (lie on the same line) if and only if = 0 test for collinear points This makes sense because if the determinant did not equal zero, then you would have a triangle. Use a determinant to see whether the points are collinear. ⎛ 2, − 1 ⎞ ⎜ 2⎟ ⎝ ⎠ (–4, 4) (6, –3) given points solution To see if these three points are collinear, we find the determinant of the matrix formed from these points. ⎛⎜ 2 − 1 1 ⎞⎟ 2 ⎟ ⎜ ⎜ −4 4 1 ⎟ ⎜ 6 −3 1 ⎟ ⎝ ⎠ = 2 2 ( −1) ⋅ ⎛ 4 1⎞ ⎛ 1⎞ 3 ⎛ −4 1 ⎞ 4 ⎛ −4 4 ⎞ ⎜ ⎟ + ⎜ − ⎟ ( −1) ⋅ ⎜ ⎟ + 1 ( −1) ⋅ ⎜ ⎟ ⎝ −3 1 ⎠ ⎝ 2 ⎠ ⎝ 6 1⎠ ⎝ 6 −3 ⎠ cofactor expansion along first row = 2( 4 + 3) + 1 ( −4 − 6) + ( 12 − 24) 2 determinants of 2 x 2 matrices = –3 simplify Since the determinant is not equal to zero, we conclude that these three points do not lie on the same line, as we can see on the graph below. answer 366 – Test for Collinear Points Matrices and Determinants Equation of a Line We can adapt the test for collinear points to another use. If we are only given two points in a rectangular coordinate system instead of three, we can use the same formula above to find the equation of the line described by those two points. The only difference between the formula above and the next formula is that we replace the first two elements of the array with x and y, representing any two points on the line. Two-Point Form of the Equation of a Line An equation of the line passing through the points (x1, y1) and (x2, y2) is given by = 0 two-point form of the equation of a line The result of using this formula is an equation of a line in standard form. Let's see how this works. Find an equation of the line passing through the points (1, 3) and (7, 5), as shown in the figure below. Matrices and Determinants Test for Collinear Points – 367 solution Applying the determinant formula for the equation of a line produces: ⎛x y 1⎞ ⎜1 3 1⎟ ⎜ ⎟ ⎝7 5 1⎠ = 0 two-point form of the equation of a line To evaluate this determinant, we expand by cofactors along the first row to obtain the following: 2 x( −1) ⋅ ⎛3 1⎞ 3 ⎜ ⎟ + y( −1) ⋅ 5 1 ⎝ ⎠ ⎛1 1⎞ 4 ⎜ ⎟ + 1 ( −1) ⋅ 7 1 ⎝ ⎠ ⎛1 3⎞ ⎜ ⎟ ⎝7 5⎠ = 0 cofactor expansion along first row x(3 – 5) – y(1 – 7) + (5 – 21) = 0 determinants of 2 x 2 matrices –2x + 6y – 16 = 0 simplify Therefore, an equation of the line is: –2x + 6y – 16 = 0 answer Compare this method to another two-point method for finding an equation of a line. 368 – Test for Collinear Points Matrices and Determinants Unit 6 Sequences and Probability Sequences and Probability – 369 370 – Sequences and Probability In this section of the Algebra Brain, we will examine sequences and probability. In English, a sequence is simply a set of items arranged in a particular order. A list of names is often listed in alphabetical order because this makes it easy to find a particular name on the list. In mathematics, a sequence is a function whose domain consists of all the positive integers, listed in some specific order. Once we know the general pattern for the sequence, it becomes a relatively simply matter to find the nth term for that sequence. Two very common types of sequences are the arithmetic sequence and the geometric sequence. In an arithmetic sequence, each successive term of the sequence is found by adding some number to the current term in the sequence. The difference between two successive terms is a constant number d. For example, in the sequence: 1, 3, 5, 7, 9, … we add 2 to each term of the sequence to obtain the next term of the sequence. In a geometric sequence, the ratio of two successive terms is a constant number r. In other words, to find the next term of the sequence, we multiply the current term by r. For example, in the sequence: 1, 2, 4, 8, 16, 32, … we multiply each term of the sequence by 2 to obtain the next term of the sequence. We also present the Binomial Theorem in this section of the Algebra Brain. Recall that we can use a simple formula to expand a binomial term raised to the second formula. This becomes much more difficult for binomials raised to higher powers. The Binomial Theorem gives us a formula that allows us to find the coefficients of a binomial term raised to the nth power. Finally, we will investigate the basic laws of probability. Most people would agree that a coin tossed into the air has a 1 in 2 chance of coming up heads. However, we can also use the laws of probability to determine the likelihood of buying a defective product if we know how many defective products are in a particular shipment. We can also determine the probability of winning the lottery, in which case we no longer need to study algebra and can go live on a yacht in the Caribbean. In fact, random drawings are a primary use of probability models, whether we are trying to win a million dollar jackpot or at a charity raffle. Sequences and Probability – 371 372 – Sequences and Probability Advanced To understand the definition of a factorial. factorial – for a positive integer n, the product of all the positive integers less than or equal to n. The word "factorial" was originally used by the German mathematician Christian Kramp. He referred to the product of factors in arithmetic progression, indicated as an|d. That is, a(a + d)(a + 2d) . . . a (a + n – 1)d = an|d. He is also responsible for the notation n! used today to represent the product of the first n positive integers. This corresponds directly to d = 1 in Kramp's original calculation above. Sequences and Probability → Factorials Some of the most important sequences in mathematics involve terms that are defined with special types of products called factorials . Probability is one of the more common applications of factorials. The Binomial Theorem also uses factorials to define coefficients of binomial expansion. They also play an important role in sequences and series in calculus. Definition of Factorial If n is a positive integer, n factorial is defined by: n! = 1 · 2 · 3 · 4 · . . . · (n – 1) · n! n factorial As a special case, zero factorial is defined as 0! = 1. From this definition it is easy to see that 1! = 1, 2! = (2)(1) = 2, 3! = (3)(2)(1) = 3 and so on. Also, factorials have the same determination in the order of operations as exponents. That is, just as 2x3 and (2x)3 imply different orders of operations, 2n! and (2n)! imply the following orders: 2n! (2n)! = 2(n!) = = 2(1 · 2 · 3 · . . . · n) 1 · 2 · 3 · . . . · n · (n + 1) · . . . · 2n Fractions involving factorials are often very easy to simplify. We can cancel all the like factors in the numerator and the denominator, leaving just the unlike factors. Simplify each factorial expression: a.) b.) c.) 7! 2!⋅ 6! 3!⋅ 4! 4!⋅ 5! n! ( n − 1)! Sequences and Probability Factorials – 373 solution For each of these expressions, we write out each of the factorials, then cancel like factors in the numerators and denominators. Then we evaluate the remaining expression. a.) 7! 2!⋅ 6! write out factorials in long form cancel common factors = b.) 3!⋅ 4! 4!⋅ 5! 2 write out factorials in long form cancel common factors = 1 = 5⋅ 4 1 = c.) n! ( n − 1)! remaining factors simplify answer 20 write out factorials in long form cancel common factors = = 374 – Factorials simplify whatever is left after our cancellations answer 7 = n remaining factor answer Sequences and Probability Advanced To learn the Fundamental Counting Principle, used to determine the number of outcomes of an event. No definitions on this page. In Archimedes's (287-212 B.C.) work, The Sand Reckoner, he lays out a system in which he attempts to calculate the number of grains of sand in the universe. It also contained a special notation for estimating and expressing very large numbers. Another of his famous mathematical puzzles is known as the Cattle Problem. The idea is to determine the number of cows and bulls of various colors, given that each cattle color is represented in a particular ratio to the others. As it turns out, there are an infinite number of solutions, but the Greeks had a difficult time solving the puzzle because they had no knowledge of algebra. Sequences and Probability → Counting Principle One of the most fundamental concepts to understanding probability is to be able to count the number of ways a particular event can occur. Oftentimes, the outcome of one event A can affect the outcome of a subsequent event B, so it is useful to know the likelihood of A occurring. The only way to do that is to count the number of ways A can occur. A certain game requires a roll of three six-sided dice (indicated by 3d6). A success on one dice roll is indicated when the total on all three dice is "8". How many different ways can a total of "8" be obtained on 3d6? solution One way we can do this is to construct the following table: one die another die the other die 1 1 6 2 2 4 3 3 2 4 3 1 5 2 1 The order of the numbers on the dice is unimportant, since all three are absolutely identical. The important thing to consider is the numbers. As we can see, there are only 5 ways we can obtain a total of "8" on three dice. answer In another game, the rules indicate that we need to roll a six-sided die at one point. However, if we roll a "5" or a "6" on the die, we roll again, this time disregarding any more 5's or 6's. Assuming that we roll a "5" or a "6" on the first roll, how many different ways can we obtain a total of "8" on the two dice rolls? solution Again, we construct a table to help us count the different possibilities: first roll second roll 5 3 6 2 Here, we can see that if we get a "5" on the first roll, we must roll a "3" in order to get a total of "8". Similarly, if we roll a "6" on the first roll, then we must roll a "2" in order to get a total of "8". Thus, there are only 2 ways to obtain a total of "8" on the two die rolls. answer Sequences and Probability Counting Principle – 375 Each of the previous examples illustrates an application of the counting principle. The first example involves counting with replacement, meaning that each roll of the die is independent of the other rolls. The second example, however, illustrates counting without replacement, meaning that the second die roll is dependent on the first. You will study this type event in more detail when you look at Probability. These examples are fairly simple. We can easily list the possible outcomes and determine which of those outcomes fit our criteria (in our examples, we wanted a total of "8" on all the dice). When possible, listing the outcomes is always the best approach. However, in many cases that is not feasible, because there are so many outcomes. For example, if we wanted to know how many ways we could rearrange the letters in "supercalifragilisticexpialidocious", we would be here the rest of our lives writing down the list of possible combinations. Fortunately, we can take advantage of a number of techniques to find the solution without having to list all the possible outcomes. One such method is the Fundamental Counting Principle, defined below: The Fundamental Counting Principle Let E1 and E2 be two events. The first event E1 can occur in m1 different ways. After E1 has occurred E2 can occur in m2 different ways. The number of ways that the two events can occur is given by m1 · m2. Note: The Fundamental Counting Principle can be extended to three or more events. For instance, the number of ways that three events E1, E2, and E3 can occur is m1 · m2 · m3. Telephone numbers in the United States have ten numbers. The first three digits are the area code, and indicate where in the United States a particular call originates. The next seven digits are the local telephone number. Now, there are a few restrictions on which digits can be used in the local telephone number. The digits 0 and 1 are not permitted as the first digit of the local telephone number. Also, the first 3 digits of the number cannot be "555", as that indicates a phony telephone number (you may have noticed that all telephone numbers in movies and television begin with "555". This is so nobody calls those numbers). How many different telephone numbers are possible within each area code? solution The first digit cannot be 0 or 1. That leaves 8 choices (2 – 9). The 6 digits remaining can be 0 – 9. If we apply the Fundamental Counting Principle, then the total number of local telephone numbers is given by: (8)(10)(10)(10)(10)(10)(10) = 8,000,000 total number of local telephone numbers However, we have to exclude all those numbers whose first three digits are "555", which means that we subtract (10)(10)(10)(10) = 10,000 numbers, leaving us with: 8,000,000 – 10,000 = 7,990,000 actual number of possible local phone numbers answer This is actually not very many phone numbers, considering the widespread use of faxes and cell-phones, so many businesses often have extension numbers, indicating different parts of an organization. 376 – Counting Principle Sequences and Probability Advanced To find the number of combinations of n elements taken r at a time. No definitions on this page. The multiplicative rule of combinations, nCr, n things taken r at a time dates as far back as the Jain mathematician Mahavira in 850 A.D. in India. Later mathematicians of the 10th and 11th centuries are responsible for creating the combinatorial triangle known today as Pascal's triangle, the elements of which are derived from the multiplicative rule of combinations. Sequences and Probability → Counting Principle → Combinations In a related thought, we discussed permutations, which refer to arranging items in a particular order. Now we are going to talk about combinations, where order is completely irrelevant. Basically, we want to look at a subset of a larger set. The arrangement of the subset does not matter. Most standard card games use this principle. Consider the collectible card game Magic: The Gathering®. Each player is dealt 7 cards from his or her deck. Once they pick up the cards, each player is free to rearrange the cards in any order they want. There is no particular arrangement of cards, so each arrangement of the 7 cards is the same combination. The only time the combination changes is when a player draws one card and then plays a card from his or her hand. In a more concrete example, consider the word SPEAR. While we can rearrange the letters in a number of ways, the combinations PEARS and REAPS are both equivalent combinations because they contain the exact same elements. Note, however, that if order were important then they would be different permutations. To sum up: Combinations order NOT important Permutations order VERY important Like permutations, we look at subsets of a larger set. That is, we can find the number of combinations of n elements taken r at a time. For example, we could find the number of 3-letter combinations we can find from the 5 letters in the word SPEAR. The formula for finding the number of combinations of n elements taken r at a time is similar to the formula for the number of permutations with an extremely important difference. The formula is given below: Combinations of n Elements Taken r at a Time The number of combinations of n elements taken r at a time is given by: nCr Note: = number of combinations of n elements taken r at a time This is the exact same formula used to compute the coefficients of a binomial expansion. In a classroom experiment, four students out of a class of 20 are selected to participate in the experiment. How many different combinations of four students are possible? Sequences and Probability Combinations – 377 solution The order of the students chosen is irrelevant, so we know that we can go ahead and use the formula given above, letting n = 20 and r = 4: 20C4 = = = 20! ( 20 − 4)! 4! 20! 16! 4! 20⋅ 19⋅ 18⋅ 17⋅ 16! 16! 4⋅ 3⋅ 2⋅ 1 number of combinations of 20 elements taken 4 at time simplify denominator expand a few factorials cancel common factors = = (5)(19)(3)(17) remaining factors = 4845 simplify answer Out of 20 students, there are nearly 5000 different ways of arranging them into 4-person teams. 378 – Combinations Sequences and Probability Advanced To find the number of ways elements can be arranged in order. No definitions on this page. Permutations actually come from a study of linguistics. At the turn of the 13th century, the Arab ibn Mun'im wrote in his Fiqh al-Hisab rules and results obtained in linguistics. These results illustrated the use of both permutations and combinations. At the end of the 13th century, another Arab, ibn al-Banna, demonstrated the rules for the number Pn of permutations of n elements, of combination nCr and arrangement nPr of n things taken r at a time. Sequences and Probability → Counting Principle → Permutations One important application of the Fundamental Counting Principle is in determining the number of ways we can arrange n elements (in order). An ordering of n elements is called a permutation and is defined below: Definition of Permutation A permutation of n different elements is an ordering of the elements such that one element is first, one element is second, one is third, and so on. How many permutations are possible for the letters B, R, A, I and N? solution To understand the number of permutations possible, we undertake the following reasoning. We basically want to rearrange the five letters in as many different ways as possible, with the order of the letters playing an important role. For the first letter of a permutation, we have 5 choices. For the second letter of a permutation, we only have 4 letters to choose from. The third letter of a permutation only offers 3 choices, and so on. We only get one possible choice for the final letter of a permutation (whichever letter we have not used). Thus: First position: Second position: Third position: Fourth position: Fifth position: Any of the 5 letters Any of the remaining 4 letters Any of the remaining 3 letters Any of the remaining 2 letters The 1 remaining letter We use the Fundamental Counting Principle to determine the total number of permutations: (5)(4)(3)(2)(1) = 5! total number of permutations = 120 simplify factorial answer The result of this example can be generalized as follows: Sequences and Probability Permutations – 379 Permutations of n Elements The number of permutations of n elements is given by: n • (n – 1) . . . 4 • 3 • 2 • 1 = n! number of permutations of n elements In other words, there are n! different ways that n elements can be ordered. In many cases, we are more interested in ordering a subset of a collection of elements rather than the entire collection. For instance, we might want to choose and order 3 elements out of a total of 8 elements. This means that we extract any three elements out of the group of eight and order them. Then we extract a second group of three elements (possibly the same, but not necessarily) and order those three elements. The idea is to find as many different orders of three elements out of the group of eight elements. If we generalize this, we can say that we are ordering r elements out of a total of n elements. This kind of ordering is called a permutation of n elements taken r at a time, defined as follows: Permutations of n Elements Taken r at a Time The number of permutations of n elements taken r at time is = nPr = n(n – 1)(n – 2) . . . (n – r + 1) number of permutations of n elements taken rat a time Eight students race at a swim meet. In how many ways can a group of three students out of the original eight come in first, second, and third (assuming there are no ties)? solution Using the formula above, we let n = 8 (total number of students) and r = 3 (subset of larger group). Then we get: 8P3 = = 8! ( 8 − 3)! 8! 5! substitute in n = 8 and r = 3 simplify denominator cancel common factors = = (8)(6)(7) remaining factors = 336 simplify answer The key thing to remember about permutations is that ORDER IS IMPORTANT! That is, if you take a look at the permutations of the letters C, A and T taken three at a time, then the permutations (C, A, T) and (T, A, C) are different because of the order. Note that there are three distinct elements in this set of three letters. However, what if we have the letters K, I, W, and I? How can we distinguish between the first I and the second I? The short answer is that we can't. The total number of permutations is given by 4P4 = 4! = 24. But some of the permutations will be the same because there are two I's. In order to find the total number of distinguishable permutations, we use the following formula: 380 – Permutations Sequences and Probability Distinguishable Permutations Suppose a set of n objects has n1 of one kind of object, n2 of a second kind, n3 of a third kind, and so on, with n = n1 + n2 + n3 + . . . + nk. Then the number of distinguishable permutations of the n objects is: number of distinguishable permutations In how many distinguishable ways can the letters in CALCULUS be arranged? solution This word has 8 letters, of which 2 are C's, 2 are L's, two are U's, 1 is an A, and 1 is an S. Thus, the number of distinguishable ways the letters can be written is: 8! 2! 2! 2! 1! 1! = 8⋅ 7! 8 number of distinguishable permutations = 7! cancel common factors = 5040 simplify factorial answer There are over five thousand different ways we can arrange the letters in the word CALCULUS that are completely distinct from every other permutation. Note that the total number of permutations (allowing for duplicate permutations) is given by 8P8 = 8! = 40,320. And no, we are not going to list them for you. Sequences and Probability Permutations – 381 382 – Permutations Sequences and Probability Advanced To use the Binomial Theorem to expand binomials raised to the nth power. binomial series – a binomial expansion which contains infinitely many terms. Although the Binomial Theorem is usually credited to the Chinese or to Isaac Newton, during the first half of the fifteenth century, the Persian mathematician Ghiyat ad din Ghamshid ibn Mas'ud al-Kashi also did important work on binomials. He also worked with exponential powers of whole numbers, n roots, and irrational numbers. Sequences and Probability → Binomial Theorem A binomial is a polynomial that has exactly two terms. If it had three terms, then it would be a trinomial. Now, we can raise a binomial to an integer power (such as 2). When that happens, we can expand the binomial by multiplying all its terms together. However, for any power of the binomial greater than 2, this becomes tedious and difficult. Therefore, what we want to find is a quick method of finding the coefficients of a binomial raised to any power higher than 2 (although the formula we come up with will work for binomials raised to any integer power). The key word here is integer power. The binomial theorem does not work for powers that are not integers. But we don't usually need to worry about that in algebra. Let's take a look at the expansion of (x + y)n for several values of n and see if we can't notice a pattern developing among the coefficients: (x + y)0 = 1 binomial raised to zero power (x + y)1 = x+y binomial raised to first power (x + y)2 = x2 + 2xy + y2 binomial raised to second power (x + y)3 = x3 + 3x2y + 3xy2 + y3 binomial raised to third power (x + y)4 = x4 + 4x3y + 6x2y2 + 4xy3 + y4 binomial raised to fourth power (x + y)5 = x5 + 5x4y + 10x3y2 + 10x2y3 + 5xy4 + y5 binomial raised to fifth power Here are some observations we can make based on the expansions given above: 1. In each expansion there are n + 1 terms. That is, if n is even, then we have an odd number of terms and if n is odd, then we have an even number of terms. 2. In each expression, x and y take on symmetrical roles. The powers of x decrease by 1 in each successive term and the powers of y increase by 1 in each successive term. 3. The sum of the powers of x and y equals n in each term. That is, the first term has xny0, the second term has xn–1y1, the third term has xn–2y2, and so on until we get to x0yn. 4. The coefficients increase and decrease in a symmetric manner. In order to determine the coefficients, we need to make use of the following theorem, which unfortunately requires calculus to prove: Sequences and Probability Binomial Theorem – 383 The Binomial Theorem In the expansion of (x + y)n (x + y)n = xn + nxn – 1y + . . . + nCrxn – ryr + . . . + nxyn – 1 + yn The coefficient nCr of xn – ryr is given by nCr Note: = As it turns out, this theorem is just a special case of a much more powerful theorem used for what are called binomial series . Isaac Newton, who came up with the binomial theorem about 2600 years after the Chinese, also figured out a way to expand a binomial that is raised to a rational power, such as 1/2. However, you don't get a nice neat package like you get with the binomial theorem. Instead you get an infinite number of terms, the coefficients of which are determined in a similar manner as described above. Find the binomial coefficients: a.) 9C7 b.) 4C1 c.) 6C3 solution a.) 9C7 = = = b.) 4C1 9⋅ 8⋅ 7⋅ 6⋅ 5⋅ 4⋅ 3⋅ 2⋅ 1 ( 2⋅ 1) ( 7⋅ 6⋅ 5⋅ 4⋅ 3⋅ 2⋅ 1) 9⋅ 8 write out factorials long hand cancel common factors in numerator and denominator 2⋅ 1 9x4 simplify rational expression = 36 answer = = = 6C3 Binomial Theorem ( 9 − 7)! 7! = = c.) 9! = 4! (4 1) ! 1 ! 4.3.2.1 ( 3.2.1) ( 1) 4 Binomial Theorem write out factorials long hand cancel common factors 1 4 answer 6! 3! 3! 384 – Binomial Theorem Binomial Theorem Sequences and Probability = 6.5.4.3 ! 3! 3! write out factorials long hand = 6.5.4 3.2.1 cancel common factorials = 5x4 cancel common factors = 20 answer Each coefficient belongs in front of xn –ryr, so the answer to part (a) above would go in front of x9 – 7y7. The full term would look like: 36x2y7 Sequences and Probability Binomial Theorem – 385 386 – Binomial Theorem Sequences and Probability Advanced To find the coefficients of a binomial expansion. No definitions on this page. Blaise Pascal (1623-1662) was a French geometer, probabilist, physicist, inventor, and philosopher. His most famous invention is a mechanical calculator that automatically adds and subtracts. The basic principle behind this device was still being used in the first half of the 20th century until modern electronics came along. The triangle that bears his name was actually invented 600 years earlier by the Chinese. Pascal used it so creatively in his studies on probability that the triangle was given his name. Pascal's work on binomial coefficients helped lead Isaac Newton to his discovery of the general binomial theorem, which works for all real numbers, not just integers. Sequences and Probability → Binomial Theorem → Pascal's Triangle There is a convenient method to remember the binomial coefficients. It is called Pascal's Triangle, after the French mathematician Blaise Pascal (1623 – 1662). The basic idea is to build a triangle out of the binomial coefficients. Each row of the triangle corresponds to a binomial raised to a particular degree. We start with degree 0 and work our way down the triangle. The coefficients of lower rows are formed by adding coefficients of the two numbers in the row above. The top row is row zero because it corresponds to the binomial (x + y )0. The first and last numbers of each row are 1. Each of the other numbers in the rows is formed by adding the two numbers directly above it. The middle number of row two, for example, is formed by adding the two ones in row 1. The second number in row 4 is formed by adding the 1 and 3 directly above it in row 3. Here are the expanded binomials of (x + y)n using Pascal’s Triangle to determine the coefficients: Sequences and Probability Pascal's Triangle – 387 Note: (x + y)0 = 1 zeroth row (x + y)1 = 1x + 1y first row (x + y)2 = 1x2 + 2xy + 1y2 second row (x + y)3 = 1x3 + 3x2y + 3xy2 + 1y3 third row (x + y)4 = 1x4 + 4x3y + 6x2y2 + 4xy3 + 1y4 fourth row (x + y)5 = 1x5 + 5x4y + 10x3y2 + 10x2y3 + 5xy4 + 1y4 fifth row (x + y)6 = 1x6 + 6x5y + 15x4y2 + 20x3y3 + 15x2y4 + 6xy5 + 1y6 sixth row (x + y)7 = 1x7 + 7x6y + 21x5y2 + 35x4y3 + 35x3y4 + 21x2y5 + 7xy6 + 1y7 seventh row The sum of the exponents of x and y always equals the degree of the binomial. Pascal's triangle is only useful for the first half-dozen rows or so. After that, it gets a bit tedious to build the triangle each time. However, if, for some reason, you have trouble remembering the Binomial Theorem method of generating coefficients, then by all means use Pascal's Triangle. Although credit for the triangle is given to Pascal, versions of the triangle were known to Eastern cultures (particularly the Chinese) long before its "discovery" by the West. 388 – Pascal's Triangle Sequences and Probability Advanced To learn the definition of probability. No definitions on this page. Dice are among the first examples of a random number generator. Besides gambling, dice have also been used for foretelling the future. Indeed, one of Julius Caesar's most famous classical quotations is, "Alea jacta est," meaning, "The die is cast." This refers to the fact that once one has started upon a course of action, the outcome is in the hands of Fate or Destiny, represented in the form of dice. Sequences and Probability → Probability Probability, ladies and gentlemen, is all about finding the likelihood of an event occurring. For example, the probability that the Cubs win the World Series might be one in a million, depending on whom you talk to. This means that the Cubs will only win the World Series once every million years. Probability is always associated with uncertainty. That is, there is seldom a guarantee that an event will happen. We can only look for the possibilities of an event occurring in terms of the possibility that the event will not occur. In general, any happening in which the result is uncertain is called an experiment. The possible results of the experiment are called outcomes. The set of all possible outcomes of an experiment is called the sample space. Any sub-collection of the sample space is called an event. Let's look at a simple example, a six-sided die (abbreviated as 1d6). When we toss 1d6 into the air, the sample space (i.e. the total number of the outcomes) can be represented by all the numbers 1 through 6. However, the event, or the number that we actually get, is just one of the numbers 1 through 6. In other words, if we roll a "5", then we have: S = {1, 2, 3, 4, 5, 6} sample space E = {5} event The probability that we roll a "5" is given by counting the number of times that this occurs in the sample space and dividing that number by the total number of outcomes. In this case, the number "5" only occurs 1 time in the sample space, which contains 6 outcomes. Therefore, the probability of rolling a "5" is given by: P(5) = 1 6 probability of rolling a "5" on 1d6 Suppose we now roll 2d6 (two six-sided dice) and add the number together. The smallest number we can get is "2" and the largest number is "12". Therefore, the sample space includes all numbers between 2 and 12. However, we now have a new wrinkle. Some numbers will occur more than once in the sample space. For instance, we can roll a 1 and a 4 to get 5 or we can roll a 2 and a 3 to get 5. Or we can reverse the order: 4 + 1 = 5 or 3 + 2 = 5. One way to get a better picture of what is going on is with a chart, listing each die roll and the total on 2d6: Sequences and Probability Probability – 389 die 1 2 3 4 5 6 1 2 3 4 5 6 7 2 3 4 5 6 7 8 3 4 5 6 7 8 9 4 5 6 7 8 9 10 5 6 7 8 9 10 11 6 7 8 9 10 11 12 Notice that there are actually 36 outcomes in the sample space. The number "5" occurs 4 times in the sample space, so the probability of rolling a total of 5 on 2d6 is therefore: 4 = P(5) 36 1 = 9 probability of rolling 5 on 2d6 simplify fraction In general, the number of outcomes in event E is denoted by n(E). The number of outcomes in the sample space is denoted by n(S). The probability that event E will occur is given by n(E) / n(S). This is summarized in the following theorem: The Probability of an Event If an event E has n(E) equally likely outcomes and the sample space S has n(S) equally likely outcomes, then the probability of event E is: P(E) = probability of event E Since the number of outcomes of an event E must be less than or equal to the number of outcomes in the sample space, the probability of an event occurring is ALWAYS between 0 and 1. That is, 0 ≤ P(E) ≤ 1 If P(E) = 0, then the event E cannot occur and E is called an impossible event. If P(E) = 1, then event E absolutely has to occur, and E is called a certain event. When determining the number of outcomes in a sample space, it is often very helpful to apply the Fundamental Counting Principle. A certain game requires a roll of 3d6. In order for it to be a "critical success" (i.e. extremely good), the total of all three dice must be 5 or less. What is the probability of rolling 5 or less on 3d6? solution First, we need to know the total number of outcomes. In order to do that, we use the Fundamental Counting Principle. Since each die has 6 outcomes, we conclude that the total number of outcomes is given by n(S) = (6)(6)(6) number of outcomes in sample space = 216 simplify Next, we need to determine how many of the outcomes are 5 or less. The smallest number we can roll on 3d6 is 3, which only occurs once. Here is the list of ways the dice can land: 390 – Probability Sequences and Probability first die second die third die total 1 1 1 3 2 1 1 4 1 2 1 4 1 1 2 4 2 2 1 5 2 1 2 5 1 2 2 5 3 1 1 5 1 3 1 5 1 1 3 5 When we count the columns, we see that there are 10 ways in which we can achieve a sum of 5 or less on 3d6. Thus, n(E) = 10, and the probability of rolling 5 or less on 3d6 is given by: P(5 or less) = = 10 216 5 108 probability of rolling 5 or less on 3d6 simplify fraction answer When we convert this fraction to percentile form, we find that we only have about a 4.6% chance of rolling a "5" on 3d6. Sequences and Probability Probability – 391 392 – Probability Sequences and Probability Advanced To find the probability of independent events. independent events – the occurrence of one event has no impact whatsoever on the occurrence of another event. Jakob Bernoulli (1645-1705) contributed significantly to the field of probability theory. One of his theorems, dubbed "The Law of Large Numbers" by Simeon-Denis Poisson, states that if one makes a very large number of independent trials - flipping a coin, for example - then the observed ratio of successes to failures for a given event - obtaining "heads" in a coin flip - will be very close to the theoretical probability for that event on each individual trial. In other words, flipping a coin 1000 times should yield roughly 500 instances of "heads", since 500/1000 = 1/2, which is the probability of obtaining "heads" on an individual trial. Sequences and Probability → Probability → Independent Events Two events are said to be independent events when the occurrence of one event has no impact whatsoever on the occurrence of the other event. For instance, rolling a "6" on 1d6 has no effect on future rolls of the same die. In general, we can use a very simple formula for finding the probability of independent events. Simply multiply the probabilities of each independent event together. Probability of Independent Events If A and B are independent events, the probability that both A and B will occur is given by: P(A and B) = P(A) · P(B) probability of independent events We have 6 dice, a four-sider (1d4), a six-sider (1d6), an eight-sider (1d8), a ten-sider (1d10), a twelve-sider (1d12) and a twenty-sider (1d20). What is the probability that we roll a "3" on all six dice if we roll them all together? solution Each die can be considered to be an independent event. The probability of rolling a "3" on 1d4 is P(3 on d4) = 1 /4. The probability of rolling a "3" on 1d6 is P(3 on d6) = 1/6. We can easily find the probabilities for each of the other dice in a similar manner. The probability for rolling a "3" on all six dice, then, is the product of each of the independent probabilities: P(3 on all 6 dice) = = ⎛ 1 ⎞⎛ 1 ⎞⎛ 1 ⎞⎛ 1 ⎞⎛ 1 ⎞⎛ 1 ⎞ ⎜ 4 ⎟ ⎜ 6 ⎟ ⎜ 8 ⎟ ⎜ 10 ⎟ ⎜ 12 ⎟ ⎜ 20 ⎟ ⎝ ⎠⎝ ⎠⎝ ⎠⎝ ⎠⎝ ⎠⎝ ⎠ 1 460800 probability of rolling a "3" on all six dice simplify answer Thus, there is only a 1 in 460,800 chance that we will roll a "3" on all six dice when we roll them all together. Sequences and Probability Independent Events – 393 394 – Independent Events Sequences and Probability Advanced To find the probability of the union of two events. mutually exclusive – sets with no common elements. Daniel Bernoulli (1700-1782) was a Swiss mathematician and physician. He is most famous for his theories on hydrodynamics. Thanks to Daniel Bernoulli, we understand how an airplane is able to fly. He also made significant contributions to the field of probability. Among his hypotheses is the supposition that as a person's material fortune increases geometrically, his or her moral fortune increase arithmetically. Thus Bernoulli drew a connection between economics and ethics, describing both in mathematical terms. Sequences and Probability → Probability → Union of Two Events In the previous thought, we looked at the probability of a single event occurring, independently of other events. Now we want to look at what happens to probability if we have two or more events occurring. Two events are called mutually exclusive if the events A and B have no outcomes in common. Oftentimes we express this notion in the terminology of sets. That is, if A and B are mutually exclusive then P(A = B) 0 probability of the intersection of sets A and B All this means is that A and B have nothing in common, so the probability that we will find an event that occurs in both the sets A and B is zero. For example, if we roll 2 six-sided dice (2d6), then the probability of rolling a 5 has nothing whatsoever to do with the probability of rolling a 10. They are mutually exclusive events. That is, there is no way possible that we can roll a 5 AND a 10. However, there is a distinct possibility of rolling a 5 OR a 10. In that case, to find the probability, all we need to do is add the probability of A to the probability of B. Let's see what the possibilities of rolling any number between 2 and 12 on 2d6 are again: die 1 2 3 4 5 6 1 2 3 4 5 6 7 2 3 4 5 6 7 8 3 4 5 6 7 8 9 4 5 6 7 8 9 10 5 6 7 8 9 10 11 6 7 8 9 10 11 12 From the chart, we can see that the probability of rolling a 5 on 2d6 is 4 out of 36 possibilities. The probability of rolling a 10 is 3 out of 36 possibilities. Therefore, the probability of rolling a 5 or a 10 is simply: P(5 or 10) = = 4 36 7 36 + 3 36 probability of mutually exclusive events simplify answer Simply adding probabilities is only good for when we have mutually exclusive events. When events are not mutually exclusive, we have to tack on another term into our little sum. For instance, if we want to find the probability of drawing an ace or a diamond from a deck of cards, we have to take into account that 1 of the cards is both an ace and a diamond. Therefore, the total probability is determined by subtracting the number of Sequences and Probability Union of Two Events – 395 elements in the larger set that fit both criteria. This keeps us from counting an element twice when we are determining the probability of an event. In general, we can summarize the probability of the union of two events in the following theorem: Probability of the Union of Two Events If A and B are events in the same sample space, the probability of A or B occurring is given by P(A U B) = P(A) + P(B) – P(A B) probability of A or B If A and B are mutually exclusive events, then P(A U B) = P(A) + P(B) probability of mutually exclusive events One card is selected from a standard 52-card deck. What is the probability that it is a number card or a club? A number card includes all those with a digit between 2 and 10. solution The deck has 13 clubs, so the probability of selecting a club (event A) is P(A) = 13/52. The deck also contains 36 number cards (9 of each suit), so the probability of selecting a number card (event B) is P(B) = 36/52. We also have to take into account that some of the number cards are also clubs (9 of them, to be exact), thus P(club and number card) = 9/52. To find the net probability of drawing a club or a number card, we have to subtract the redundant event of drawing a club and a number card. In other words, these events are not mutually exclusive, so we have to use the first formula given in the theorem above: P(A U B) = P(A) + P(B) – P(A B) probability of non-mutually exclusive events P(club or number card) = P(club) + P(number card) – P(club and number card) events for this situation P(club or number card) = = = 13 36 9 + − 52 52 52 substitute in our probabilities for each event 40 52 combine like fractions 10 13 reduce fraction to lowest terms answer The probability of drawing a club or a number card from standard 52-card deck is 10 out of 13. 396 – Union of Two Events Sequences and Probability Advanced To find the probability of the complement of a given event. complement of an event – the collection of outcomes in the sample space that are not part of a given event A. If a given event is denoted by A, then its complement is denoted by A'. With few exceptions, the use of probability in quality control in manufacturing is fairly new, dating back to 1920. One exception involves the accuracy of guns and the wear of gun barrels in late 19th-century France. Another exception is the use of a statistical model for small samples to maintain the quality of the fermentation process at the Guinness Brewery in Dublin, Ireland. A man named W. S. Gosset (a.k.a. "Student") developed this quality control process in the early 20th century. Sequences and Probability → Probability → Union of Two Events → Complementary Events In the previous thought we discussed the basics of probability. Recall that an event is defined as the subset of the sample space, containing none, some, or all of the elements within that sample space. But what do we call the elements of the sample space that are not a part of the subset defining the event? Do they have any meaning? As it turns out, sometimes it is more useful to know the probability of finding an outcome in the sample space that is not part of a given event. We call the collections of outcomes in the sample space that are not part of event A the complement of the event A. We denote the complement as A'. Suppose we roll a six-sided die (1d6). The sample space for this experiment is S = {1, 2, 3, 4, 5, 6}. One event we might want to look at is the possibility of rolling an even number. Then we would denote this as A = {2, 4, 6}. The complement of this event would then be all the odd numbers in the sample space: A' = {1, 3, 5}. The probability of rolling an even number is P(A) = 1/2. Similarly, the probability of rolling an odd number is P(A') = 1/2. The probability of rolling an even or an odd number has to be 1, because we MUST do one or the other. Therefore, P(A or B) = 1 1 = = 2 probability of rolling a number from 1 to 6 + 1 2 P(A) + P(A') sum of P(even) and P(odd) substitute in P(A) and P(A') If we rearrange this a little bit, we can see: P(A) + P(A') = 1 sum of probability of event and its complement is ALWAYS equal to 1 P(A') = 1 – P(A) solve for P(A') Formally, we can define the probability of a complement as follows: Sequences and Probability Complementary Events – 397 Probability of a Complement Let A be an event and let A' be its complement. If the probability of A is P(A), then the probability of its complement is: = P(A') 1 – P(A) probability of a complement On some occasions, it is actually quicker to find the probability of an event by first finding its complement and then subtracting the complement from 1. A manufacturer determines that one of its products has a failure rate of about 1 in 1500. That means that the probability the product fails is 1/1500. A retailer orders a shipment of 75 units of the product. What is the probability that at least one of the units is defective? solution One way to solve this problem is find the probability of having 1 defective unit, then find the probability of having 2 units defective, and so on for all 75 units in the order. This is the hard way. The easy way is to find the probability that all units are perfect. Then we subtract this value from 1. The probability that any given unit is perfect is 1499/1500. Since each unit’s perfection is independent of every other unit’s perfection, they are independent events and the probability that all 75 units are perfect is given by: P(A) = ⎛ 1499 ⎞ ⎜ ⎟ ⎝ 1500 ⎠ ≈ 0.951 75 probability of independent events decimal approximation Now we subtract this number from 1 to find the probability that at least 1 unit is defective: P(A') = 1 – P(A) probability of a complement ≈ 1 – 0.951 substitute in P(A) = 0.951 ≈ 0.049 simplify answer Thus, there is an approximately 4.9% probability that at least 1 unit of the shipment is defective. 398 – Complementary Events Sequences and Probability Advanced To define a sequence of numbers. term (sequence) - an element in the range of a sequence. finite sequence - a sequence whose domain consists of the first n integers only. recursive (sequence) - each term of a sequence depends on the value of the previous term of the sequence. The Italian mathematician Fibonacci was one of the leading mathematicians of the Middle Ages. He made significant contributions to the fields of algebra, arithmetic, and geometry, but is most famous for his sequence of numbers (1, 1, 2, 3, 5, 8, 13, ...). The Fibonacci sequence is actually the solution to an obscure problem that appeared in his book Liber Abaci. Interestingly, Fibonacci numbers (i.e. numbers that are members of his sequence) appear regularly in nature, particularly in flowers such as trillium, wildrose, bloodroot, cosmos, columbine, lily blossoms, and iris, each of which has a number of petals equal to a Fibonacci number. Sequences and Probability → Sequences Information is often arranged in some sort of meaningful order. In fact, random information is often classified as "noise" because our minds are not able to process it. Just about everything we encounter in the real world is a result of an attempt to form order from chaos. Usually, when we arrange pieces of information, we place them in a sequence, so that there is a first member, second member, third member, and so on. In mathematics, a sequence is a very special type of function (and yes, it is a function in every sense of the word). The domain of a sequence consists of positive integers (1, 2, 3, 4, etc.). Sequence notation is a little different than our usual function notation f(x). Instead, we denote members of a sequence using subscripts, as shown below: Definition of a Sequence An infinite sequence is a function whose domain is the set of positive integers. The function values a1, a2, a3, … , an, … terms of a sequence are the terms of the sequence. If the domain consists of the first n positive integers only, the sequence is a finite sequence. Note: Sometimes it is useful to begin the subscripts of a sequence with 0 rather than 1. In that case, the terms of the sequence become: a0, a1, a2, a3, … Let's look at a couple of examples of sequences to show a little bit what they are like: Sequences and Probability Sequences – 399 Find the first four terms for each sequence given below: a.) an = 3n + 4 b.) an = 5 + (–1)n c.) an = ( −1) n 2n − 1 solution a.) The first four terms of the sequence given by an = 3n + 4 are: a1 = 3(1) + 4 = 7 first term a2 = 3(2) + 4 = 10 second term a3 = 3(3) + 4 = 13 third term a4 = 3(4) + 4 = 16 fourth term answer Notice that the difference between two consecutive terms is 3. This is an example of an arithmetic sequence. b.) The first four terms of the sequence given by an = 5 + (–1)n are: a1 = 5 + (–1)1 = 4 first term a2 = 5 + (–1)2 = 6 second term a3 = 5 + (–1)3 = 4 third term a4 = 5 + (–1)4 = 6 fourth term answer Notice that we are alternating between 4 and 6 in our list of terms. If we continue this sequence out to infinity, the pattern will continue to repeat itself. c.) The first four terms of this sequence are as follows: a1 = a2 = a3 = a4 = ( −1) 1 2( 1) − 1 ( −1) 2 2( 2) − 1 ( −1) = 3 2( 3) − 1 ( −1) = = 4 2( 4) − 1 = –1 1 second term 3 − 1 7 first term 1 5 third term fourth term answer Just looking at the first few terms of a sequence is not enough to fully determine the pattern of a sequence. In order to fully define a unique sequence, the nth term must be given! Consider the following two sequences, both of which have the same first three terms: 400 – Sequences Sequences and Probability 1 1 1 1 1 , , , , ...., , .... n 2 4 8 16 2 and 6 1 1 1 1 , , , , ...., 2 2 4 8 15 ( n + 1) n − n + 6 ( ) After the third term, these two sequences start to look very different from each other. If we are just given the first few terms, all we can do is find an apparent nth sequence. If we had just been given the first three terms of the first sequence above, we would probably have agreed that the apparent nth term was 1/(2n). We would never have considered that the second nth term would be a possibility as well. Write an expression for the apparent nth term (an) of the given sequence. a.) 3, 7, 11, 15, 19, … b.) 1+ 1 1 ,1 + 1 2 1 ,1 + 3 ,1 + 1 4 ,1 + 1 5 , .... solution a.) n: 1 2 3 4 5 ...n Terms: 3 7 11 15 19 . . . an Apparent pattern: Each term is one less than 4 times n, which implies that: = 4n – 1 answer n 1 2 Terms: 1+ an b.) 1 1+ 1 3 1 2 1+ 4 1 3 1+ 5 1 4 1+ ...n 1 5 . . . an Apparent pattern: Each term is the sum of 1 and 1/n, which implies that: an = 1+ 1 n answer Finally, some sequences are defined recursively. That is, each term of the sequence depends on the previous term of the sequence. In order to find the terms of the sequence, you need the first few terms of the sequence as well as the general pattern for the recursion. One of the most famous examples of a recursive sequence is the Fibonacci Sequence: a0 = 1, a1 = 1, ak = ak – 2 + ak – 1 Sequences and Probability where k ≥ 2 Sequences – 401 The first six terms of the Fibonacci Sequence are: Note: a0 = 1 0th term a1 = 1 first term a2 = a0 + a1 = 1+1 = 2 second term a3 = a1 + a2 = 1+2 = 3 third term a4 = a2 + a3 = 2+3 = 5 fourth term a5 = a3 + a4 = 3+5 = 8 fifth term This is an example of a sequence that starts with a0 instead of a1. 402 – Sequences Sequences and Probability Advanced To learn about arithmetic sequences and nth partial sums. arithmetic sequence – a sequence whose consecutive terms all have a common difference separating them. common difference – the difference between any two consecutive terms in an arithmetic sequence is a constant, represented by d. nth partial sum – the sum of the first n terms of an infinite sequence. Archytus of Tarentum (c.428-c.350 B.C.) is credited as being the inventor of the arithmetic mean. He also invented the simple pulley and the screw. Besides being a well-known mathematician, he was also a brilliant military strategist and statesman. Supposedly, troops under his command never lost a battle. He served as general of the citizen army of Tarentum for seven years, despite the fact that local law prohibited anyone from holding the post for more than one year. Sequences and Probability → Sequences → Arithmetic Sequence In the previous thought, we introduced you to the basic concept of a sequence. There are an infinite number of sequences out there. Some of them can be classified into two different types: arithmetic or geometric. Simply put, an arithmetic sequence is one whose terms all have a common difference separating them. That is, if we take any two terms and subtract them from each other, we will obtain the same number, regardless of which terms we use. For example, the sequence 2, 4, 6, 8, 10, . . . is arithmetic because the difference between any two consecutive terms is 2. More formally, we can define an arithmetic sequence as: Definition of an Arithmetic Sequence A sequence is arithmetic if the differences between consecutive terms are the same. Thus, the sequence a1, a2, a3, . . . , an, . . . is arithmetic if there is a number d such that a2 – a1 = d and a3 – a2 = . . . = d and d and so on an + 1 – an The number d is the common difference of the arithmetic sequence. Now, the definition by itself is not all that helpful. If we see a sequence, we can use the above definition to help us determine if it is arithmetic or not (just find the difference of consecutive terms; if it is always the same, then Sequences and Probability Arithmetic Sequence – 403 the sequence is arithmetic). Once we know that a particular sequence is arithmetic, we usually want to find the nth term of that sequence. To do that, we need the following theorem. The nth Term of an Arithmetic Sequence The nth term of an arithmetic sequence has the form an = dn + c nth term of arithmetic sequence where d is the common difference between consecutive terms and c = a1 – d. Find a formula for the nth term of the arithmetic sequence whose common difference is 5 and whose first term is 3. solution We are told the sequence is arithmetic. Furthermore, we know that the common difference is 5 and the first term is 3. This implies that the second term is 3 + 5 = 8, the third term is 8 + 5 = 13, the fourth term is 13 + 5 = 18, and so on. The nth term must be in the form an = dn + c, where d in this case happens to be 5. Because a1 = 3, it follows that: c = a1 – d formula to find c = 3–5 substitute in a1 = 3 and d = 5 = –2 simplify Thus, the formula for the nth term is: an = dn + c nth term of arithmetic sequence an = 5n – 2 substitute in d = 5 and c = –2 answer Once we have determined the nth term of an arithmetic sequence, it is extremely easy to find any other term of the sequence, provided that we know the previous term. If the nth term is given by an = dn + c nth term of arithmetic sequence then the (n + 1)th term is given by an + 1 = an + d (n + 1)th term of arithmetic sequence This is known as a recursive formula because the (n + 1)th term is dependent on the nth term. Similarly, the (n + 2)th term is dependent on the (n + 1)th term, and so on. Finally, there is a very simple formula to find the sum of a finite arithmetic sequence. All we need to know is n, the number of terms, as well as the first and last terms of the finite arithmetic sequence. Then we can find the sum of the sequence using the following formula: 404 – Arithmetic Sequence Sequences and Probability The Sum of a Finite Arithmetic Sequence The sum of a finite arithmetic sequence with n terms is given by = Sn Note: sum of a finite arithmetic sequence This formula only works with arithmetic sequences. We use a different formula to find the sum of a finite geometric sequence. Other types of finite sequences also have different formulas to find the sum. The sum of the first n terms of an infinite sequence is called the nth partial sum. Find the sum of all integers from 1 to 150. solution The integers from 1 to 150 form an arithmetic sequence that has 150 terms. Thus, we can use the formula given above to find the sum of these 150 terms: = S150 = = 1 + 2 + 3 + 4 + . . . + 149 + 150 n 2 (a1 + an ) 150 2 sum of integers from 1 to 150 sum of a finite arithmetic sequence ( 1 + 150) substitute in n = 150, a1 = 1, and a150 = 150 = 75(151) simplify = 11325 simplify answer Find the sum of the first 32 terms of the arithmetic sequence 5, 18, 31, 44, 57, 70, . . . solution First, we need to know what the 32nd term is for this sequence. That means we need the nth term, which takes the form an = dn + c. In this case, taking the difference of consecutive terms reveals that the common difference d = 13. We are given that a1 = 5, which means that c = a1 – d = 5 – 13 = –8. Therefore, the nth term of the sequence is: an = 13n – 8 nth term of the sequence When we substitute in n = 32, we find that the 32nd term is: a32 = 13(32) – 8 substitute in n = 32 = 408 32nd term of our arithmetic sequence Sequences and Probability Arithmetic Sequence – 405 Now that we know the 32nd term (408) and the first term (5), we can go ahead and apply the formula for finding the sum of a finite arithmetic sequence: Sn = S32 = n 2 (a1 + an ) 32 2 ( 5 + 408) sum of a finite arithmetic sequence substitute in n = 32, a1 = 5 and a32 = 408 = 16(413) simplify = 6608 simplify answer 406 – Arithmetic Sequence Sequences and Probability Advanced To learn about the definition of geometric sequences and how to calculate the sum of a geometric sequence. geometric sequence – a sequence with a constant ratio between successive terms. common ratio – a ratio which is the same between all consecutive terms in a geometric sequence. nth partial sum – the sum of the first n terms of a sequence. Archytus of Tarentum (c.428-c.350 B.C.) developed the geometric mean, the arithmetic mean, and the harmonic mean. His most famous mathematical achievement was to develop an elegant solution to the problem of doubling a cube, i.e. enlarging a cube according to a given ratio. He solved it by inventing a new type of threedimensional curve through the intersection of a cylinder, a cone, and a torus (or doughnut-shape). Sequences and Probability → Sequences → Geometric Sequence In the thought Arithmetic Sequence, we defined an arithmetic sequence as one whose terms can be defined in terms of a common difference between consecutive terms. Now we will look at another major type of sequence: the geometric sequence, whose terms are defined in terms of a common ratio. That is, if we take any two consecutive terms and divide them into each other, we will obtain a constant ratio r. Here is an example of a geometric sequence: 2, 4, 8, 16, 32, 64, 128, 256, 512, 1024, 4096, . . . geometric sequence If we take any two consecutive terms of this sequence and divide one term by the other, we will obtain the common ratio of 2 (or 1/2, depending on which number gets divided into the other). More specifically, we can define a geometric sequence as follows: Definition of a Geometric Sequence A sequence is geometric if the ratios of consecutive terms are the same. = r and = r and r with r ≠ 0 . . . = The number r is called the common ratio of the sequence. All this definition can do for us is tell us if a sequence is geometric or not. To determine if a sequence is geometric, we take any two consecutive terms and find their quotient. If we obtain the same quotient for any two consecutive terms, then we have found the common ratio and the sequence is geometric. Sequences and Probability Geometric Sequence – 407 Usually, however, we also like to be able to find some formula for a sequence that will allow us to find the nth term of the geometric sequence. The nth Term of a Geometric Sequence The nth term of a geometric sequence has the form: a1rn – 1 = an nth term of a geometric sequence where r is the common ratio of consecutive terms of the sequence. Thus, every geometric sequence can be written in the form: a1, a1r, a1r2, a1r3, . . . , a1rn – 2, a1rn – 1, . . . Note: first n terms of a geometric sequence The subscript of the a-term and the superscript of r must add up to n. Thus, the 5th term of a geometric sequence can be written in the following ways: a1r4 = a2r3 a3r2 = = a4r = a5r0 = a5 equivalent expressions for the 5th term of a geometric sequence This is another example of recursive thinking, where the next term of the sequence is dependent on the preceding term of the sequence. Determine whether the sequence is geometric. If it is, find the common ratio and find the 9th term of the sequence. 5, 1, 0.2, 0.04, . . . solution To determine if the sequence is geometric, we divide consecutive terms and see if we obtain a constant number each time: a2 a1 a3 a2 a4 a3 = = = 1 5 0.2 1 0.04 0.2 = 0.2 divide second term by first term = 0.2 divide third term by second term = 0.2 divide fourth term by third term Since we obtain the same number each time we divide consecutive terms in this sequence, we conclude that it is geometric. Furthermore, the common ratio is r = 0.2 = 1/5. Once we have the common ratio, we can find the nth term of this geometric sequence, since we also know the first term is 5. This means that the nth term of the sequence has the form: an = 5(0.2)n – 1 nth term of a geometric sequence with a1 = 5 and r = 0.2 a25 = 5(0.2)9 – 1 substitute in n = 9 a25 = 5(0.2)8 simplify exponent = 0.0000128 simplify 408 – Geometric Sequence answer Sequences and Probability Sum of a Geometric Sequence Geometric sequences work quite differently than arithmetic sequences, so we need to take extra care that we are handling them correctly. For one thing, we can find a finite sum for a finite geometric sequence, but we can, in some cases, find a finite sum for an infinite geometric sequence. It is very important that we know if the geometric sequence we are working with is infinite or finite. The Sum of a FINITE Geometric Sequence The sum of the geometric sequence a1, a1r, a1r2, a1r3, a1r4, . . . , a1rn – 1 with common ratio r ≠ 1 is given by = Sn Note: sum of a finite geometric sequence This is called the nth partial sum of the geometric sequence. Find the sum. 9 ∑ 2 k− 1 =1 k solution Rather than write out each term, it is simpler to apply the formula just given. Here we are raising the number 2 to the power of k – 1, where k is the index of summation. Thus, the first term of the geometric sequence is: a1 = 21 – 1 = 1 first term of the geometric sequence Then our formula becomes: 9 ∑ k Note: k−1 2 = ⎛ 1 − rn ⎞ a 1⎜ ⎟ ⎝ 1−r ⎠ sum of a finite geometric sequence n is the upper limit of summation n=9 = 1⎜ ⎛ 1 − 29 ⎞ ⎟ ⎝ 1−2 ⎠ substitute in r = 2, n = 9 and a1 = 1 = 511 simplify using a calculator answer =1 We have to be careful with our indices of summation. If we had k = 0, then we have to adjust the formula for the nth partial sum. For instance, if our example had begun with k = 0, the sum would have been: 9 ∑ k k−1 2 = =0 Sequences and Probability 1 2 9 + ∑ k k−1 2 = 511.5 =1 Geometric Sequence – 409 The formula for the nth partial sum of a finite geometric sequence can, depending on the value of r, produce a formula for the sum of an infinite geometric sequence. More specifically, if | r | < 1, then it can be shown that rn becomes arbitrarily close to zero as n increases without bound. In calculus terms, we are talking about limits. If we have a common ratio r such that | r | < 1, then we can apply the following formula to find the sum of an infinite geometric sequence: The Sum of an INFINITE Geometric Sequence If | r | < 1, the infinite geometric sequence a1, a1r, a1r2, a1r3, a1r4, . . . , a1rn – 1, arn, . . . has the sum = S sum of an infinite geometric sequence Notice that this formula is completely independent of the number of terms we have in our geometric sequence. All we need to know is the first term, which is usually easy to figure out, and the common ratio, which again can be determined without too much difficulty. Find the sum. ∞ ∑ 4 ⎛⎜ =0 n 1⎞ n ⎟ ⎝ 4⎠ solution First of all notice that the index of summation starts at 0, not 1, so we have to make a slight adjustment to our summation notation: ∞ ∑ n =0 1 4 ⎛⎜ ⎞⎟ ⎝ 4⎠ ∞ n = 4+ ∑ n =1 4 ⎛⎜ 1⎞ ⎟ ⎝ 4⎠ n rewrite summation notation so that lower index starts at n = 1 Also notice that | r | = 1/4 < 1, which means that this infinite geometric sequence does, in fact, have a finite sum. Now we need to find a1, so we let n = 1: a1 = 4 ⎛⎜ = 1 1⎞ 1 ⎟ ⎝ 4⎠ 410 – Geometric Sequence let n = 1 simplify Sequences and Probability We are ready to apply the formula for the sum of an infinite geometric sequence now. We know that a1 = 1 and that r = 1/4. Don't forget the extra "4" in front of our summation notation: S = = 1 4+ 1− 4+ 4+ sum of this infinite geometric series 4 1 4 4 = 1 − 1 convert terms in denominator to LCD form 4 1 simplify denominator 3 4 = = S = 4+ 12 3 4 flip fraction in denominator upside down 3 + 16 3 Sequences and Probability 4 3 convert all terms to LCD form simplify answer Geometric Sequence – 411 412 – Geometric Sequence Sequences and Probability Advanced To learn a mathematical shortcut for adding many numbers together. summation notation – a symbol that indicates that we are taking the sum of a group of numbers. sigma notation – a Greek symbol indicating that we are taking the sum of a group of numbers, synonymous with summation notation. index of summation – variable indicating the position of a number in the summation. upper limit of summation – the largest value of the index of summation. lower limit of summation – the smallest value of the index of summation. convergent (sequence) – infinite sequence whose sum yields a finite number. series – the summation of the terms of a sequence (finite or infinite). infinite series – a series that contains an infinite number of terms in the summation. Aristarchus of Samos (c.320-c.230 B.C.), Greek astronomer and geometer, was among the first, if not the first, to propose the Earth traveled around the sun. He also suggested the Earth rotates on its own axis. Both ideas were ridiculed at the time because they contradicted visual evidence and Aristotelian thought. Aristarchus attempted to utilize trigonometric methods to find the distance between the Earth and the sun, but he lacked the right tools for the job. His estimate for the size of the sun is off by a factor of around 400. Sequences and Probability → Sequences → Summation Notation Whenever we have to add a lot of numbers together, especially the terms of a finite sequence, we use a special notation to indicate that we are taking the sum of a group of numbers. It is called summation notation or sigma notation because it involves the use of the Greek upper case letter sigma, written as . Definition of Summation Notation The sum of the first n terms of a sequence is represented by: = a1 + a2 + a3 + . . . + an summation notation where i is called the index of summation, n is the upper limit of summation, and 1 (or sometimes 0) is the lower limit of summation. Summation is read as: "The sum of the values of a sub i from i equals 1 to i equals n is…." While we use the letter i for the index of summation in the definition, we can use any letter we want for the index of summation. Other common letters for the index of summation include j and k. Also the lower limit of summation does not have to be 1. Many times it is more convenient to start at 0. We can use any positive integer we need for the lower limit of summation. Sequences and Probability Summation Notation – 413 Let's see how we can evaluate a given sum: Find the sum. 5 a.) ∑ ( 2⋅ i + 1) =1 i 4 b.) ∑ k =1 6 c.) ∑ j 10 =3 1 j solution a.) To find the sum, we need to add 5 terms together. The first term is (2(1) + 1), the second term is (2(2) + 1) and so on. 5 ∑ ( 2⋅ i + 1) = (2(1) + 1) + (2(2) + 1) + (2(3) + 1) + (2(4) + 1) + (2(5) + 1) =1 i add five terms together b.) = 3 + 5 + 7 + 9 + 11 simplify each term = 35 simplify answer Note that the argument of the summation is not dependent on the index of summation. This means that we have four terms, each of which is simply "10". 4 ∑ k c.) 10 = 10 + 10 + 10 + 10 add four tens together = 40 simplify answer =1 The argument in this case is dependent on the index of summation. Also note that the index starts at "3" instead of "1", so the first term of the summation is 1/3, the second term is 1/4, and so on: 6 ∑ j =3 1 j = = 414 – Summation Notation 1 3 + 19 20 1 4 + 1 5 + 1 6 add four terms together simplify answer Sequences and Probability Summations have some important properties that we can take advantage of. For example, if we have the sum of two summations with identical indices of summation and identical upper limits of summation, then we can combine them into a single summation expression. Properties of Summation 1.) = 2.) = 3.) = c is any constant Proving any of these properties is a matter of applying the Associative Property of Addition, the Commutative Property of Addition, and the Distributive Property of Multiplication Over Addition. If we take a look at the first property, we start with expanding the summation on the left so that we have the sum of all its terms. Then we factor out the "c" since it is common to all the terms. Finally, we can reduce the terms inside parentheses into summation notation. n ∑ i ca i = ca1 + ca2 + ca3 + . . . + can expand summation = c(a1 + a2 + a3 + . . . + an) factor out the "c" from each term = ⎛ n ⎞ c⎜ ai ⎟ ⎜ ⎟ ⎝i = 1 ⎠ collapse remaining terms into a summation proof =1 ∑ Sometimes two sums can have two different summation notations, depending on their lower and upper indices of notation. Consider the following three sums: 4 ∑ 3(2 ) i i 4 3⋅ = =1 ∑ i i = 3⋅ 5 i ∑ i ∑ ( i−1) =2 3(21 + 22 + 23 + 24) = 90 3 =0 32 = =1 3 ∑ 3(2i+1) i 2 i+ 1 2 = 3(21 + 22 + 23 + 24) = 90 = 3(21 + 22 + 23 + 24) = 90 =0 5 = 3⋅ ∑ i i− 1 2 =2 Even though each summation has different lower and upper limits, the summation still produces the exact same result in all three cases. So far we have been discussing the sums of finite sequences. The sum of a finite sequence must be, by definition (and common sense), a finite number. One of the more amazing discoveries in mathematics is that for some infinite sequences, the sum of an infinite number of terms is also a finite number. Infinite sequences Sequences and Probability Summation Notation – 415 whose sums yield a finite number are said to converge to that number. In the case of the sum of an infinite sequence, the upper index of summation is always written as (infinity). For example, we can use calculus to demonstrate that the sum of all the terms of an infinite sequence whose nth term is 1 / 2n (with n beginning at 0) is: ∞ ∑ i Note: =0 1 n = 1+ = 2 2 1 2 + 1 4 + 1 8 + 1 16 + .... sum of a convergent infinite sequence a finite number The summation of the terms of a sequence (finite or infinite) is called a series. The summation ∞ ∑ i ai = a1 + a2 + a3 + a4 + . . . infinite series =1 is called an infinite series. Infinite series have important uses in calculus. You will study infinite sequences and series in excruciating detail when you take calculus. 416 – Summation Notation Sequences and Probability Appendix Appendix – 417 418 – Appendix Advanced Every single letter of the Roman alphabet (the one used in English) has a use in algebra. Here is a list of some of the most common uses of letters in algebra: Note: These are the most common conventional meanings for these letters. A test question may want you to solve a problem that has e(b) = t(q) + 2 just to confuse you. It is important to understand the context in which each letter appears. The letter "m", for example, only means slope in the context of lines. If the problem wants you to find out how much an object weighs, then m most likely indicates mass. Any and all of these letters can be used to indicate either a constant term or a variable. a b c d These four letters are often used to represent constant terms, such as a = 1, b = 2, etc… d is also used in many problems to represent distance e Usually reserved for the special number e ≈ 2.7182818285 Used to define the natural logarithmic function f Most often used in function notation y = f(x) g If f is already being used for a function, then we use g to represent a second function h Used in function notation if f and g are already taken Also denotes height in many types of problems Used in conic sections to describe the x-component of translation of the conic i Reserved to represent the complex number √(–1) Also used in vector notation to describe the x-component of a vector Used with matrices to indicate a specific row of a matrix, as in [aij] j Used in electronics to represent the complex number √(–1) Used in vector notation to describe the y-component of a vector Sometimes used to represent a function if f, g, and h are already being used Used with matrices to indicate a specific column of a matrix, as in [aij] k Used in conic sections to describe the y-component of translation of the conic Used in vector notation to describe the z-component of the vector l Most often used to describe the length dimension, as in a rod of length l (or L) In general, l is avoided because it looks too similar to the numeral one (1 or l, which is which?) m In science, used to describe the mass dimension, as in a rock of mass m Also the slope of a line Used with matrices to indicate the number of rows of matrix A n Used as a constant term along with a, b, c, and d Often used as a subscript, such as xn, or a superscript, such as xn, where n is a positive integer (or 0) Used with matrices to indicate the number of columns of matrix A o Generally avoided as it creates too much confusion with 0 O (capitalized) is often used to mark the origin of a graph p P (capitalized) is used to mark a point on a graph p (lower case) is used to represent the numerator of a fraction: p / q In a parabola, p is the distance between the vertex and the focus, as well as the distance between the vertex and the directrix Appendix A Algebra Alphabet – 419 q Q (capitalized) is used to mark another point on a graph if P is already being used q (lower case) is used to represent the denominator of a fraction: p / q r Often stands for the radius of a circle or other round object (cylinder, sphere, etc…) s Used to denote arc length in the formula s = r (arc length of a sector of a circle), where is measured in radians t Most often used to denote time Used in parametric equations as the parameter of the equation u Used as a generic variable, much like x v Used as a second generic variable if u is already taken, sort of like y w Used as a third generic variable if u and v are already taken, sort of like z Also denotes width x The granddaddy of generic variables, often called the independent variable, can be used to represent any quantity Also denotes the horizontal distance and direction from the origin on a graph y The second most popular generic variable, usually dependent on x: y = f(x) Also denotes the vertical distance and direction from the origin on a graph z If x and y are already in use, then z is a third variable that is usually dependent on both x and y On a graph, the z-axis is perpendicular to both the x- and y-axes In general, we use capital letters (A, B, C, etc) in geometry to denote points or vertices of polygons. We use them in conjunction with matrices to represent a matrix A, B, C, etc. Elements within the matrix are denoted with lower case letters (a11, b12, etc.). The matrix X is used to indicate a matrix whose elements are all unknowns and represents the solution to a system of linear equations. 420 – Algebra Alphabet Appendix A Glossary Algebra Glossary (Advanced) – 421 422 – Algebra Glossary (Advanced) Advanced –A– absolute value – the absolute value of a number is its distance from 0 on a number line. additive identity (matrices) – a matrix O such that A + O = A, provided the matrices A and O have the same dimensions. additive inverse – the opposite of a number. If the number a is negative, then the additive inverse of a is a positive number. adjoining (matrices) – creating a larger matrix out of two or more smaller matrices. algebraic expression – a combining of numbers, variables and operations in a way that stands for a number. arithmetic combination – the sum, difference, product, or quotient of two or more functions. arithmetic sequence – a sequence whose consecutive terms all have a common difference separating them. asymptote – a line approached by the graph of a function such that the graph never touches or crosses the line. asymptotes (hyperbola) – the two lines that are approached by the points on the branches of a hyperbola as the points get farther from the foci. The hyperbola never touches the asymptotes. augmented matrix – a matrix derived from a system of linear equations, where the last column of the matrix represents the constant terms of the system. axis of symmetry – the line passing through both the focus and the vertex. –B– base – the repeated factor in a power. In a2, a is the base. bell-shaped curve – the graph of a normal distribution. binomial series – a binomial expansion which contains infinitely many terms. bounded – any interval which has a finite upper and lower boundary. –C– center (circle) – the point in the plane from which all other points on the circle are defined. circle – a plane curve consisting of all points at a given distance from a fixed point in the plane. coefficient – the numerical factor of a monomial expression. coefficient matrix – the matrix derived from the coefficients of the variables in a linear system. It does not include any of the constant terms of the system. columns – vertical lines of numbers or variables in a matrix. common difference – the difference between any two consecutive terms in an arithmetic sequence is a constant, represented by d. common logarithmic function – logarithmic function that uses base 10. common ratio – a ratio which is the same between all consecutive terms in a geometric sequence. complement of an event – the collection of outcomes in the sample space that are not part of a given event A. If a given event is denoted by A, then its complement is denoted by A'. Algebra Glossary (Advanced) – 423 complex conjugate – the complex conjugate of a + bi is a – bi and vice versa. The product of complex conjugates is a real number. complex number – any number, real or imaginary, of the form a + bi, where a and b are real numbers and i = √(–1). composition of functions – the function that results from first applying one function, then another; denoted by the symbol o. conditional equation – an equation that is true for only a few members in the domain. conic section algebraic – plane curves represented by second-degree polynomial equations. geometric – the planar shapes that result from intersecting a cone with a plane at various angles. conjugate axis – a line perpendicular to the transverse axis and passing through the center of the hyperbola. consistent – a system of linear equations that has at least one solution. constant – a numerical symbol that never changes its values. constant function – a function in which f(x) is simply equal to a real (or complex) number. constant term – term in an expression that does not contain any variables. convergent (sequence) – infinite sequence whose sum yields a finite number. cube root – a cube root of a number c is the solution of the equation x3 = c. –D– denominator – lies below the line of division in a fraction. This is the divisor. If the numerator equals 1, then the denominator determines in how many parts the unit is to be divided. dependent variable – a variable whose value(s) always depend on the values of other variable(s). determinant – associated with each square matrix is a real number called the determinant of the matrix. The number of elements in any row or column of the square matrix is called the order of the determinant. diagonal matrix – matrix that has nonzero elements on the main diagonal and zero elements everywhere else. difference of focal radii – the absolute value of the difference of the distances from a point on a hyperbola to the two foci of the hyperbola. directrix – a line associated with a parabola such that the distance from it to any point on the parabola is equal to the distance from that point to the focus. distance – a measure of how far apart two things are from each other. domain (of a function) – the set of values which are allowable substitutions for the independent variables. –E– eccentricity – a measure of the ovalness of an ellipse. eccentricity (hyperbola) – a measure of the flatness or roundedness of the branches of a hyperbola. elementary row operations – interchanging, multiplying, and adding rows of a matrix to find solutions for each of the variables in the linear system represented by the matrix. ellipse – the set of points P in a plane which satisfy PF1 + PF2 = d, where F1 and F2 (its foci) are any two fixed points and d (its focal constant) is a constant with d > F1F2. 424 – Algebra Glossary (Advanced) entry (matrix) – the object in a particular row and column of a matrix. equivalent equations – equations having the same solution set over a given domain. evaluate – to find the value of an expression for a given value. expanding by cofactors – calculating the determinant of a matrix by taking the sum of the entries in any row and multiplying by the cofactors associated with that row. exponent – in a power, the number of times the base occurs as a factor. In the expression 23, 3 is the exponent and tells us that 2 occurs as a factor three times. exponential function – a function with the independent variable as an exponent. A function with an equation of the form y = abx. extracting square roots – the process of solving a quadratic equation by isolating the x2 term on one side, constant terms on the other sides, and then finding the square root of both sides. –F– factorial – for a positive integer n, the product of all the positive integers less than or equal to n. finite sequence – a sequence whose domain consists of the first n integers only. focal radius – a line connecting a point and a focus. foci (hyperbola) – the two points from which the difference of distances to a point on the hyperbola is constant. focus (plural foci) – fixed points from which the sum of distance to a point on the conic is a constant. focus (parabola) – the point along with the directrix from which a point on a parabola is equidistant. function – an association of exactly one object from one set (the range) with each object from another set (the domain). A relationship in which different ordered pairs have different first coordinates. –G– geometric sequence – a sequence with a constant ratio between successive terms. greater than – one value is larger than a compared value. greater than or equal to – one value is larger than or the same as a compared value. –H– horizontal asymptote – a horizontal line approached by the graph of a function such that the graph never touches or crosses the horizontal line. –I– identity – an equation whose sides are equivalent expressions. identity matrix – a matrix that consists of 1's on its main diagonal and 0's elsewhere. imaginary number – a number that is the square root of a negative real number. inclusive or – in the statement a or b, the answer may be a, it may be b, and the answer could also be both a and b. inconsistent – a system of linear equations that has no solution. independent events – the occurrence of one event has no impact whatsoever on the occurrence of another event. Algebra Glossary (Advanced) – 425 independent variable – in a formula, a variable upon whose value other variables depend. index (radical) – a real number placed above and to the left of a radical symbol to indicate what root is sought. index of summation – variable indicating the position of a number in the summation. infinite series – a series that contains an infinite number of terms in the summation. integers – the set of whole numbers and their negative counterparts. interval – any subset of real numbers. irrational number – any number that cannot be expressed as a terminating or repeating decimal. inverse function – the functions f and g are inverses of each other if f(g(x)) = x in the domain of g and g(f(x)) = x for all x in the domain of f. inverse properties – one property that can be used to undo another property, and vice versa. invertible – a matrix with an inverse. –J– –K– –L– leading 1 – the first nonzero entry of a row that does not consist entirely of zeros. less than – one value is smaller than a compared value. less than or equal to – one value is smaller than or the same as a compared value. like terms – terms that have exactly the same variables to the same powers. linear equation – an equation in which each term is either a constant or a monomial of degree 1. linear function – a polynomial function of the first degree. logistics curve – a growth curve described by the equation below: a y − ( x− c) = 1 + be d lower limit of summation – the smallest value of the index of summation. lower triangular matrix – matrix containing nonzero elements on and below the main diagonal and zero elements above the main diagonal. –M– main diagonal entries – matrix entries that form a diagonal line from the top left corner to the lower right. major axis – the line segment joining the two foci of an ellipse and which has endpoints at the vertices of the ellipse. matrix – a rectangular array of elements used to facilitate the study of problems in which the relation between these elements is fundamental. 426 – Algebra Glossary (Advanced) minor axis – line segment perpendicular to the major axis, passing through the center of the ellipse. multiplicative inverse – the reciprocal of a number. multiplicity of a root r – for a root r of a polynomial equation P(x) = 0, the highest power of x – r that appears as a factor of P(x). mutually exclusive – sets with no common elements. –N– natural logarithm – logarithms using base e. natural numbers – all positive integers. negative real zeros – negative real x-values for which the function equals zero. nonsingular – a matrix with an inverse. This is synonymous with invertible. normal distribution – the continuous function that the binomial distribution approaches as n, the number of trials, increases without bound. nth partial sum – the sum of the first n terms of a sequence. numerator – signifies the number of parts of the denominator that is taken. In a fraction, the numerator lies above the line representing division. –O– one-to-one correspondence – every point on the number line corresponds to exactly one real number. origin – the point 0 on the real number line. –P– parabola – the set of all points in the plane of a line l and a point F not on l whose distance from F equals its distance from l. parallel lines – two lines are parallel if there is a plane in which they both lie and they do not cross at any point within the plane. partial fraction decomposition – the method by which we transform one rational function into the sum of two or more simpler rational functions. perpendicular lines – two straight lines in a plane, the intersection of which forms right angles. point-slope form – one form of a linear equation derived from knowing one point on the line and its slope. positive real zeros – positive real x-values for which the function equals zero. power – a product of equal factors. The repeated factor is the base. A positive exponent tells the number of times the base occurs as a factor. principle nth root of a – the nth root that has the same sign as a. –Q– Quadratic Formula – an important formula that can be used to solve any quadratic equation for the independent variable. quadratic function – a polynomial function of the second degree. Algebra Glossary (Advanced) – 427 –R– radical symbol – an operator symbol that indicates we are taking the nth root of a real number, variable, or algebraic expression. radicand – the quantity under a radical symbol. radius – the distance between the center and a point on the circle. range – the set of values of the function evaluated at points contained within the domain. rational function – a function defined by a simplified rational expression. rational number – any number that can be expressed as a terminating or repeating decimal. real number – a rational or an irrational number, which can be represented by a finite or infinite decimal. real number line – a line with numbers evenly spaced throughout the line. recursive (sequence) – each term of a sequence depends on the value of the previous term of the sequence. reduced row-echelon form – an augmented matrix that has 1's along the main diagonal of the coefficient matrix and zeros in every entry above and below the main diagonal of the coefficient matrix. repeated roots (or zeros) – a solution to a polynomial equation that occurs more than once. row-echelon form (linear system) – a linear system in a stair-step form with the leading coefficients of each line being 1. row-echelon form (matrices) – a matrix that has a stair-step pattern with leading coefficients of 1 along the main diagonal and zeros below the main diagonal. row-equivalent – two matrices are said to be row-equivalent when one matrix can be obtained from the other matrix by applying elementary row operations. rows – horizontal lines of numbers and variables in a matrix. –S– scalar – a real number. scalar multiplication – the product of a real number and a matrix. second-degree polynomial equation – a polynomial which takes the form ax2 + bx + c = 0, where a ≠ 0. semi-major axis – line segment that is half the length of the major axis. Its endpoints are the center of the ellipse and one of the endpoints. semi-minor axis – a line segment half the length of the minor axis, perpendicular to the semi-major axis. series – the summation of the terms of a sequence (finite or infinite). sigma notation – a Greek symbol (Σ) indicating that we are taking the sum of a group of numbers, synonymous with summation notation. sigmoidal curve – another name for a logistics growth curve. singular – a matrix without an inverse. slant asymptote – an asymptote to a rational function that occurs when the degree of the numerator is greater than the degree of the denominator. solution – a replacement of the variable(s) in an algebraic sentence that makes the sentence true. 428 – Algebra Glossary (Advanced) solve – to find the result by the use of certain given data, previously known facts or methods, and newly observed relations. square root – a square root of a number b is the solution of the equation x2 = b. summation notation – a symbol that indicates that we are taking the sum of a group of numbers. –T– term – a constant number of a variable or the product of numbers and variables. term (sequence) – an element in the range of a sequence. transverse axis – the line passing through both the vertices and the foci of a hyperbola. –U– unbounded – an interval that has infinity as one or both of its endpoints. upper limit of summation – the largest value of the index of summation. upper triangular matrix – matrix containing nonzero elements on and above the main diagonal and zero elements below the main diagonal. –V– variable – a symbol which can represent any one of a set of numbers or other objects (including other algebraic expressions). variable term – term in an expression that contains variables. variation in sign – two consecutive coefficients in a polynomial function having opposite signs. vertical asymptote – a vertical line approached by the graph of a function such that the graph never touches or crosses the vertical line. vertices (ellipse) – endpoints of the major axis of an ellipse. vertex – the intersection of a parabola with its axis of symmetry. –W– whole numbers – all positive integers and zero. –X– –Y– –Z– zero matrix – a matrix in which all the elements are zero. Algebra Glossary (Advanced) – 429 430 – Algebra Glossary (Advanced) Works Consulted Dolciani, Mary P., et al. Algebra and Trigonometry: Structure and Method, Book 2. Boston: Houghton Mifflin Company, 1988. Fair, Jan, and Sadie C. Bragg. Algebra 1. Englewood Cliffs, NJ: Prentice Hall, 1990. Ifrah, George. The Universal History of Numbers. New York: John Wiley & Sons, Inc., 1991. Larson, Roland E., Robert P. Hostetler, and Bruce H. Edwards. Calculus with Analytic Geometry, Fourth Edition. Lexington, MA: D.C. Heath and Company, 1990. Larson, Roland E., and Robert P. Hostetler. Algebra and Trigonometry, Fourth Edition. Boston: Houghton Mifflin Company, 1997. Larson, Ron, and Robert P. Hostetler. Algebra and Trigonometry, Fifth Edition. Boston: Houghton Mifflin Company, 2001. McConnell, John W., et al. The University of Chicago School Mathematics Project: Algebra, Second Edition. Glenview, IL: Scott, Foresman, Addison, Wesley, 1998. Pappas, Theoni. The Magic of Mathematics. San Carlos, CA: Wide World Publishing/Tetra, 1994. Senk, Sharon L, et al. The University of Chicago School Mathematics Project: Advanced Algebra, Second Edition. Glenview, IL: Scott, Foresman, Addison, Wesley, 1998. Stewart, James. Calculus, Fourth Edition. Pacific Grove, CA: Brooks/Cole Publishing Company, 1999. Young, Robyn V, ed. Notable Mathematicians: From Ancient Times to the Present. Detroit: Gale Research, 1998. Works Consulted – 431 432 – Works Consulted Index Index – 433 Index Page numbers in boldfaced type indicate a page containing a formula or definition. –A– absolute value distance between points properties of absolute value inequality solving an algebraic expression asymptote(s) horizontal of a rational function slant vertical 33 34 33 63 63 5 151 152 153 158 152 – D – (cont'd) two-point form of equation of line cofactors of minors of properties of of a square matrix Distance Formula distinguishable permutation domain of a function 367 342 342 347 – 349, 350 344 22 381 99, 103 –E– –B– basic rules of algebra negation, properties of binomial theorem 7, 8 8 383, 384 –C– Cartesian plane Distance Formula Midpoint Formula change-of-base formula circle translation of cofactors of determinants combination(s) completing the square complex conjugate(s) complex number(s) addition of conjugate of division of equality of multiplication of subtraction of composition of functions conic section(s) circle translation of eccentricity of an ellipse of a hyperbola ellipse eccentricity of translation of hyperbola asymptotes of eccentricity of translation of parabola translation of translation of conic sections, general counting principle Cramer's Rule –D– Descartes's Rule of Signs determinant(s) applications of area of a triangle Cramer's Rule test for collinear points 434 – Index 21 22 24 195 263, 264, 265 283 342 377 257, 258 146 143, 144 145 146 146 144 146 146 227 263, 264, 265 283 273 273 278 271, 272 273 289 275, 276 276, 294 278 293 267, 268 285 281, 282 375, 376 359 – 361, 362 133 341, 342 353 355, 356 359 – 361, 362 365, 366 eccentricity of an ellipse of a hyperbola elementary row operations ellipse eccentricity of translation of equality properties of equation(s) exponential of a line in general form in one variable in point-slope form in slope-intercept form in two-point form logarithmic quadratic completing the square extracting square roots factoring Quadratic Formula solutions of exponent(s) properties of scientific notation exponential decay model exponential equation exponential function(s) inverse properties of exponential growth model Extended Distributive Property extracting square roots 273 273 278 319 271, 272 273 289 9 203, 204 59 42 55 53, 54 56 203, 204 249 257, 258 255, 256 253 259, 260 260 11, 12 12 210 203, 204 187 203 209 114 255, 256 –F– factorial(s) Factor Theorem factoring a quadratic equation fraction(s) partial properties of function(s) combinations, arithmetic composition of domain of horizontal shift of implied domain of inverse finding horizontal line test polynomial 955 125 672 15 161, 164 15, 16 99 225, 226 227 99, 103 232 103 235, 236 239 243, 244 105 – F – (cont'd) addition of Descartes's Rule of Signs division of factoring patterns, special Intermediate Value Theorem multiplication of product patterns, special Rational Zero Test real zeros of subtraction of range of rational asymptote of graphing reflections in coordinate axes transcendental exponential inverse properties of logarithmic graphs of natural base e vertical line test vertical shift of Fundamental Theorem of Algebra linear factorization theorem –G– Gaussian elimination Gaussian model Gauss-Jordan elimination graphing rational functions 111 133 121, 122 117 131 113 115 137 129 111 99, 103 149 153 155 229, 230 185 187 203 191 192 – 193 199 – 201 101 231, 232 107 107 323 213 327 155 –H– horizontal horizontal horizontal horizontal asymptote line (in a plane) line test shift of a function –I– identity matrix independent events, probability of index of summation inequality (inequalities) absolute value solving an linear combined (double) double (combined) polynomial finding test intervals properties of rational Intermediate Value Theorem inverse function(s) finding horizontal line test 152 47, 59 243, 244 232 303 393 413 63 63 61 62 62 65 66 17, 18 69 131 235, 236 239 243, 244 –J– –K– –L– line(s) in the plane horizontal parallel perpendicular slope of vertical linear equation in one variable 45 47, 59 163 163 45 152 41 42 – L – (cont'd) generating equivalent equations fractional expressions linear inequality combined (double) double (combined) logarithm(s) change-of-base formula properties of logarithmic equation logarithmic graphs of logarithmic model logistics growth model lower limit of summation –M– matrix (matrices) addition of properties of elementary row operations Gaussian elimination Gauss-Jordan elimination identity inverse of finding matrix inverse 2 x 2 matrix properties of linear systems in multiplication of properties of properties of matrix operations reduced row-echelon form representation of row-echelon form scalar multiplication of properties of triangular matrix method of elimination Midpoint Formula minors of a determinant –N– natural base e 42 43 61 62 62 191 195 192, 197, 200 203, 204 191 192 – 193 215 219 413 299 305 313 319 323 327 303 329 333, 334 337 317 309 315 313, 315 321 301 321 307 313 351 83, 84 24 342 199 – 201 –O– –P– parallel lines partial fractions linear factors, distinct linear factors, repeated mixed factors quadratic factors, distinct quadratic factors, repeated solving basic equation Pascal's Triangle permutation(s) distinguishable perpendicular lines point-slope form of the equation of a line polynomial function(s) addition of Descartes's Rule of Signs division of factoring patterns, special Intermediate Value Theorem multiplication of product patterns, special Rational Zero Test real zeros of 51 161, 164 165 173 181 169 177 164 387 379, 380 381 51 55 105 111 133 121, 122 117 131 113 115 137 129 Index – 435 – P – (cont'd) subtraction of polynomial inequality finding test intervals probability of a complement of an event of independent events of the union of two events –Q– quadratic(s) quadratic equation(s) completing the square extracting square roots factoring Quadratic Formula solutions of Quadratic Formula –R– radical(s) nth root of a number principle nth root properties of rational function(s) asymptote of graphing rational inequality Rational Zero Test real number(s) ordering of bounded intervals unbounded intervals Remainder Theorem 111 65 66 389, 390 397, 398 390 393 395, 396 247 249 257, 255, 253 259, 260 259, 436 – Index transcendental function(s) exponential inverse properties of logarithmic graphs of natural base e triangular matrix two-point form of the equation of a line –U– upper limit of summation 258 256 260 260 –V– vertical asymptote vertical line 185 187 203 191 192 – 193 199 – 201 351 56 413 152 48, 59 –w– –x– 27 27 27 29 149 153 155 69 137 31 35 36 37 127 –S– sequence(s) arithmetic nth term of sum of geometric finite sum of infinite sum of nth term of slant asymptote slope of a line in the plane slope-intercept form of the equation of a line substitution method summation index of lower limit of notation properties of upper limit of synthetic division Factor Theorem Remainder Theorem systems graphical approach graphical interpretation of solutions of inequalities sketching the graph of linear equations method of elimination multivariable Gaussian elimination substitution method –T– 399 403 404 405 407 409 410 408 158 45 53, 54 73, 74 413 413 413 413 415 413 123 125 127 77 79, 80 93 94 71 83, 87 88 73, 74 –Y– –Z– zero properties of Zero Factor Property 19 19 19