try ai
Popular Science
Edit
Share
Feedback
  • General Equation of the Second Degree

General Equation of the Second Degree

SciencePediaSciencePedia
Key Takeaways
  • The general equation of the second degree, Ax2+Bxy+Cy2+Dx+Ey+F=0Ax^2 + Bxy + Cy^2 + Dx + Ey + F = 0Ax2+Bxy+Cy2+Dx+Ey+F=0, can describe every conic section—the circle, ellipse, parabola, and hyperbola.
  • The discriminant, Δ=B2−4AC\Delta = B^2 - 4ACΔ=B2−4AC, classifies the conic as an ellipse (Δ<0\Delta < 0Δ<0), hyperbola (Δ>0\Delta > 0Δ>0), or parabola (Δ=0\Delta = 0Δ=0).
  • Rotating the coordinate axes can eliminate the BxyBxyBxy term, simplifying the equation and revealing the conic's true orientation without altering its fundamental invariants.
  • The mathematical structure of the second-degree equation extends beyond geometry, appearing in physics to describe dynamical systems and in higher dimensions for quadric surfaces.

Introduction

From the graceful orbits of planets to the sharp trajectories of comets, the universe is filled with shapes known since antiquity as conic sections. While the ellipse, parabola, and hyperbola appear distinct, they share a common, powerful origin: a single algebraic formula known as the general equation of the second degree. This raises a fundamental question: how can one equation hold the blueprint for such a diverse family of curves? This article deciphers this algebraic code, revealing the elegant logic that connects the coefficients of the equation to the geometry of the curve it represents.

Our exploration will unfold in two main parts. First, in ​​Principles and Mechanisms​​, we will dissect the equation itself, learning how to classify conics using the discriminant, simplify them through rotation, and understand the deep truths revealed by invariants. Then, in ​​Applications and Interdisciplinary Connections​​, we will venture beyond pure geometry to witness how this same mathematical structure provides a powerful language for describing phenomena in physics, engineering, and beyond. We begin by examining the core principles that allow us to master this versatile equation.

Principles and Mechanisms

It is one of the most remarkable facts in mathematics that the elegant curves of the ancient Greeks—the circle, the ellipse, the parabola, and the hyperbola—all spring from a single, rather unassuming algebraic source. Every one of these "conic sections" can be described by the general equation of the second degree:

Ax2+Bxy+Cy2+Dx+Ey+F=0Ax^2 + Bxy + Cy^2 + Dx + Ey + F = 0Ax2+Bxy+Cy2+Dx+Ey+F=0

At first glance, this equation seems like a jumble of terms. How can this one formula contain both the perfect, finite symmetry of a circle and the wild, infinite arms of a hyperbola? The secret, as we will discover, lies not in the variables xxx and yyy, but in the coefficients—the numbers A,B,C,D,E,A, B, C, D, E,A,B,C,D,E, and FFF. They are the dials and knobs that control the geometry. By turning them, we can transform one shape into another, stretch it, shift it, and rotate it. Our journey is to understand the laws that govern this machine.

The Great Sorting Hat: The Discriminant

Imagine you are presented with a hundred different equations in the general form above. How would you begin to make sense of them? Your first and most powerful tool is a simple combination of three coefficients, a quantity known as the ​​discriminant​​, Δ=B2−4AC\Delta = B^2 - 4ACΔ=B2−4AC. This number acts like a grand sorting hat, immediately placing any given conic into one of three fundamental families, without any need to graph it.

The rules of the game are wonderfully simple:

  • If Δ<0\Delta \lt 0Δ<0, the equation describes an ​​ellipse​​ or a circle. These are the closed, bounded curves, forever looping back on themselves. Think of planetary orbits.

  • If Δ>0\Delta \gt 0Δ>0, the equation describes a ​​hyperbola​​. These are the unbound curves, with two distinct branches that flee to infinity. Think of the path of a comet slingshotting around a star.

  • If Δ=0\Delta = 0Δ=0, the equation describes a ​​parabola​​. This is the critical case, a delicate balance between the closed ellipse and the open hyperbola. Think of the path of a thrown ball under gravity.

Let's see this in action. An equation like (log⁡164)x2+(log⁡3(273))xy+(log⁡100.01)y2=5(\log_{16} 4)x^2 + (\log_3 (27\sqrt{3}))xy + (\log_{10} 0.01)y^2 = 5(log16​4)x2+(log3​(273​))xy+(log10​0.01)y2=5 looks intimidating. But by evaluating the coefficients to A=12A = \frac{1}{2}A=21​, B=72B = \frac{7}{2}B=27​, and C=−2C = -2C=−2, we find the discriminant is Δ=(72)2−4(12)(−2)=654\Delta = (\frac{7}{2})^2 - 4(\frac{1}{2})(-2) = \frac{65}{4}Δ=(27​)2−4(21​)(−2)=465​. Since this is greater than zero, we know instantly, without plotting a single point, that this curve is a hyperbola. Similarly, if the coefficients AAA and CCC are the eigenvalues of the matrix (3113)\begin{pmatrix} 3 & 1 \\ 1 & 3 \end{pmatrix}(31​13​) (which are 2 and 4) and BBB is the trace of (17−24)\begin{pmatrix} 1 & 7 \\ -2 & 4 \end{pmatrix}(1−2​74​) (which is 5), the discriminant is Δ=52−4(2)(4)=25−32=−7\Delta = 5^2 - 4(2)(4) = 25 - 32 = -7Δ=52−4(2)(4)=25−32=−7. It's negative, so the curve must be an ellipse.

This algebraic classification has a beautiful geometric counterpart. The conic sections, after all, get their name from being slices of a double cone. Imagine a plane cutting through a cone standing upright. If the plane is horizontal, you get a circle. Tilt it slightly, and you get an ellipse. Tilt it further until it's exactly parallel to the side of the cone, and you get a parabola. Tilt it even more, and it cuts through both halves of the double cone, creating a hyperbola.

A thought experiment reveals the connection: if we model a cone by x2+y2=z2x^2 + y^2 = z^2x2+y2=z2 and slice it with a plane z=mx+cz = mx + cz=mx+c, the projection of the intersection onto the xyxyxy-plane is a conic. The algebra shows that the discriminant of this projected curve is −4(1−m2)-4(1-m^2)−4(1−m2). The parabolic case, Δ=0\Delta=0Δ=0, happens precisely when 1−m2=01-m^2=01−m2=0, or ∣m∣=1|m|=1∣m∣=1. This means the slope of the plane exactly matches the slope of the cone's side! The algebraic condition B2−4AC=0B^2 - 4AC = 0B2−4AC=0 is the direct echo of the geometric condition for creating a parabola.

The Troublemaker and the Twist: Eliminating the xyxyxy-Term

The quadratic terms, Ax2Ax^2Ax2, BxyBxyBxy, and Cy2Cy^2Cy2, determine the fundamental shape. The linear terms, DxDxDx and EyEyEy, shift the shape around the plane, moving its center or vertex away from the origin. But the most interesting term is the BxyBxyBxy term. Its presence is a sign that the conic's natural axes of symmetry—its major and minor axes for an ellipse, for instance—are tilted with respect to our xxx and yyy axes.

To simplify the equation and understand the conic's true geometry, we can rotate our coordinate system to align with the conic's own axes. This is like straightening a crookedly hung picture frame. The goal of this rotation is to find a new coordinate system (x′,y′)(x', y')(x′,y′) where the troublesome cross-term vanishes. The angle of rotation, θ\thetaθ, that achieves this is given by a beautifully compact formula:

cot⁡(2θ)=A−CB\cot(2\theta) = \frac{A-C}{B}cot(2θ)=BA−C​

This formula tells us that the required rotation depends on the "imbalance" between the x2x^2x2 and y2y^2y2 coefficients, compared to the size of the xyxyxy coefficient. Consider the special case where the quadratic part is symmetric, meaning A=CA=CA=C. What happens then? The formula becomes cot⁡(2θ)=0\cot(2\theta) = 0cot(2θ)=0, which implies 2θ=90∘2\theta = 90^\circ2θ=90∘, so θ=45∘\theta = 45^\circθ=45∘. This means that any conic of the form Ax2+Bxy+Ay2+⋯=0Ax^2 + Bxy + Ay^2 + \dots = 0Ax2+Bxy+Ay2+⋯=0 (with B≠0B \neq 0B=0) is simply a standard conic tilted by exactly 45 degrees. The algebra reflects the symmetry perfectly. We can even work backwards; if we demand that a rotation of θ=π/3\theta = \pi/3θ=π/3 (60∘60^\circ60∘) eliminates the cross-term for a conic family, we can use the formula to find the precise relationship that must hold between its coefficients.

The Unchanging Truths: Invariants

When we rotate or shift our coordinate system, the individual coefficients A,B,C,…A, B, C, \dotsA,B,C,… all change. Their values are artifacts of our chosen viewpoint. It would be dizzying if everything changed, but fortunately, some fundamental quantities remain constant. These are the ​​invariants​​, properties that belong to the geometric object itself, not our description of it.

The first, as you might guess, is the discriminant. After any rotation, the new coefficients A′,B′,C′A', B', C'A′,B′,C′ will be different, but the quantity (B′)2−4A′C′(B')^2 - 4A'C'(B′)2−4A′C′ will be exactly the same as B2−4ACB^2 - 4ACB2−4AC. This is profoundly important. It means rotation can't turn a hyperbola into an ellipse. The essential nature of the curve is invariant.

A second, more subtle invariant under rotation is the sum of the quadratic coefficients, A+CA+CA+C. This simple sum has a direct geometric meaning. For a hyperbola, the condition A+C=0A+C = 0A+C=0 signals that it is a ​​rectangular hyperbola​​, one whose asymptotes are perpendicular. If you calculate A+CA+CA+C for an equation like 3x2+8xy−3y2=03x^2 + 8xy - 3y^2 = 03x2+8xy−3y2=0, you find it's zero. Even if you rotate the axes and get a completely different-looking equation, the new coefficients A′A'A′ and C′C'C′ will still sum to zero. The property of having perpendicular asymptotes is baked into the conic's very being.

Even the humble constant term FFF has its moment of invariance. While it changes under translation (shifting), it remains unchanged under a pure rotation about the origin. This makes intuitive sense: rotating a curve around the origin shouldn't affect its distance from the origin in any special way encoded by the constant.

To find an invariant that withstands both rotation and translation, we must assemble our coefficients into a more powerful structure: a 3×33 \times 33×3 matrix. M=(AB/2D/2B/2CE/2D/2E/2F)M = \begin{pmatrix} A & B/2 & D/2 \\ B/2 & C & E/2 \\ D/2 & E/2 & F \end{pmatrix}M=​AB/2D/2​B/2CE/2​D/2E/2F​​ The determinant of this matrix, det⁡(M)\det(M)det(M), is a powerful invariant under all rigid motions (rotations and translations). If you are given two different equations for the same conic in two different coordinate systems, the coefficients may look wildly different, but the determinant of their associated matrices will be identical.

On the Edge of Geometry: Degenerate Conics

What happens when the machinery of our equation produces something... less than a perfect conic? What if, instead of an ellipse, we get a single point? Or instead of a hyperbola, we get two intersecting lines? These are the ​​degenerate conics​​, and they occur precisely when our grand invariant, the determinant of the 3×33 \times 33×3 matrix, is zero. det⁡(M)=0\det(M) = 0det(M)=0 is the sign of geometric collapse.

The parabolic case, B2−4AC=0B^2-4AC=0B2−4AC=0, is particularly rich with degeneracy. This algebraic condition is equivalent to saying that the quadratic part of the equation, Ax2+Bxy+Cy2Ax^2+Bxy+Cy^2Ax2+Bxy+Cy2, can be factored into a perfect square of a linear expression, something like (αx+βy)2(\alpha x + \beta y)^2(αx+βy)2. This is a deep insight!

Consider the equation 4x2−4xy+y2−12x+6y+F=04x^2 - 4xy + y^2 - 12x + 6y + F = 04x2−4xy+y2−12x+6y+F=0. Here, B2−4AC=(−4)2−4(4)(1)=0B^2 - 4AC = (-4)^2 - 4(4)(1) = 0B2−4AC=(−4)2−4(4)(1)=0, so we expect a parabola. But notice that the quadratic part is (2x−y)2(2x-y)^2(2x−y)2, and the linear part is −6(2x−y)-6(2x-y)−6(2x−y). If we make a substitution t=2x−yt = 2x - yt=2x−y, the entire equation collapses into a simple quadratic: t2−6t+F=0t^2 - 6t + F = 0t2−6t+F=0 This equation doesn't describe a parabola at all! Its solutions are values for ttt, like t=t1t=t_1t=t1​ and t=t2t=t_2t=t2​. Geometrically, each solution represents a straight line, 2x−y=t12x-y = t_12x−y=t1​. If the quadratic has two distinct real roots (when 36−4F>036-4F > 036−4F>0), we get two parallel lines. If it has no real roots (when 36−4F<036-4F < 036−4F<0), there is no curve at all. And at the critical value F=9F=9F=9, the quadratic has exactly one repeated root, t=3t=3t=3. The two parallel lines have coalesced into a single line, 2x−y=32x-y=32x−y=3.

Thus, the general equation of the second degree holds more than just the classical conics. It contains their shadows, their degenerate forms, and the rules for their transformation. By understanding the roles of the coefficients and the deep truths of the invariants, we move from simply memorizing a formula to grasping the unified and beautiful structure that governs this entire family of shapes.

Applications and Interdisciplinary Connections

After a journey through the principles and mechanisms of the general second-degree equation, one might be tempted to put it away in a box labeled "Geometry of Conic Sections." That would be a tremendous mistake. To do so would be like studying the alphabet and never realizing it can be used to write poetry, history, and the laws of nature. The true beauty of this equation isn't just that it describes a family of elegant curves; it's that the very same mathematical structure appears, like a recurring theme in a grand symphony, across a vast landscape of science and engineering.

Let's begin our exploration back in the familiar world of geometry, but look at it with new eyes. You are given a complicated-looking equation, say Ax2+Bxy+Cy2+Dx+Ey+F=0Ax^2 + Bxy + Cy^2 + Dx + Ey + F = 0Ax2+Bxy+Cy2+Dx+Ey+F=0. What is it? A circle? A tilted ellipse? A hyperbola? The brute-force method of plotting points is tedious and unenlightening. The magic lies in recognizing that we don't have to work so hard. The coefficients themselves hold the answer. By simply performing algebraic manipulations like completing the square, we can rearrange the equation into a standard form that immediately reveals the shape's identity, its center, and its orientation.

But what about that troublesome BxyBxyBxy term, the one responsible for tilting the conic? Our instinct is to "straighten it out," to find the natural axes of the shape itself. We can do this by rotating our point of view—our coordinate system. This is more than just an algebraic trick; it's a profound physical idea. The laws of physics don't depend on how we orient our laboratory, and the intrinsic properties of a shape don't depend on how we draw our axes. When we perform the correct rotation, the BxyBxyBxy term vanishes, and the equation simplifies to a form like A′(x′)2+C′(y′)2=1A'(x')^2 + C'(y')^2 = 1A′(x′)2+C′(y′)2=1. The mathematics tells us exactly how to find this special angle of rotation. Remarkably, the coefficients of the old, complicated equation and the new, simple one are intimately related, revealing a deep, underlying structure.

This process of "straightening things out" is a cornerstone of physics and mathematics, and it has a much more powerful name: finding eigenvalues and eigenvectors. The quadratic part of our equation, Ax2+Bxy+Cy2Ax^2 + Bxy + Cy^2Ax2+Bxy+Cy2, can be represented by a symmetric matrix, (AB/2B/2C)\begin{pmatrix} A & B/2 \\ B/2 & C \end{pmatrix}(AB/2​B/2C​). The "natural axes" of the conic are nothing more than the eigenvectors of this matrix. The "stretch" along these axes is determined by the eigenvalues, λ1\lambda_1λ1​ and λ2\lambda_2λ2​. It’s a beautiful fact that for a real symmetric matrix, these eigenvalues are always real numbers, which guarantees that we can always find such a rotation in the real world. This isn't just a coincidence; it's a deep truth about the nature of space and linear transformations. The eigenvalues don't just give us the orientation; they encode fundamental geometric properties. For instance, the product of the radius of curvature at the ends of the major and minor axes of an ellipse is directly related to the product of the eigenvalues of its matrix. The algebra of matrices is speaking the language of geometry.

Once we have the equation, we can ask more sophisticated questions. What is the slope of the tangent at a given point? Calculus provides a direct answer through implicit differentiation, giving us a formula for the tangent line in terms of the coefficients and the point of contact. But the old masters of geometry, like Apollonius of Perga, discovered astonishing relationships without modern calculus. He described the concept of "conjugate diameters," a pair of lines through the center of an ellipse where each one bisects all chords parallel to the other. The condition for two diameters to be conjugate is a simple and elegant algebraic relationship between their slopes and the conic's coefficients. The truly magical moment comes when we combine these ideas: the tangent to an ellipse at a point PPP is perfectly parallel to the diameter that is conjugate to the diameter passing through PPP. This is a theorem of stunning elegance, weaving together tangents, diameters, and the algebraic heart of the conic section.

Now, let's step outside the world of static shapes and see where else this pattern emerges. Consider a physical system that changes in time—a mass bobbing on a spring, a pendulum swinging, or the voltage in an electrical circuit. The governing law is often a second-order differential equation: ay′′+by′+cy=0a y'' + b y' + c y = 0ay′′+by′+cy=0. How do we solve this? We look for solutions of the form y(t)=exp⁡(rt)y(t) = \exp(rt)y(t)=exp(rt), and upon substitution, we are left with a purely algebraic problem: the characteristic equation ar2+br+c=0ar^2 + br + c = 0ar2+br+c=0. Look familiar? It's our old friend, the quadratic equation. And its discriminant, b2−4acb^2 - 4acb2−4ac, which once told us the difference between an ellipse and a hyperbola, now tells us about the fate of our physical system. If the discriminant is negative, the roots are complex, and the system oscillates forever (like an ideal pendulum). If it's positive, the roots are real, and the system decays smoothly back to equilibrium (like a door closer). The geometry of shapes and the dynamics of motion are described by the very same equation!

The connection goes even deeper. Consider a more complex system, like the populations of predators and prey, or the trajectory of a particle in a force field. We can often describe its evolution with a system of two linear differential equations: dxdt=Ax\frac{d\mathbf{x}}{dt} = A\mathbf{x}dtdx​=Ax. The behavior of this system is visualized in a "phase portrait," a map of arrows showing which way the system will evolve from any starting point. Are there any special directions, lines along which the flow is always directed along the line itself? Yes. These are the invariant lines of the system, and to find their slopes, we must solve... a quadratic equation whose coefficients are determined by the matrix AAA. These invariant lines are, in fact, the eigenvectors of the matrix AAA, the same concept we used to straighten out our tilted ellipse. The geometry of conic sections finds a direct and powerful echo in the geometry of change.

Finally, why stop at two dimensions? Our world has three. The general second-degree equation can be extended to include zzz, giving us quadric surfaces: spheres, ellipsoids, paraboloids, and hyperboloids. These are the shapes of planets, cooling towers, and satellite dishes. The algebraic machinery scales up beautifully. A general quadric surface is represented by a 4x4 matrix, and its properties are hidden in that matrix. For instance, whether a surface is a simple ellipsoid or a "degenerate" case like a pair of intersecting planes is determined by whether the determinant of this matrix is zero. This single number acts as the ultimate arbiter of the surface's identity, a direct generalization of the 2D discriminant.

From the slope of a tangent on an ellipse to the oscillations of a circuit, from the axes of a conic to the stable states of a dynamical system, the general equation of the second degree is a unifying thread. It teaches us a profound lesson: that by understanding the deep structure of a simple mathematical form, we gain the power to describe, predict, and appreciate a dazzling variety of phenomena in the world around us.