NP-complete problems

The class \(\NP\) contains both problems that are easy (those in the class \(\Poly\)) as well as problems for which we do not currently know efficient algorithms (but whose solutions are small and easy to check). There is a special class of problems in \(\NP\) called \(\NP\)-complete problems. They have the property that if any of them can be solved in polynomial time, then all of them can. In a sense, they are the hardest problems in the class \(\NP\). And there are lots of such problems.

Definition: Polynomial-time computable reduction

A polynomial-time computable reduction from a decision problem \(B\) to a decision problem \(A\) is a function \(R\) from the instances of \(B\) to the instances of \(A\) such that for each instance \(x\) of \(B\),

\(R(x)\) can be computed in polynomial-time, and
\(x\) is a “yes” instance of \(B\) if and only if \(R(x)\) is a “yes” instance of \(A\).

Therefore, if we have

a polynomial time reduction from \(B\) to \(A\), and
a polynomial-time algorithm solving \(A\),

then we have a polynomial-time algorithm for solving \(B\) as well: given an instance \(x\) of \(B\), compute \(R(x)\) and then run the algorithm for \(A\) on \(R(x)\).

Definition: NP-completeness

A problem \(A\) in \(\NP\) is NP-complete if every other problem \(B\) in \(\NP\) can be reduced to it with a polynomial-time computable reduction.

Therefore,

if an \(\NP\)-complete problem \(A\) can be solved in polynomial time, then all the problems in \(\NP\) can be solved in polynomial time, and thus
\(\NP\)-complete problems are, in a sense, the most difficult ones in \(\NP\).

We do not know whether \(\NP\)-complete problems can be solved efficiently or not.

The Cook-Levin Theorem

The fact that \(\NP\)-complete problems exist at all was proven by Cook and Levin in 1970’s.

Theorem: Cook 1971, Levin 1973

\(\SAT{}\) (and \(\SATT{}\)) is \(\NP\)-complete.

Soon after this, Karp [1972] then listed the next 21 \(\NP\)-complete problems. Since then, hundreds of problems have been shown \(\NP\)-complete. For instance, \(\Prob{traveling salesperson}\), \(\Prob{generalized Sudokus}\) etc are all \(\NP\)-complete. For incomplete lists, one can see

Garey and Johnson, 1979: Computers and Intractability: A Guide to the Theory of NP-Completeness, or
this Wikipedia page.

NP-completeness: significance

Of course, the question of whether \(\Poly = \NP\), or can \(\NP\)-complete problems be solved in polynomial time, is a very important open problem in computer science. In fact, it is one of the seven (of which six are still open) one million dollar prize Clay Mathematics Institute Millennium Prize problems.

Because there are hundreds of \(\NP\)-complete problems, the \(\Poly = \NP\) question has practical significance as well. As Christos Papadimitriou says in his book “Computational complexity”:

There is nothing wrong in trying to prove that an \(\NP\)-complete problem can be solved in polynomial time. The point is that without an \(\NP\)-completeness proof one would be trying the same thing without knowing it!

That is, suppose that you have a new problem for which you should develop an algorithm that is always efficient, i.e. runs in polynomial time. You may end up in trying really hard for very long time, probably without success, if the problem turns out to be \(\NP\)-complete or harder. This is because during the last decades, lots of very clever people have tried to develop polynomial-time algorithms for many \(\NP\)-complete problems, but no-one has succeeded so far.