I've given the below a more formal presentation in a note (PDF). If you're interested in this open problem, you can find more details and omitted proofs there.
If you believe that tools from additive combinatorics are highly relevant to research in algorithms (some people do), it's natural to wonder about applications that run in the opposite direction: what can algorithms research tell us about additive combinatorics?
This blog post considers one such question that I think is natural. We start with the notion of a sumset: given an integer set , its sumset is the set
containing every sum of two integers chosen from . The ratio of the size of the sumset to the original set is an important indicator of "additive structure": up to constants, the statements
- ,
- with summands, has cardinality at most , and
- can be contained in a generalized arithmetic progression of dimension and volume ,
among others, are all equivalent. Additive combinatorics is, more or less, the study of sumsets.
So: given a set , is it a sumset? (That is, can be expressed as for some ?) Designing algorithms to answer this question tests our understanding of integer behavior under addition and might, I hope, lead to further discoveries.
Problem Statement:
Sumset Containment (SSC):
Input: An (integer) set of cardinality and a parameter .
Output: A solution satisfying and .
This formulation includes the extra parameter , but if we set we recover our original question, which I'll call Sumset Equivalence (SSE). (We could also parameterize in the size of the set ; this turns out to be exactly equivalent to -Clique on a certain graph, as we'll see below.)
Exact Algorithms:
The only approaches to this problem that I'm aware of begin by constructing a candidate set satisfying the property that for every possible solution . Once an algorithm has constructed , it can search for solutions within it.
Approach 1. For example, if there exists a set such that , we know that
In this case, contains elements, so we can do a brute force search for solutions in time . (The notation hides polynomial factors.)
Approach 2. Even better, we can construct the candidate graph
and observe that any set satisfies if and only if induces a clique in (and includes every self-loop on a clique vertex). Moreover, the clique corresponds to a solution if and only if it covers edges associated with at least different sums .
Because every solution clique is a subgraph of some maximal solution clique, to solve the problem, it suffices to check every maximal clique in the candidate graph. By a classic result, a graph on vertices contains at most maximal cliques; moreover, these can be efficiently enumerated.
This gives us a better algorithm: first, build the candidate set and the candidate graph , then check every maximal clique in to see if it corresponds to a solution. This takes time if we use the -element candidate set defined above.
Approach 3. A further improvement in runtime comes from building a better candidate set. Suppose our algorithm is given a slightly modified "promise problem" version of SSC: in addition to the set , the algorithm is guaranteed that some solution exists and is given and , the smallest and largest elements of , respectively.
We claim that is a subset of the candidate set
and furthermore that (proof in the attached PDF). Solving the SSC by enumerating the maximal cliques of the candidate graph takes time .
Finally, we observe that the promise of and to the algorithm is without loss of generality: since and (defined with respect to a certain solution ) are elements of the set , we can run our algorithm times, one for every possible pair . Because our algorithm can easily check for false positives, the iterations that make incorrect guesses fail harmlessly.
Open Problems:
SSC is both NP-complete and W[1]-complete (parameterized in ), so efficient exact or parameterized algorithms appear unlikely. However, improvements in exact algorithms could still be interesting: in particular, improving on the -time algorithm above seems like it might require a more sophisticated use of the structure of sumsets on integers. Open problems include:
Open Problem 1: Does there exist an -time algorithm for SSC or SSE, for some constant ?
Open Problem 2: How hard are SSC/SSE on other groups besides the integers? (For example, on the Boolean cube , the problem remains NP-complete and can be solved in time , but easy improvements might be possible here.)
Open Problem 3: Design algorithms that determine whether contains more complicated sum and difference sets. For example, does contain a large sumset for some , (ignoring trivial solutions like )? What about sets of the form for integers and ?