Persiflage

Hiring Season

Posted on January 7, 2018 by Persiflage

Lizard 1: Wait, explain again why we bury our young in the sand and thereby place them into mortal peril?

Lizard 2: So they develop character! If it was good enough for me, it’s good enough for them.

(Feel free to choose your own metaphors.)

Posted in Mathematics, Politics, Rant, Travel | Tagged Galapagos Islands, Hiring Season, Lizards, Mortal Peril, Planet Earth, Snakes | 1 Comment

Abandonware

Posted on December 25, 2017 by Persiflage

For a young mathematician, there is a lot of pressure to publish (or perish). The role of for-profit academic publishing is to publish large amounts of crappy mathematics papers, make a lot of money, but at least in return grant the authors a certain imprimatur, which can then be converted into reputation, and then into job offers, and finally into pure cash, and then coffee, and then back into research. One great advantage of being a tenured full professor (at an institution not run by bean counters) is that I don’t have to play that game, and I can very selective in what papers I choose to submit. In these times — where it is easy to make unpublished work available online, either on the ArXiv, a blog, or a webpage — there is no reason for me to do otherwise. Akshay and I are just putting the finishing touches on our manuscript on the torsion Jacquet–Langlands correspondence (a project begun in 2007!), and approximately 100 pages of the original version has been excised from the manuscript. It’s probably unlikely we will publish the rest, not because we don’t think its interesting, but because it can already be found online. (Although we might collect the remains into a supplemental “apocrypha” to make referencing easier.) Sarnak writes lots of great letters and simply posts them online. I wrote a paper a few years ago called “Semistable modularity lifting over imaginary quadratic fields.” It has (IMHO) a few interesting ideas, including one strategy for overcoming the non-vanishing of cohomology in multiple degrees in an $l_0 = 1$ situation, a way of proving a non-minimal modularity lifting theorem in an (admittedly restricted) $l_0 = 1$ situation without having to use Taylor’s Ihara Avoidance or base change (instead using the congruence subgroup property), and an argument explaining why the existence of Nilpotent ideals in Scholze’s Galois representation is no obstruction to the modularity lifting approach in my paper with David. But while I wrote up a detailed sketch of the argument, gave a seminar about it, and put the preprint on my webpage, I never actually submitted it. One reason was that David and I were (at the time, this was written in 2014-2015 or so) under the cosh by an extremely persnickety referee (to give you some idea, the paper was submitted in 2012 and was only just accepted), and I couldn’t stomach the idea of being raked over the coals a second time merely to include tedious details. (A tiny Bernard Woolley voice at the back of my head is now saying: excuse me minister, you can’t be raked over by a cosh, it doesn’t have any teeth. Well done if you have any idea what I am talking about.) But no matter, the paper is on my webpage where anyone can read it. As it happens, the 10 author paper has certainly made the results of this preprint pretty much entirely redundant, but there are still some ideas which might be useful in the future someday. But I don’t see any purpose whatsoever in subjecting an editor, a reviewer, and (especially) myself the extra work of publishing this paper.

So I am all in favor of avoiding publishing all but a select number of papers if you can help it, and blogging about math instead. So take a spoon, pass around the brandy butter and plum pudding, and, for the rest of this post, let us tuck in to something from the apocrypha.

Galois Extensions Unramified Away From One Place:

I learned about one version of this question in the tea room at Havard from Dick Gross. Namely, does there exist a non-solvable Galois extension K/Q unramified at all primes except p? Modular forms (even just restricting to the two eigenforms of level one and weights 12 and 16) provide a positive answer for p greater than 7. On the other hand, Serre’s conjecture shows that this won’t work for the last three remaining primes. Dick explained a natural approach for the remaining primes, namely to consider instead Hilbert modular forms over a totally real cyclotomic extension ramified at p (once you work out how to actually compute such beasts in practice). And indeed, this idea was successfully used to find such representations by Lassina Dembélé in this paper and also this paper (with Greenberg and Voight). But there is something a little unsatisfactory to me about this, namely, these extensions are all ramified at $p$ and $\infty.$ What if one instead asks Gross’ question for a single place?

Minkowski showed there are no such extensions when $v = \{\infty\},$ but I don’t see any obstruction to there being a positive answer for a finite place. The first obvious remark, however, is that Galois representations coming from Hilbert modular forms are not going to be so useful in this case at least when the residual characteristic is odd, for parity reasons.

On the other hand, conjecturally, the Langlands program still has something to say about this question. One could ask, for example, for the smallest prime p for which there exists a Galois representation:

$\displaystyle{\overline{\rho}: G_{\mathbf{Q}} \rightarrow \mathrm{GL}_2(\overline{\mathbf{F}}_p)}$

whose image is big (say not only irreducible but also not projectively exceptional) and is unramified at all places away from p including infinity. (This is related to my first ever blog post.) Here is how one might go about finding such a representation, assuming the usual suite of conjectures. First, take an imaginary quadratic field F, and then look to see if there is any extra mod-p cohomology of $\mathrm{GL}_2(\mathcal{O}_F)$ in some automorphic local system which is not coming from any of the “obvious” sources. If you find such a class, you could then try to do the (computationally difficult) job of computing Hecke eigenvalues, or alternatively you could do the same computation for a different such imaginary quadratic field E, and see if you find a weight for which there is an “interesting” class simultaneously for both number fields. If there are no such classes for any of the (finitely many) irreducible local systems modulo p, then there are (conjecturally) no Galois representations of the above form.

There are some heuristics (explained to me by Akshay) which predict that the number of Galois representations of the shape we are looking for (ignoring twists) is of the order of 1/p. On the other hand, no such extensions will exist for very small p by combining an argument of Tate together with the Odlyzko bounds. So the number of primes up to X for which there exist such a representation might be expected to be of the form

$\log \log X – \log \log C$

for some constant C to account for the lack of small primes (which won’t contribute by Tate + Odlyzko GRH discriminant bounds). This is unfortunately a function well-known to be constant, and in this case, with the irritating correction term, it looks pretty much like the zero constant. Even worse, the required computation becomes harder and harder for larger p, since one needs to compute the cohomology in the corresponding local system of weight $(k,k)$ for k up to (roughly) p. Alas, as it turns out, these things are quite slippery:

Lemma: Suppose $\overline{\rho}$ is absolutely irreducible with Serre level 1 and Serre weight k and is even. Assume all conjectures. Then:

The prime $p$ is at least 79.
The weight $k$ is at least 33.
If $\overline{\rho}$ exists with $k \le 53,$ then $p > 1000.$
If $\overline{\rho}$ exists with $k = 55,$ then $p > 200,$ or $p =163,$ and $\overline{\rho}$ is the unique representation with projective image $A_4.$

Of course the extension for $p = 163$ (which is well-known) does not have big image in the sense described above.
The most annoying thing about this computation (which is described in the apocrypha) is that it can only be done once! Namely, someone who could actually program might be able to extend the computation to (say) $p \le 200,$ but the number of extensions which one would expect to see is roughly $\log \log 200 – \log \log 79,$ which is smaller than a fifth. So maybe an extension of this kind will never be found! (Apologies for ruining it by not getting it right the first time.)

Posted in Mathematics | Tagged Abandonware, Akshay Venkatesh, Andrew Odlyzko, Bernard Woolley, David Geraghty, Dick Gross, Discriminant Bounds, Galois Representations, Imaginary Quadratic Fields, Jacquet-Langlands, John Voight, Lassina Dembélé, Matthew Greenberg, Peter Sarnak, Publishing, Taylor-Wiles, Under The Cosh, Yes Minister | 4 Comments

The ABC conjecture has (still) not been proved

Posted on December 17, 2017 by Persiflage

The ABC conjecture has (still) not been proved.

Five years ago, Cathy O’Neil laid out a perfectly cogent case for why the (at that point recent) claims by Shinichi Mochizuki should not (yet) be regarded as constituting a proof of the ABC conjecture. I have nothing further to add on the sociological aspects of mathematics discussed in that post, but I just wanted to report on how the situation looks to professional number theorists today. The answer? It is a complete disaster.

This post is not about making epistemological claims about the truth or otherwise of Mochizuki’s arguments. To take an extreme example, if Mochizuki had carved his argument on slate in Linear A and then dropped it into the Mariana Trench, then there would be little doubt that asking about the veracity of the argument would be beside the point. The reality, however, is that this description is not so far from the truth.

Each time I hear of an analysis of Mochizuki’s papers by an expert (off the record) the report is disturbingly familiar: vast fields of trivialities followed by an enormous cliff of unjustified conclusions. The defense of Mochizuki usually rests on the following point: The mathematics coming out of the Grothendieck school followed a similar pattern, and that has proved to be a cornerstone of modern mathematics. There is the following anecdote that goes as follows:

The author hears the following two stories: Once Grothendieck said that there were two ways of cracking a nutshell. One way was to crack it in one breath by using a nutcracker. Another way was to soak it in a large amount of water, to soak, to soak, and to soak, then it cracked by itself. Grothendieck’s mathematics is the latter one.

While rhetorically expedient, the comparison between Mochizuki and Grothendieck is a poor one. Yes, the Grothendieck revolution upended mathematics during the 1960’s “from the ground up.” But the ideas coming out of IHES immediately spread around the world, to the seminars of Paris, Princeton, Moscow, Harvard/MIT, Bonn, the Netherlands, etc. Ultimately, the success of the Grothendieck school is not measured in the theorems coming out of IHES in the ’60s but in how the ideas completely changed how everyone in the subject (and surrounding subjects) thought about algebraic geometry.

This is not a complaint about idiosyncrasy or about failing to play by the rules of the “system.” Perelman more directly repudiated the conventions of academia by simply posting his papers to the arXiV and then walking away. (Edit: Perelman did go on an extensive lecture tour and made himself available to other experts, although he never submitted his papers.) But in the end, in mathematics, ideas always win. And people were able to read Perelman’s papers and find that the ideas were all there (and multiple groups of people released complete accounts of all the details which were also published within five years). Usually when there is a breakthrough in mathematics, there is an explosion of new activity when other mathematicians are able to exploit the new ideas to prove new theorems, usually in directions not anticipated by the original discoverer(s). This has manifestly not been the case for ABC, and this fact alone is one of the most compelling reasons why people are suspicious.

The fact that these papers have apparently now been accepted by the Publications of the RIMS (a journal where Mochizuki himself is the managing editor, not necessary itself a red flag but poor optics none the less) really doesn’t change the situation as far as giving anyone a reason to accept the proof. If anything, the value of the referee process is not merely in getting some reasonable confidence in the correctness of a paper (not absolute certainty; errors do occur in published papers, usually of a minor sort that can be either instantly fixed by any knowledgeable reader or sometimes with an erratum, and more rarely requiring a retraction). Namely, just as importantly, it forces the author(s) to bring the clarity of the writing up to a reasonable standard for professionals to read it (so they don’t need to take the same time duration that was required for the referees, amongst other things). This latter aspect has been a complete failure, calling into question both the quality of the referee work that was done and the judgement of the editorial board at PRIMS to permit papers in such an unacceptable and widely recognized state of opaqueness to be published. We do now have the ridiculous situation where ABC is a theorem in Kyoto but a conjecture everywhere else. (edit: a Japanese reader has clarified to me that the newspaper articles do not definitively say that the papers have been accepted, but rather the wording is something along the lines of “it is planned that PRIMS will accept the paper,” whatever that means. This makes no change to the substance of this post, except that, while there is still a chance the papers will not be accepted in their current form, I retract my criticism of the PRIMS editorial board.)

So why has this state persisted so long? I think I can identify three basic reasons. The first is that mathematicians are often very careful (cue the joke about a sheep at least one side of which is black). Mathematicians are very loath to claim that there is a problem with Mochizuki’s argument because they can’t point to any definitive error. So they tend to be very circumspect (reasonably enough) about making any claims to the contrary. We are usually trained as mathematicians to consider an inability to understand an argument as a failure on our part. Second, whenever extraordinary claims are made in mathematics, the initial reaction takes into account the past work of the author. In this case, Shinichi Mochizuki was someone who commanded significant respect and was considered by many who knew him to be very smart. It’s true (as in the recent case of Yitang Zhang) that an unknown person can claim to have proved an important result and be taken seriously, but if a similarly obscure mathematician had released 1000 pages of mathematics written in the style of Mochizuki’s papers, they would have been immediately dismissed. Finally, in contrast to the first two points, there are people willing to come out publicly and proclaim that all is well, and that the doubters just haven’t put in the necessary work to understand the foundations of inter-universal geometry. I’m not interested in speculating about the reasons they might be doing so. But the idea that several hundred hours at least would be required even to scratch the beginnings of the theory is either utter rubbish, or so far beyond the usual experience of how things work that it would be unique not only in mathematics, but in all of science itself.

So where to from here? There are a number of possibilities. One is that someone who examines the papers in depth is able to grasp a key idea, come up with a major simplification, and transform the subject by making it accessible. This was the dream scenario after the release of the paper, but it becomes less and less likely by the day (and year). But it is still possible that this could happen. The flip side of this is that someone could find a serious error, which would also resolve the situation in the opposite way. A third possibility is that we have (roughly) the status quo: no coup de grâce is found to kill off the approach, but at the same time the consensus remains that people can’t understand the key ideas. (I should say that whether the papers are accepted or not in a journal is pretty much irrelevant here; it’s not good enough for people to attest that they have read the argument and it is fine, someone has to be able to explain it.) In this case, the mathematical community moves on and then, whether it be a year, a decade, or a century, when someone ultimately does prove ABC, one can go back and compare to see if (in the end) the ideas were really there after all.

Posted in Mathematics, Politics, Rant | Tagged ABC, Grothendieck, Shinichi Mochizuki | 34 Comments

Graduation Day

Posted on November 26, 2017 by Persiflage

This last summer, I undertook my last official activity as a faculty member at Northwestern University, namely, graduation day! (I had a 0% courtesy appointment for two years until my last Northwestern students graduated.)

Here I am with four of my six former students. (Richard and Vlad actually graduated in 2016, but were hooded together with Joel in 2017.)

$Me and some of my students$

From left-to-right: Richard Moy is a postdoc at Wilamette College in Portland (for previous blog posts on Richard’s work, see Hilbert Modular Forms Part II and Part III), Zili Huang (Thurston and Random Polynomials) has a real job at a consulting firm in Chicago but swung by to say hello on graduation day, Vlad Serban (The Thick Diagonal) has as postdoctoral position in Vienna, and Joel Specter (Hilbert Modular Forms Part II and … hmmm, I guess I didn’t blog about any of his other papers) has just started a postdoc position at Johns Hopkins. Missing are Zoey Guo (Abelian Spiders), now at the Institute of Solid Mechanics at Tsinghua University in Beijing , and my first student Maria Stadnik (who just moved to Florida Atlantic University, and whose thesis predates this blog).

It’s easy to get the sense as a student that math departments are fairly static (which is mostly true over the 4 years or so it takes to do a PhD), but as time goes on, people end up moving around much more than you expect, and the characters of various departments change quite a bit. A sign of good hiring is that your faculty leave because they have been recruited elsewhere! And even though my departure two years ago brought one era of number theory at Northwestern to an end — starting with Matt, then me, two one-year cameo appearances by Toby, and a string of very successful postdocs (not to mention the occasional visitors) — a new era has already begun, with the hiring of Yifeng Liu and Bao Le Hung.

Posted in Mathematics | Tagged Bao Le Hung, David Savitt, Ellen Eischen, Florian Herzig, Graduation, Joel Specter, Langlands, Maria Stadnik, Matthew Emerton, Northwestern, Patrick Allen, Richard Moy, Simon Marshall, Students, The Hawk, Toby Gee, Vlad Serban, Yifeng Liu, Zili Huang, Zoey Guo | 1 Comment

Abelian Surfaces are Potentially Modular

Posted on November 11, 2017 by Persiflage

Today I wanted (in the spirit of this post) to report on some new work in progress with George Boxer, Toby Gee, and Vincent Pilloni. Edit: The paper is now available here.

Recal that, for a smooth projective variety X over a number field F unramified outside a finite set of primes S, one may write down a global Hasse-Weil zeta function:

$\displaystyle{ \zeta_{X,S}(s) = \prod \frac{1}{1 – N(x)^{-s}}}$

where the product runs over closed points of a smooth integral model. From the Weil conjectures, the function $\zeta_{X,S}(s)$ is absolutely convergent for s with real part at least $1+m/2,$ where $m = \mathrm{dim}(X).$ One has the following well-known conjecture:

Hasse–Weil Conjecture: The function $\zeta_{X,S}(s)$ extends to a meromorphic function on the complex plane. Moreover, there exists a rational number A, a collection of polynomials $P_v(T)$ for v dividing S, and infinite Gamma factors $\Gamma_v(s)$ such that

$\displaystyle{ \xi_{X}(s) = \zeta_{X,S}(s) \cdot A^{s/2} \cdot \prod_{v|\infty} \Gamma_v(s) \cdot \prod_{v|S} \frac{1}{P_v(N(v)^{-s})}}$

satisfies the functional equation $\xi_X(s) = w \cdot \xi_X(m+1-s)$ with $w = \pm 1.$

Naturally, one can be more precise about the conductor and the factors at the bad primes. In the special case when F = Q and X is a point, then $\zeta_{X,S}(s)$ is essentially the Riemann zeta function, and the conjecture follows from Riemann’s proof of the functional equation. If F is a general number field but X is still a point, then $\zeta_{X,S}(s)$ is (up to some missing Euler factors at S) the Dedekind zeta function $\zeta_F(s)$ of F, and the conjecture is a theorem of Hecke. If X is a curve of genus zero over F, then $\zeta_{X,S}(s)$ is $\zeta_F(s) \zeta_F(s-1),$ and one can reduce to the previous case. More generally, by combining Hecke’s results with an argument of Artin and Brauer about writing a representation as a virtual sum of induced characters from solvable (Brauer elementary) subgroups, one can prove the result for any X for which the l-adic cohomology groups are potentially abelian. This class of varieties includes those for which all the cohomology of X is generated by algebraic cycles.

For a long time, not much was known beyond these special cases. But that is not to say there was not a lot of progress, particularly in the conjectural understanding of what this conjecture really was about. The first huge step was the discovery and formulation of the Taniyama-Shimura conjecture, and the related converse theorems of Weil. The second was the fundamental work of Langlands which cast the entire problem in the (correct) setting of automorphic forms. In this context, the Hasse-Weil zeta functions of modular curves were directly lined to the L-functions of classical weight 2 modular curves. More generally, the Hasse-Weil zeta functions of all Shimura varieties (such as Picard modular surfaces) should be linked (via the trace formula and conjectures of Langlands and Kottwitz) to the L-functions of automorphic representations. On the other hand, these examples are directly linked to the theory of automorphic forms, so the fact that their Hasse-Weil zeta functions are automorphic, while still very important, is not necessarily evidence for the general case. In particular, there was no real strategy for taking a variety that occurred “in nature” and saying anything non-trivial about the Hasse-Weil zeta function beyond the fact it converged for real part greater than $1 + m/2,$ which itself requires the full strength of the Weil conjectures.

The first genuinely new example arrived in the work of Wiles (extended by others, including Breuil-Conrad-Diamond-Taylor), who proved that elliptic curves E/Q were modular. An immediate consequence of this theorem is that Hasse-Weil conjecture holds for elliptic curves over Q. Taylor’s subsequent work on potentially modularity, while not enough to prove modularity of all elliptic curves over all totally real fields, was still strong enough to allow him to deduce the Hasse-Weil conjecture for any elliptic curve over a totally real field. You might ask what have been the developments since these results. After all, the methods of modularity have been a very intense subject of study over the past 25 years. One problem is that these methods have been extremely reliant on a regularity assumption on the corresponding motives. One nice example of a regular motive is the symmetric power of any elliptic curve. On the other hand, if one takes a curve X over a number field, then h^{1,0} = h^{0,1} = g, and the corresponding motive is regular only for g = 0 or 1. The biggest progress in automorphy of non-regular motives has actually come in the form of new cases of the Artin conjecture — first by Buzzard-Taylor and Buzzard, then in the proof of Serre’s conjecture by Khare-Wintenberger over Q, and more recently in subsequent results by a number of people (Kassaei, Sasaki, Pilloni, Stroh, Tian) over totally real fields. But these results provide no new cases of the Hasse-Weil conjecture, since the Artin cases were already known in this setting by Brauer. (It should be said, however, that the generalized modularity conjecture is now considered more fundamental than the Hasse-Weil conjecture.) There are a few other examples of Hasse-Weil one can prove by using various forms of functoriality to get non-regular motives from regular ones, for example, by using the Arthur-Clozel theory of base change, or by Rankin-Selberg. We succeed, however, in establishing the conjecture for a class of motives which is non-regular in an essential way. The first corollary of our main result is as follows:

Theorem [Boxer,C,Gee,Pilloni] Let X be a genus two curve over a totally real field. The the Hasse-Weil conjecture holds for X.

It will be no surprise to the experts that we deduce the theorem above from the following:

Theorem [BCGP] Let A be an abelian surface over a totally real field F. Then A is potentially modular.

In the case when A has trivial endomorphisms (the most interesting case), this theorem was only known for a finite number of examples over $\mathbf{Q}.$ In each of those cases, the stronger statement that A is modular was proved by first explicitly computing the corresponding low weight Siegel modular form. For example, the team of Brumer-Pacetti-Tornaría-Poor-Voight-Yuen prove that the abelian surfaces of conductors 277, 353, and 587 are all modular, using (on the Galois side) the Faltings-Serre method, and (on the automorphic side) some really quite subtle computational methods developed by Poor and Yuen. A paper of Berger-Klosin handles a case of conductor 731 by a related method that replaces the Falting-Serre argument by an analysis of certain reducible deformation rings.

The arguments of our paper are a little difficult to summarize for the non-expert. But George Boxer did a very nice job presenting an overview of the main ideas, and you can watch his lecture online (posted below, together with Vincent’s lecture on higher Hida theory). The three sentence version of our approach is as follows. There was a program initiated by Tilouine to generalize the Buzzard-Taylor method to GSp(4), which ran into technical problems related to the fact that Siegel modular forms are not directly reconstructible from their Hecke eigenvalues. There was a second approach coming from my work with David Geraghty, which used instead a variation of the Taylor-Wiles method; this ran into technical problems related to the difficulty of studying torsion in the higher coherent cohomology of Shimura varieties. Our method is a synthesis of these two approaches using Higher Hida theory as recently developed by Pilloni. Let me instead address one or two questions here that GB did not get around to in his talk:

What is the overlap of this result with [ACCGHLNSTT]? Perhaps surprisingly, not so much. For example, our results are independent of the arguments of Scholze (and now Caraiani-Scholze) on constructing Galois representations to torsion classes in Betti cohomology. We do give a new proof of the result that elliptic curves over CM fields are potentially modular, but that is the maximal point of intersection. In contrast, we don’t prove that higher symmetric powers of elliptic curves are modular. We do, however, prove potentially modularity of all elliptic curves over all quadratic extensions of totally real fields with mixed signature, like $\mathbf{Q}(2^{1/4}).$ The common theme is (not surprisingly) the Taylor-Wiles method (modified using the ideas in my paper with David Geraghty).

What’s new in this paper which allows you to make progress on this problem? George explains this well in his lecture. But let me at least stress this point: Vincent Pilloni’s recent work on higher Hida Theory was absolutely crucial. Boxer, Gee, and I were working on questions related to modularity in the symplectic case, but when Pilloni’s paper first came out, we immediately dropped what we were doing and started working (very soon with Pilloni) on this problem. If you have read the Calegari-Geraghty paper on GSp(4) and are not an author of the current paper (hi David!), and you look through our manuscript (currently a little over 200 pages and [optimistically?!] ready by the end of the year), then you also recognize other key technical points, including a more philosophically satisfactory doubling argument and Ihara avoidance in the symplectic case, amongst other things.

So what about modularity? Of course, we deduce our potential modularity result from a modularity lifting theorem. The reason we cannot deduce that Abelian surfaces are all modular, even assuming for example that they are ordinary at 3 with big residual image, is that Serre’s conjecture is not so easy. Not only is $\mathrm{GSp}_4(\mathbf{F}_3)$ not a solvable group, but — and this is more problematic — Artin representations do not contribute to the coherent cohomology of Shimura varieties in any setting other than holomorphic modular forms of weight one. Still, there are some sources of residually modular representations, including the representations induced from totally real quadratic extensions (for small primes, at least). We do, however, prove the following (which GB forgot to mention in his talk, so I bring up here):

Proposition [BCGP]: There exist infinitely modular abelian surfaces (up to twist) over Q with End_C(A) = Z.

This is proved in an amusing way. It suffices to show that, given a residual representation

$\overline{\rho}: G_{\mathbf{Q}} \rightarrow \mathrm{GSp}_4(\mathbf{F}_3)$

with cyclotomic similitude character (or rather inverse cyclotomic character with our cohomological normalizations) which has big enough image and is modular (plus some other technical conditions, including ordinary and p-distinguished) that it comes from infinitely many abelian surfaces over Q, and then to prove the modularity of those surfaces using the residual modularity of $\overline{\rho}.$ This immediately reduces to the question of finding rational points on some twist of the moduli space $\mathcal{A}_2(3).$ And this space is rational! Moreover, it turns out to be a very famous hypersurface much studied in the literature — it is the Burkhardt Quartic. Now unfortunately — unlike for curves — it’s not so obvious to determine whether a twist of a higher dimensional rational variety is rational or not. The problem is that the twisting is coming from an action by $\mathrm{Sp}_4(\mathbf{F}_3),$ and that action is not compatible with the birational map to $\mathbf{P}^3,$ so the resulting twist is not a priori a Severi-Brauer variety. However, something quite pleasant happens — there is a degree six cover

$\mathcal{A}^{-}_2(3) \stackrel{6:1}{\rightarrow} \mathcal{A}_2(3)$

(coming from a choice of odd theta characteristic) which is not only still rational, but now rational in an equivariant way. So now one can proceed following the argument of Shepherd-Barron and Taylor in their earlier paper on mod-2 and mod-5 Galois representations.

What about curves of genus g > 2?: Over $\mathbf{Q},$ there is a tetrachotomy corresponding to the cases g = 0, g = 1, g = 2, and g > 2. The g = 0 case goes back to the work of Riemann. The key point in the g = 1 case (where the relevant objects are modular forms of weight two) is that there are two very natural ways to study these objects. The first (and more classical) way to think about a modular form is as a holomorphic function on the upper half plane which satisfies specific transformation properties under the action of a finite index subgroup of $\mathrm{SL}_2(\mathbf{Z}).$ This gives a direct relationship between modular forms and the coherent cohomology of modular curves; namely, cuspidal modular forms of weight two and level $\Gamma_0(N)$ are exactly holomorphic differentials on the modular curve $X_0(N).$ On the other hand, there is a second interpretation of modular forms of weight two in terms of the Betti (or etale or de Rham) cohomology of the modular curve. A direct way to see this is that holomorphic differentials can be thought of as smooth differentials, and these satisfy a duality with the homology group $H_1(X_0(N),\mathbf{R})$ by integrating a differential along a loop. And it is the second description (in terms of etale cohomology) which is vital for studying the arithmetic of modular forms. When g = 2, there is still a description of the relevant forms in terms of coherent cohomology of Shimura varieties (now Siegel 3-folds), but there is no longer any direct link between these coherent cohomology groups and etale cohomology. Finally, when g > 2, even the relationship with coherent cohomology disappears — the relevant automorphic objects have some description in terms of differential equations on locally symmetric spaces, but there is no longer any way to get a handle on these spaces. For those that know about Maass forms, the situation for g > 2 is at least as hard (probably much harder) than the notorious open problem of constructing Galois representations associated to Maass forms of eigenvalue 1/4. In other words, it’s probably very hard! (Of course, there are special cases in higher genus when the Jacobian of the curve admits extra endomorphisms which can be handled by current methods.)

Finally, as promised, here are the videos:

Posted in Mathematics | Tagged Abelian Surfaces, Ana Caraiani, Andrew Wiles, Ariel Pacetti, Armand Brumer, Benoit Stroh, Betti cohomology, coherent cohomology, Cris Poor, David Geraghty, David Yuen, etale cohomology, George Boxer, Gonzalo Tornaría, Hasse-Weil Conjecture, Higher Hida Theory, IAS, Jacques Tilouine, John Voight, Kevin Buzzard, Kris Klosin, Langlands, Maass Forms, Payman Kassaei, Peter Scholze, RLT, Shekar Khare, Shepherd-Barron, Shu Susaki, Tetrachotomy, Tobias Berger, Toby Gee, Vincent Pilloni, Wintenberger, Yichao Tian | 12 Comments

Jobs Related Public Service Announcements

Posted on November 4, 2017 by Persiflage

Job season is upon us. Now is probably a good time to give applicants (and letter writers!) a few pointers. Of course, there are many other sources of advice on this topic, so let me try to narrow the focus on suggestions that you might not find elsewhere.

But first, I am contractually obligated (and also happy) to remind you all to make sure all your best graduate students (in all fields) apply for a Dickson Instructorship at Chicago. Occasionally people get the impression that our deadline is November 1st. In fact, that is merely the date after which we are allowed to start reading recommendations. In reality, committee members will most likely start reading the files over Thanksgiving break, so definitely try to have all your materials (and letters of recommendation) submitted by then. In contrast, some of the public schools (including the UC system, correct me if I’m wrong) have hard application deadlines. In those cases, it is vital that you submit your application before the deadline (it doesn’t need to be complete, just submitted).

I’m applying (or writing a letter) for the second year in a row. Any tips? A number of people apply when they have an extra year remaining in their current position to a limited number of schools. I don’t know enough game theory to evaluate this strategy, but the scales are definitely tipped in favor of doing this when two body problems are involved. But be warned! There is a technical issue on mathjobs which arises which you almost definitely will not be able to anticipate as an applicant. It is the following. When a letter writer submits a letter of recommendation to mathjobs, there is a default setting on how long that letter can be viewed. And for some ridiculous reason, that time period is something like 18 months. A letter writer can, and I do, change the default period to any date one wants (I usually make the letter expire sometime during the following summer). But not all of your letter writers seem to realize this! That means that when you go to apply the following year, your mathjobs listing will have your letters from the current year AND your letters from the previous year, unless your letter writer actively makes the effort to delete the old letter. The first thing this signals to those reading your letters is that you applied the previous year. This on its own is not so bad. However, it is very often the case that the letter in year N+1 is pretty much identical to the letter in year N. And that does give the impression that the applicant hasn’t really done anything in the previous twelve months. The worst aspect of this problem is that there is not really any way for the candidate prevent it, beyond warning their letter writers about the problem. So this is mostly a reminder for letter writers who are writing for the second time in two years: make sure you delete/replace your letters from the previous year! (Or do make sure your secretaries do this on your behalf, if that’s how you roll.)

Should I write to people at universities letting them know about my application? This is generally considered a worthwhile thing to do, because, even in cases in which you are not offered the job, it does give a way of letting people know about your research. In the other direction, a suitably customized and genuine email can let the relevant people know that you might accept a position if you are offered one. A few caveats, however. I appreciate letters which let me know about an application but don’t require a response. Secondly, there should be some synergy between your own research and the person you are writing to, otherwise it looks a little like you are just spamming everyone. Finally, there should be something at least slightly realistic about your application, especially for more senior positions. (But slightly is good enough.)

How many letters do I really need? Let’s specialize now to the case of postdoc applications, although some of this also applies to tenure track letters. This definitely a case where “more” is usually not “better.” Counting the teaching letter separately, a first approximation would be as follows:

Four shalt thou not count, neither count thou two, excepting that thou then proceed to three. Five is right out.

Here’s the problem with having (say) six letters. Most of the time, as a graduate student, there are not going to be six people who know your thesis work really well. Maybe you feel your application looks a little fancier because Professor Fancy McBoatface agreed to write for you, even though you just had that one conversation at a conference. But then the first letter people will click on will be from Professor McBoatface, which will say something like “I chatted with X at a conference once, it seems like they are doing something interesting, although I don’t know the work very well.” Basically, too many letters will dilute the message. Of course, it does look good if you can get a strong letter from a well known expert who is not at your university, but that is much more likely to happen if you have had some genuine sustained mathematical interaction with that person, rather than some fleeting interaction. (I had letters out of graduate school from Kevin Buzzard, with whom I was writing a paper, and René Schoof, who visited Berkeley for a semester and with whom my interaction was directly related to part of my thesis.) There are circumstances in which there is someone (say your advisor) who has to write for you, but for some reason you suspect that their letter may not be as strong as you would like; that’s one justifiable reason to hedge with an extra letter. But in the end, the people who are going to write the strongest letters for you are probably going to be the people who know your work the best.

Posted in Politics | Tagged applications, Dickson Instructorship, Kevin Buzzard, mathjobs, Rene Schoof, University of Chicago | 1 Comment

Schaefer and Stubley on Class Groups

Posted on October 28, 2017 by Persiflage

I talked previously about work of Wake and Wang-Erickson on deformations of Eisenstein residual representations. In that post, I also mentioned a paper of Emmanuel Lecouturier who has also proved some very interesting theorems. Today, I wanted to talk about some complementary results by my student Eric Stubley in collaboration with Karl Schaefer (a student of Matthew Emerton). To duplicate slightly from that previous post, recall that Matt and I proved the following:

Theorem Let p > 3 be prime, and let N = 1 mod p be prime. If the rank of the cuspidal Hecke algebra of level $\Gamma_0(N)$ localized at the Eisenstein prime is greater than one, then

$K = \mathbf{Q}(N^{1/p})$

has non-cyclic p-class group. Using work of Merel, one can dispense with the discussion of Hecke algebras and instead give an equivalent reformulation of the first condition, namely, $e > 1$ if and only if $M_1$ is a p-th power, where

$M_1 = \displaystyle{\prod_{k=1}^{p-1} (Mk)!^k \in \mathbf{F}^{\times}_N, \qquad M = \frac{N-1}{p}}$

We followed up this result with the comment:

We expect (based on the numerical evidence) that the condition that the class group of K has p-rank [at least] two is equivalent to the existence of an appropriate group scheme, and thus to [the rank being greater than one].

As noted previously, there are counter-examples, already for p = 7 and N = 337. However, there was still clearly some relationship between these quantities beyond the one-way implication above. In particular, the numerical evidence still stubbornly supported the hope that the converse may indeed be true for p = 5. This is the first theorem that Schaefer and Stubley prove. More precisely, they completely determine the rank of the class group of $\mathbf{Q}(N^{1/5})$ for primes N which are 1 mod 5.

Theorem [Schaefer, Stubley]: Let $N \equiv 1 \mod 5$ be prime. Then the 5-rank r_K of the class group of $K = \mathbf{Q}(N^{1/5})$ is either 1, 2, or 3. Moreover:

$r_K = 1$ if and only if the Merel invariant $M_1$ is not a perfect 5th power.
$r_K = 2$ if and only if $M_1$ is a perfect 5th power, and $\displaystyle{\alpha = \frac{\sqrt{5} – 1}{2}}$ is not a perfect 5th power modulo N.
$r_K = 3$ if and only if $M_1$ and $\alpha$ are both 5th powers modulo N.

This also answers a conjecture of Lecouturier. Their argument greatly clarified (to me) the exact relationship between the class group of K and a number of other related quantities in this picture. To recall, a third reformulation of whether the Hecke algebra has non-trivial deformations can be given (as in Wake–Wang-Erickson) by whether a certain pairing between specific classes $b$ and $c_{-1}$ in $H^1_{Np}(\mathbf{Q},\epsilon)$ and $H^1_{Np}(\mathbf{Q},\epsilon^{-1})$ vanish or not. The point is that the vanishing of a cup product ensures the existence of an extension

$\left( \begin{matrix} 1 & b & c_0 \\ 0 & \epsilon^{-1} & c_{-1} \\ 0 & 0 & 1 \end{matrix} \right)$

and one can show (after some massaging) that c_0 gives rise to something in the p-class group of K. Conversely, if one starts with a class in the p-class group of K, and then takes the Galois closure over Q, then (sometimes) one arrives with a Galois extension M/Q with a Galois representation to GL(3) of the above form. The problem is, in other circumstances, one arrives at a representation which has a much larger Galois group and a map to the Borel subgroup in higher dimension, which looks something like this:

$ \displaystyle{
\left( \begin{matrix} 1 & \epsilon^{-1} \cdot b & \epsilon^{-2} \cdot b^2/2 & \epsilon^{-3} \cdot b^3/6 & & \ldots & c_{0} \\
0 & \epsilon^{-1} & \epsilon^{-2} \cdot b & \epsilon^{-3} \cdot b^2/2 & & \ldots & c_{-1} \\
& & \ddots & & & \\
\ldots & & & & \epsilon^{1-m} & \epsilon^{-m} \cdot b & c_{1-m} \\
\ldots & & & & & \epsilon^{-m} & c_{-m} \\
\ldots & & & & & & 1 \end{matrix} \right)}$

Suppose one now tries to construct a representation of this form in order to find a non-trivial class in the p-class group of K. First, one can start by finding a suitable class $ c_{-m} \in H^1_{Np}(\mathbf{Q},\epsilon^{-m})$ which cups trivially with $latex b.$ The vanishing of a generalized Merel invariant (under a regularity hypothesis) is exactly what guarantees the existence of such a suitable class $c_{-m},$ at least when m is odd. However, one is then faced with an increasing sequence of obstruction problems in order to climb the ladder and get all the way to the full representation of the form above. Here one has to deal with not only cup products, but also (implicitly) higher Massey products. Ultimately, the relation between the quantity $r_K$ and the deformation rings of Hecke algebras is most precise only when $p = 5$. It turns out that there is still something one can say for $p = 7,$ however. Consider the higher Merel invariant

$M_n = \displaystyle{\prod_{k=1}^{p-1} (Mk)!^{k^n} \in \mathbf{F}^{\times}_N, \qquad M = \frac{N-1}{p}}$

for odd values of n. Suppose that p is a regular prime. One can show that if $r_K \ge 2$, then at least one of these quantities M_n is a perfect pth power for an odd $n \le p-4.$ When p = 5, this is a weaker version of the theorem above. So an optimistic variation on the conjecture above is that $r_K \ge 2$ if and only if $M_n$ is a perfect pth power of for at least one odd $n \le p-4.$ The description of the relationship between these classes (which also come up in Lecouturier, they arise via an explicit analysis of Gauss sums and Stickelberger’s theorem) suggests that this conjecture is too optimistic in general, and indeed there are counter-examples for p = 11. But, Schaefer and Stubley do prove the following:

Theorem [Schaefer, Stubley]: Let p = 7, and let N = 1 mod p be prime. Then the 7-class group of $K = \mathbf{Q}(N^{1/p})$ has rank $r_K \ge 2$ if and only if either M_1 or M_3 is a perfect 7th power modulo N.

For example, consider the previous “counter-example” for N = 337 and p = 7. Here the non-trivial class group is explained by the fact that M_3 is a perfect 7th power modulo N.

One thing I especially like about this result is that there are three groups of people (Wake–Wang-Erickson, Lecouturier, and Schaefer–Stubley) are all working around a similar problem, but their results are complementary to each other. I believe that all five people will be at the upcoming IAS workshop, so I hope to hear more about this then.

Posted in Mathematics, Students | Tagged Barry Mazur, Carl Wang Erickson, Class Field Theory, Eisenstein Ideal, Emmanuel Lecouturier, Eric Stubley, Karl Schaefer, Matthew Emerton, Preston Wake, Stickelberger's Theorem | 1 Comment

J’accuse!

Posted on October 21, 2017 by Persiflage

I found the following documentary remarkable and quite interesting. Without offering here any opinion on its merits, I certainly give it credit for taking an unpopular position and sticking with it. This blog is no stranger to challenging perceived wisdom, although I usually aim to be slightly more subtle (some may argue I do not always succeed). Here is an excerpt from the opening:

The fishing village of Aldeburgh, home and inspiration to Benjamin Britten, England’s finest 20th century composer, or so it’s widely claimed. In fact, much of what he wrote in the sycophantic, closed world of Aldeburgh was anaemic, and loveless; spiritually dead long before he was buried here in 1976.

I’m not entirely sure what the academic consensus about Britten is nowadays (if any exists). I do appreciate some of his smaller scale choral works. I wouldn’t say that Britten’s work is played excessively in relation to its merit in the US, but possibly things are different in London.

Posted in Music | Tagged Benjamin Britten, J'accuse, Tall Poppy Syndrome | 4 Comments

Mathieu Magic

Posted on October 16, 2017 by Persiflage

I previously mentioned that I once made (in a footnote) the false claim that for a 11-dimensional representation V of the Mathieu group M_12, the 120 dimensional representation Ad^0(V) was irreducible. I had wanted to write down representations W of large dimension n such that Ad^0(W) of dimension n^2 – 1 was irreducible. In the comments, Emmanuel Kowalski pointed to a paper of Katz where he discusses actual examples (including the 1333 dimensional representation of the Janko group J_4). On the other hand, I recently learned from Liubomir Chirac’s thesis:

https://thesis.library.caltech.edu/8942/1/Chiriac_Thesis.pdf

that it’s an open problem to determine whether there exists such a representation for all n (although he does write down infinitely many examples in prime power dimension). Chirac’s thesis also lead me to the paper of Magaard, Malle, and Tiep, who do classify all such examples for (central extensions of) simple groups. Turns out that I could have used M_12 after all, or rather the 10-dimensional representation of the double cover 2.M_12, which does have the required property (the 99-dimensional representation factors through M_12, naturally).

One reason (amongst many) that (either of the) 11-dimensional representations V of M_12 do not have Ad^0(V) irreducible is that they are self-dual (oops). On the other hand, if you eyeball the character table, you will find that there is an irreducible representation W of dimension 120. Moreover, let me write down the characters of [V \otimes V^*] – [1] and [W]:

$latex \begin{aligned}
& [V \otimes V^*] – [1]: & \ 120, 0, \ \ 8, 3, 0, 0, 8, 0, 0, -1, 0, 0, 0, -1, -1; \\
& [W]: & \ 120, 0, -8, 3, 0, 0, 0, 0, 0, \ \ 1, 0, 0, 0, -1, -1. \end{aligned}$

These seem surprisingly close to me! So now the question is, as one ranges over (some class perhaps all) finite groups G, what is the minimum number of conjugacy classes for which

\chi = [V \otimes V^*] – [1] – [W]

can be non-zero for irreducible V and W, assuming that it is non-zero? Since V is irreducible, by Schur’s Lemma, this virtual representation is orthogonal to [1] (unless [W] = [1] which would be silly). So $\langle \chi,1 \rangle = 0,$ which certainly implies that there must be at least two non-zero entries of opposite signs. I don’t see any immediate soft argument which pushes that bound to 3. I admit, this is a slightly silly question. But still, a beer to anyone who proves the example above is either optimal or comes up with an example with only two non-zero terms. (To avoid silliness, say that the dimension of V has to be at least 5.) The characters above look strikingly similar to me, and it does make we wonder if there is any reason for why they are so close. Perhaps if I knew more about groups, I could feel more confident in just chalking up the resemblance above to a law of small numbers.

Probably a more sensible question is to ask for how small the number of non-zero entries of of [V]-[W] can be for two distinct irreducibles. That question has surely been studied!

Posted in Mathematics | Tagged Barry Mazur, Emmanuel Kowalski, Liubomir Chirac, Magaard, Malle, Mathieu Group, Tiep | 3 Comments

Commuting

Posted on October 8, 2017 by Persiflage

I found out a good way to describe how long my commute is: about three minutes more than the length of the second movement of Beethoven’s 9th (the greatest movement!)

On the other hand, that measure proved inaccurate the very next day, when I also found out the answer to “is the drawbridge on Lake Shore Drive ever used?”

(channelling my inner Stanley Kubrick with a little well-timed help from 98.7WFMT). The whole opening/closing of the bridge did cause quite some delay, but the process did, in in the end, finish.

Posted in Music | Tagged Beethoven, Bridges, Chicago, Poor Puns, Schubert | Leave a comment

Hiring Season

Abandonware

The ABC conjecture has (still) not been proved

Graduation Day

Abelian Surfaces are Potentially Modular

Jobs Related Public Service Announcements

Schaefer and Stubley on Class Groups

J’accuse!

Mathieu Magic

Commuting

Recent Posts

Recent Comments

Blogroll

Categories

Archives

Meta