The problem was using endorsements to predict election results when polls did not back it up at all. The polls plus was worse than polls only overall,
I’m not sure what you mean about “polls did not back it up at all”. Polls are one observable. Endorsements are another. In past presidential primary races, they both had predictive power. Sure, the polls have much more predictive power, but for two candidates who are equal in the polls, the candidate with more endorsements was more likely to do well on election day. That was the point of “polls plus”. To incorporate additional variables that aren’t captured in polling.
Yes, the polls plus model did worse than polls only this time, but would have done better in previous races. You can’t know for sure which model is going to work better in advance. That said, as Nate said in his mea culpa, the use of fundamentals in “polls plus” was probably an example of “overfitting”. So, again, we can knock him for that, but I don’t see a problem per se with incorporating additional information besides polls alone.
OK, there are two different things here which we shouldn’t confuse. You can make a model that you develop based on previous years’ presidential primaries. You look at how predictive a poll or any other variable is X number of days before primary day. Then you just wait for the data points to roll in over the course of the campaign. Your model doesn’t actually change over the course of the campaign. Just the data inputs. Both “polls only” and “polls plus” worked this way.
That’s one way to do it. But another way is to wait for election results from the early primary states, then build a demographic model for the rest of the primary campaign based on how each candidate does with each demographic. That’s more of a dynamic model, where you need election returns
from this election in order to even get started. NYT’s Upshot did that, and I had a thread on it. 538 also had a quickie version of this, which they used for their “delegate targets”.
But the problem with this approach is that you’re assuming that there is no movement in candidate support over the course of the campaign. You’re assuming that all of the differences from state to state are demographic differences only, rather than an indication that the race is moving. Maybe that works for the Democratic race this year, but it definitely doesn’t work for the Republican race. I mean, it can give you crude benchmarks, but it’s complicated by the fact that candidates kept dropping out every couple of weeks, and you also saw some real movement, like after New York, when Trump started blowing through demographic targets in subsequent states. It’s a fun thing to look at, but it would be tough to use that in a serious way when the number of candidates in the race changes so much over the course of the campaign.