In a curious experiment, I explicitly asked the model to disagree with me if it spotted a flaw. It disagreed on cue, but the pushback was synthetic. It offered a timid counter like "I partially agree with you..." before reaffirming that my original idea was in fact perfectly valid and that its disagreement was only with my formulation.
Claude 4.1 feels more sycophantic than Claude 4.0
- Posted 3 weeks ago by shabie
- 2 points