Nutshell version: Conventional 2D sonar sends each ping as a short burst at a single frequency, and the returning echoes are turned into the picture on your screen. CHIRP instead sweeps each ping across a range of frequencies, so every ping is one longer pulse that slides from a lower frequency to a higher one. The unit compares the echoes against that known sweep, which gives a more refined picture on your screen and better target separation.
This is my best understanding of its essence.
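If it helps to see the target separation part in numbers, here is a rough Python sketch (NumPy only). It builds a single-frequency ping and a swept ping of the same length, runs each through a matched filter (correlating the echo with the transmitted pulse), and measures how wide the resulting blip is for one point target. The sample rate, pulse length, 150-250 kHz sweep, and sound speed are illustrative assumptions, not figures from any particular unit, and the function names are made up for this example.

```python
import numpy as np

# All values below are assumed for illustration only.
FS = 1_000_000                    # sample rate, Hz
DURATION = 1e-3                   # ping length, seconds
F_LOW, F_HIGH = 150_000, 250_000  # CHIRP sweep limits, Hz
SOUND_SPEED = 1500.0              # approximate speed of sound in water, m/s


def make_pulse(f_start, f_end):
    """Complex pulse whose frequency moves linearly from f_start to f_end.

    Passing the same value twice gives a plain single-frequency ping.
    """
    t = np.arange(int(FS * DURATION)) / FS
    sweep_rate = (f_end - f_start) / DURATION
    phase = 2 * np.pi * (f_start * t + 0.5 * sweep_rate * t**2)
    return np.exp(1j * phase)


def compressed_width(pulse):
    """Width in seconds of the matched-filter peak for one point target.

    For a single ideal echo, the matched-filter output is the pulse's
    autocorrelation. The width is measured at half the peak amplitude.
    """
    out = np.abs(np.correlate(pulse, pulse, mode="full"))
    out /= out.max()
    return np.count_nonzero(out >= 0.5) / FS


tone_width = compressed_width(make_pulse(200_000, 200_000))
chirp_width = compressed_width(make_pulse(F_LOW, F_HIGH))

# Two-way travel, so the range spread is half the distance sound covers.
print("single frequency: blip spans about",
      round(tone_width * SOUND_SPEED / 2, 3), "m")
print("CHIRP sweep:      blip spans about",
      round(chirp_width * SOUND_SPEED / 2, 3), "m")
```

The swept ping comes out with a much narrower blip, because after matched filtering its width depends on the bandwidth of the sweep rather than the length of the ping. In practice single-frequency units get resolution by using shorter pings, which puts less energy in the water; the sweep lets a long, energetic ping still resolve targets that sit close together.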