Top
合 supplies the sound (hé → hé, identical) and a tight semantic echo:
合 means 'to close, fit together' — exactly what a box does with its lid.
合 itself pictures a lid
亼 over a mouth/vessel
口. So
盒 is a phono-semantic compound where the phonetic also reinforces the meaning.