Macrocycles have gained significant attention in drug discovery, with over 70 macrocyclic compounds currently in clinical use. Despite this progress, the effective methods for designing macrocycles remain elusive. In this study, we present Macro-Hop, a reinforced learning framework designed to rapidly and comprehensively explore the macrocycle chemical space. Macro-Hop efficiently generates novel macrocyclic scaffolds that not only align with predefined physicochemical properties but also exhibit 3D structural similarities to a specified reference compound. As a proof of concept, we applied Macro-Hop to design a new series of macrocycle inhibitors targeting PDGFRαD842 V kinase. The representative compound L7 exhibited high potency against PDGFRαD842 V in both biochemical and cellular assays with IC50 values of 23.8 and 2.1 nM, respectively. L7 effectively inhibited clinically relevant secondary mutants PDGFRαD842 V/G680R (IC50 = 64.1 nM) and PDGFRαD842 V/T674I (IC50 = 27.6 nM), highlighting the rapid effectiveness of wet-leb validation with Macro-Hop.