Performance optimization and modeling of fine-grained irregular communication in UPC