This document discusses data integration challenges in a big data context using the Open PHACTS case study. Open PHACTS aims to integrate multiple biomedical data resources into a single open access point. It has developed a cloud-based production level system that provides semantic web-based APIs to access integrated data on diseases, tissues, targets, compounds and pathways. The system addresses issues like identity resolution, data quality, provenance and licensing to enable complex queries across diverse data sources.