Summary:ย FracFocus disclosures that contain unintentional duplication of records distort the quantity of the chemicals used. Open-FF1 found that Pennsylvania hadย 1,781ย such disclosures byย 34ย different operators. On average, these duplicates inflated chemical mass in disclosures by 90,100 pounds.
Background on duplication errors
The national disclosure instrument,ย FracFocus, documents the fracking chemicals that have been used in over 220,000 fracking jobs in the US since 2011. This extensive data set is especially important because it is one of the few resources available to the public about fracking chemicals. As such, FracFocus should be central to analyses and debates about fracking patterns and impacts. However, poor data integrity can undermine the usefulness of this resource.
The Open-FF project is trying to make errors in FracFocus more visible to stakeholders. Our aim is to alert users of FracFocus data to weaknesses and to encourage operators to correct problems and prevent future errors.
We recently summarized an odd problem in a large number of FracFocus disclosures. Many disclosures have duplicate records, that is, two single lines share identical values in many identifying columns. Because such records can be confused with legitimate records, users of the data are confronted with significant ambiguity. If they are errors, these records distort the quantity of chemical usage.
We’ve found evidence that these duplicates are unintentional. They have also gone largely uncorrected, some for over 8 years. As we’ve noted, because our project is based only on publicly available data, we cannot be certain that the duplicates are not legitimate; only feedback from the companies can clarify this issue.
The scope of Pennsylvania’s duplication issue
As of this date, Pennsylvania has 11,042 disclosures in FracFocus. We have detected duplication issues in 16.1% of those disclosures. The following figure illustrates Pennsylvania’s pattern over time.

Two specific notes about record duplication in Pennsylvania. First, the PA DEP has a state-focus disclosure instrument. When we compared affected disclosures from FracFocus with the PA disclosures, we found no Pennsylvania disclosures with duplicates. Second, the company PennEnergy Resources recently replaced about 90 of its FracFocus disclosures with versions that cleanly removed the duplicates.
The following table summarizes the operating companies with affected disclosures in Pennsylvania. To see examples of the duplication, follow “Link to Report” for summaries of specific companies.
| Operator | detected number of disclosures in Pennsylvania | link to Open-FF report |
|---|---|---|
| EQT Production | 476 | link to report |
| Cabot Oil & Gas Corp | 366 | link to report |
| CNX Gas Company LLC | 166 | link to report |
| Snyder Brothers Inc. | 100 | link to report |
| Repsol O&G, LLC. | 80 | link to report |
| Chesapeake Operating, Inc. | 67 | link to report |
| PennEnergy Resources, LLC | 62 | |
| Seneca Resources Corporation | 56 | |
| Alta Resources | 47 | link to report |
| CONSOL Energy Inc. | 41 | link to report |
| Rex Energy | 33 | |
| Apex Energy LLC | 31 | |
| Chevron USA Inc. | 31 | link to report |
| Greylock Production LLC | 31 | |
| JKLM ENERGY | 25 | |
| Southwestern Energy | 24 | |
| Inflection Energy (PA) LLC | 21 | |
| Vantage Energy Appalachia II LLC | 18 | |
| Rice Drilling B, LLC | 15 | |
| Shell Oil Company affiliate | 14 | |
| EdgeMarc Energy Holdings, LLC | 13 | |
| Pennsylvania General Energy | 12 | |
| XTO Energy/ExxonMobil | 9 | link to report |
| Blackhill Energy LLC | 8 | |
| LOLA Energy PetroCo | 8 | |
| INR Operating, LLC | 6 | |
| COTERRA ENERGY INC. | 5 | |
| Olympus Energy | 5 | |
| BKV Operating LLC | 3 | |
| Beech Resources LLC | 2 | |
| LPR Energy LLC | 2 | |
| Hilcorp Energy Company | 2 | |
| Travis Peak Resources, LLC | 1 | |
| XPR Resources LLC | 1 |
As of this date, many companies continue to publish disclosures with duplications.
What it means to have duplicate records in Pennsylvania disclosures
At the center of this issue is the core purpose of FracFocus: to provide an accurate picture of a critical part of fracking operations to all stakeholders. Where that accuracy is undermined, stakeholder trust will be undermined. To be sure, errors in such an instrument are inevitable and expected. But because of that, there must be some form of quality assurance or auditing to correct problems discovered after publication. So, when visible errors that touch so many disclosures (about 20,000 across all states) and so many companies (more than 500) remain uncorrected for many years, users of FracFocus data may reasonably assume that review of data is poor.
Following the progression of this issue
This report is based on a snapshot of the FracFocus data (Oct. 8, 2024). The number of disclosures with duplicates will likely change as operators add more disclosures and correct mistaken ones.
To see the most updated status of Pennsylvania’s disclosures with duplicate records and a comparison to a baseline snapshot, visit this link.
Duplicate records in Open-FF data sets
Users of FracFocus data via of Open-FF data sets can remove duplicates by using a standard data set. However, because we only have access to publicly available FracFocus data, we can’t be sure that the records we flag are truly mistakes. In some cases, our detection process may be flagging legitimate records. We leave it to the users of our data sets to weigh the costs and benefits of keeping vs. removing these records. In our experience, it is safest to remove the duplicates.
Title image credit: Becky Mansfield (modified by author)
- This work is part ofย an effortย atย The Open-FF Projectย to improve the quality and accessibility of the FracFocus data. โฉ๏ธ
