Cleaning validation and verification are based on the premise of risk management. Several regulatory and guidance documents make this clear. The International Conference on Harmonization's (ICH) guideline on risk management outlines several approaches to making and documenting risk-based decisions (1). It clearly states that risk management should be based on scientific knowledge and that personnel should evaluate the effect of potential failures on the patient. In addition, it notes that the levels of effort, formality (e.g., use of tools), and documentation of the quality risk-management process should be commensurate with the level of risk.
The US Code of Federal Regulations states that equipment and utensils shall be cleaned, maintained, and sanitized at appropriate intervals to prevent malfunctions or contamination that would alter the safety, identity, strength, quality, or purity of the drug product (2). In accordance with 21 CFR 211.67, ICH issued recommendations on equipment maintenance and cleaning (Q7A, Sections 5.20–5.26) for compliance and safety that include similar, but more detailed requirements (3).
The US Food and Drug Administration's 1993 guidance on cleaning inspections states that for a swab method, recovery should be established from the surface (4). The guidance contains no specific requirements about how to establish these recovery estimates, or the acceptance limits. It is up to the manufacturer to document the cleaning rationale (i.e., process and acceptance limits) for maintaining the quality and purity of the drug product being manufactured.
Cleaning validation and verification
Cleaning verification consists of routine monitoring (e.g., swab analysis) of equipment-cleaning processes. Cleaning validation confirms the effectiveness and consistency of a cleaning procedure and eliminates the need for routine testing (5). For example, cleaning limits are established to determine the maximum allowance of Product A that can carry over to Product B. The calculation of these limits is well documented and includes factors that increase the margin of safety to protect the patient (6, 7). Because it is not feasible to swab every square inch of the equipment, swabbing locations are chosen based upon factors such as how difficult the area is to clean, the size of the equipment, and the areas where product buildup is likely. All product-contact surfaces must be considered during cleaning verification to demonstrate that equipment is clean, and a recovery value is expected to be established for each product-contact surface during method validation. The recovery is used to correct the submitted swab result for incomplete removal from the surface and to compare it with the acceptance limit. This last aspect of risk management (i.e., establishing the surface recovery) is the focus of this article.
|
Design of experiments
Several variables (i.e., roughness average, material of construction, active ingredient, and spiked amount) were evaluated in a randomized fashion to prevent systematic bias that could be introduced by going from the lowest to the highest acceptance limit, from the smoothest to the roughest surface, or from one material of construction to the next. The initial design of experiments included two active pharmaceutical ingredients (APIs), three spiked acceptance-limit levels (i.e., 0.5, 5.0, and 50 μg/swab), seven surface types, four target roughness averages (Ra <>
The authors chose two APIs for this evaluation on the basis of their solubility profiles to represent the most- and least-soluble compounds a company would likely manufacture. Compound A, the less soluble, is slightly soluble in methanol and insoluble across the pH range, but Compound B is soluble in all solvents. In addition, Eli Lilly (Indianapolis, IN) identified Compound A as one of the most difficult compounds to clean from equipment, based on its low solubility and staining properties. A control (i.e., stainless steel 316L, 0.5 μg/swab, Compound A) was run each day that data were generated.
Equipment and operating conditions
|
Results and discussion
In this study, a single analyst evaluated the analytical swab recovery from a representative set of surfaces found in the CTM manufacturing and packaging areas. The surfaces were manufactured specifically for this study to have a broad range of Ras. In addition to Ra, the effect of the material of construction, acceptance limit, compound, and method variability also were evaluated. Based upon these data sets, the authors used a strategy involving three groups of materials to represent all of the surfaces in CTM operations. Merck and Co. used a similar strategy to establish five recovery groups (9). The authors expanded on Merck's strategy by adding a detailed study supporting the groups and an approach for determining the appropriate placement of new surfaces into pre-established groups.
Roughness average (Ra). The Ra targets listed above were difficult to achieve. The intermediate Ra values were significantly lower than the target values given in the design of experiments section above. Both intermediate Ra values, initially targeted for 75 and 125 Ra, were measured to be approximately 40 μin. Although the machining process at each level yielded visually different surfaces, the measured Ra changed little from surface to surface. The authors decided to proceed with the surfaces and define smooth surfaces as Ra <> 100 μin. This approach allowed for an assessment of the anticipated relationship between Ra and analytical recovery.
|
|
For both compounds, the Type III hard anodized aluminum exhibited the poorest recovery. The next logical break point grouped bronze and cast iron. The recovery of Compound B from bronze suggested that the material was representative of Group 1. The recovery of Compound A on bronze was lower and more variable, however, so the authors placed bronze into Group 2. For the majority of the surfaces, the recovery of Compound A was lower than that for Compound B at a given limit. In some cases, the recovery was approximately the same (i.e., of 5- and 50-μg spikes on cast iron, and of the 50-μg spike on Type III hard anodized aluminum). In addition, the predominant trend was that the average recovery of a compound increased as the spiked amount increased on a given material of construction. For example, the recovery of Compound B from stainless steel 316L was approximately 74%, 90%, and 95% at 0.5-μg, 5-μg, and 50-μg swabs, respectively.
|
Note that polymers were grouped together with metals and might not be considered to be similar on first pass. The SEM image of Lexan in Figure 4d, however, illustrated that the polymer surface was smooth, albeit with some surface debris, which prevented the loss of analyte. The polymer surface was grouped with stainless steel in Group 1. The SEM images were good supporting evidence that the groupings were logical based upon surface characteristics.
|
Analytical methods. The model and worst-case compound evaluation did not replace any analytical-method validation activities. Analytical recovery must be established for each compound in the portfolio, but not on all surfaces. If multiple limits are to be considered, or if a range of reporting is required, the lowest limit may be evaluated, and that recovery can be applied to all acceptance limits as a conservative estimate. In the analytical method, three recovery factors were presented: Group 1, Group 2, and Group 3. Methods could be validated for any surface within a group and could be considered representative. The authors chose stainless steel 316L because it is the most prevalent, and cast iron because it is a common material on a tablet press. Type III hard anodized aluminum is the only surface in Group 3. This strategy did not ignore any uncommon surfaces. It grouped them appropriately, swabbed them, and applied a representative recovery factor.
Variability. Method variability was evaluated by performing a control sample (Compound A, 6 replicates, 0.5-μg swab, stainless steel 316L, Ra = 3.5) each day. The mean recovery of the entire experiment was 52%. These data suggested that the swabbing ability of the analyst did not change over time. The standard deviation within a day typically was less than 6. The pooled-within-run standard deviation was 3.99 over the course of the experiments. This value was used as a criterion for grouping new surfaces. The day-to-day standard deviation was 15.34.
|
Suppose that new equipment incorporated three new materials: a crystalline thermoplastic polyester marketed under the trade name of Ertalyte (Quadrant Engineering, Reading, PA), stainless steel 420, and stainless steel 630. Without the grouping strategy in place, method revalidation would have to occur for all compounds handled by this piece of equipment. With the strategy in place, these surfaces are placed into groups based on model-compound recovery. No method revisions are required.
The analytical recovery of Compound A was evaluated for Ertalyte, stainless steel 420, and stainless steel 630 at the 5.0-μg/in.2 level. The authors used the validated method to evaluate the recovery of the three new surfaces compared with a representative surface from Group 1 (i.e., stainless steel 316L), Group 2 (i.e., cast iron), and Group 3 (Type III hard anodized aluminum). Recovery was evaluated for both the group representative and the new surfaces on the same two days with three replicates on each day. As an alternative, six replicates may be performed on the same day as the controls because the comparison of recovery is relative.
|
This calculation did not indicate a difference between a control surface and the new surface under evaluation if the recovery differed by less than 3.0%. This approach was conservative because Eq. 1 categorized a new surface into Group 2 if it differed from stainless steel by more than 3.0%, which could be viewed as a strict criterion. Because the results were obtained on the same days and runs, the authors believed that this approach was reasonable. When evaluating the recovery of a new surface, this strategy helps personnel to place each material into the appropriate group. The grouping starts with a comparison of the new surface average to that of Group 1 (stainless steel 316L) and continues sequentially. If the new surface recovery (NSR) is more than 3.0% less than that of the group reference, it is compared with the reference surface in the next lower group until a group is found with which it does not differ by more than 3.0%. If no such group is found, then the new surface forms a new, lower group. The procedure is as follows:
|
The placement of these two grades of stainless steel into Group 2 highlighted the conservative nature of this approach because their recoveries were only 4% and 8% less than that from stainless steel 316L. The recoveries were 12% and 9% greater than that from cast iron for stainless steel 630 and 420, respectively. Table II was updated to reflect this placement. Because the grouping strategy was conservative, it prevented the underestimation of recovery factors when assay values were reported and prevented the formation of additional groups for method validation unless the recovery value for a new surface is sufficiently low to warrant such an addition.
Conclusion
The authors' data-driven risk-management approach to cleaning verification methods uses analytical-recovery values for a model compound to place product-contact surfaces into groupings for analytical-method validation. The data generated during the studies supported the formation of three recovery groups to validate analytical swab methods. Groups 1–3 were represented by stainless steel 316L, cast iron, and Type III hard anodized aluminum, respectively. This approach allowed all surfaces to be considered during analytical-method validation and provided an objective mechanism to incorporate new surfaces into the strategy.
The benefits of this strategy are numerous. First, only three surfaces must be validated on each compound, which drastically minimizes the number of recovery values established to support the entire portfolio. Second, the strategy includes a way to add new materials of construction to the cleaning program if new equipment is purchased. Traditionally, all swab methods must be revalidated to incorporate the new surface. With this strategy in place, a model compound is evaluated, the new surface is grouped, and no changes to existing methods are required. Third, the strategy allows for a constant state of compliance. A relative recovery value is known for any material of construction for all equipment.
Because the grouping strategy is applied to a small fraction of the total surface area, no surface material of construction is ignored, each molecule undergoes a typical method validation, and the strategy places surfaces into groups conservatively. The authors believe that the strategy controls risks appropriately and that the data set given in this study scientifically supports the strategy of grouping materials of construction to support analytical methods within the cleaning program.
Acknowledgments
The authors would like to acknowledge the following colleagues at Eli Lilly: Gifford Fitzgerald, intern, for generating the swab-recovery data; Ron Iacocca, research advisor, for the SEM data; Sarah Davison, consultant chemist; Mike Ritchie, senior specialist; Mark Strege, senior research scientist; Matt Embry, associate consultant chemist; Kelly Hill, associate consultant for quality assurance; Bill Cleary, analytical chemist; and Laura Montgomery, senior technician, for their contributions and insightful suggestions throughout the project. In addition, Leo Manley, associate consultant engineer, provided the roughness measurements in support of this project.
No comments:
Post a Comment