Kim, Choi, Kim, Byun, and Jang: Extracting the Size Distribution of Gold Nanoparticles from UV-Visible Spectrum:an Artificial Neural Network Model

INTRODUCTION▼

Metallic nanoparticles (NPs) find wide applications in fuel cells, solar energy storage, growth catalysts and bio-medicines, just to name a few.^1,2 In particular, the NPs of gold have been extensively utilized in the diagnostics, treatment, catalysis, optical sensing, and biomarkers and building blocks in nanotechnology.³ Various properties of a single NP, such as the optical and electrical properties, sensitively depend on its size.^3,4

More often than not, it is an ensemble of NPs of different sizes that is utilized in experiment. It is therefore important to determine the particle size distribution (PSD) of NPs. An ex-situ method, such as the transmission electron⁵ or scanning electron⁶ microscopy, can be used to measure the sizes of NPs. Preferably however, a non-destructive in-situ detection of the size distribution is called for. At present, a popular in-situ method is the dynamic light scattering (DLS)^7-9 which extracts the PSD by utilizing the Stokes-Einstein relation. DLS technique can be used to measure particle size for particles with a monodisperse size distribution. However, it becomes challenging to measure for particles with polydisperse or large-size distribution because big particles can screen smaller objects.^8,10,11

UV-visible spectroscopy is widely used in the study of nanomaterials as a diagnostic of nanoparticle formation. It is a very useful tool employed for the characterization, estimation of the sies of NPs, concentration, and aggregation level. UV-visible spectrometers can be found in many laboratories, the analysis does not change the sample, and the time needed for registration of the spectrum is very short.^4,12-15 Several attempts have been made to derive the PSD from a UV-visible extinction spectrum. Kim et al. assumed that the size distribution of gold NPs follows a Gaussian distribution and demonstrated that PSD can be derived solely from UV-visible spectra under this assumption. Calculations considering PSD were more experimentally accurate than calculations that ignored size distribution.³ Haiss et al. demonstrated the predictability of gold NPs sizes using the mean-free-path-corrected Mie theory.⁴ Doak et al. also utilized Mie theory to predict the size distribution of gold NPs from UV-visible spectra. In this study gold NPs were considered to have a broad size distribution, which resulted in better agreement with experimental data compared to theoretical calculations that only considered a single particle size.² Karlik et al. showed the multilayered perceptron is efficient for the prediction of the diameter of gold nanospheres. The principal component dimension reduction technique was used during the preprocessing of UV-visible spectra. The training set consisted of 25 spectra calculated according to the Mie theory for nanospheres with diameters from 2 to 50 nm. The approach was validated for theoretical spectra of particles with 5, 7, 10, 15, and 30 nm diameter.¹³

Herein, we propose a novel alternative method to extract the PSD from the UV-visible extinction spectrum. The present method employs an artificial neural network (ANN) model of the PSD for a colloidal gold NPs. By utilizing the extensive dataset of UV-visible spectra generated from the exact Mie theory, we trained the ANN model to predict the PSD solely from the UV-visible spectra. The present ANN model is favorably tested against a finite set of experimental spectra collected from the literature.

COMPUTATIONAL METHODS▼

Preparation of the UV-Visible Spectra for Training the ANN Model

In order to train the present ANN model, we prepared the UV-visible spectra of gold NPs whose PSDs are known. We generated the discretized distribution of the radii of NPs, P(r), which ranged from r = 1 to 200 nm with an interval of 5 nm (40 nodes in the output layer). P(r) was sampled from the random numbers obeying a Gaussian distribution with a mean μ and standard deviation σ. The mean radius of NP, μ, was varied from 5 nm to 150 nm with an interval of 1 nm. For a given μ, we have chosen various σs ranging from 0.05 μ to 0.125 μ, in order to consider diverse PSDs. To each Gaussian P(r) generated as above, we added a small random noise uniform sampled from the range between -0.2P(r) and 0.2P(r). This addition of random noises was done to mimic the small noises in the experimental size distribution, and the other was to double the number of data for training the present ANN model.

For a given P(r) generated as above, the extinction spectrum Q_ext(λ) at a specific wavelength of λ was calculated as

(1)

Q e x t λ = Δ r ∑ i = 1 N P r i Q e x t λ , r i ,

where Q_ext(λ, r_i) is the extinction coefficient for a specific radius of NP. We discretized the range of radii by using the bin of Δr = 0.5 nm and N = 200. λ was varied from 400 to 700 nm with an increment 1 nm, in order to include the peak of a typical extinction spectrum (located between 500 and 600 nm). The absorbance, A, is related to Q_ext(λ) through the following relation,

(2)

A = π r 2 Q e x t l N ln 10 ,

where the path length of light l varies from equipment to equipment and it is difficult to know exactly the number density N of the samples.² To solve these, the value of spectrum at each wavelength Q_ext(λ) was divided by the difference between the maximum and minimum of the spectrum.

The size-dependent extinction coefficient Q_ext(λ, r) of a gold NP was calculated by using Mie theory,¹⁶

(3)

Q e x t r , λ = 1 x 2 ∑ n = 1 ∞ 2 n + 1 R e a n + b n ,

where λ is the wavelength of an incident light. a_n and b_n are the scattering coefficients given by

(4)

a n = m ψ n m x ψ n ' x − ψ n x ψ n ' m x m ψ n m x ζ n ' x − ζ n x ψ n ' m x

and

(5)

b n = ψ n m x ψ n ' x − m ψ n x ψ n ' m x ψ n m x ζ n ' x − m ζ n x ψ n ' m x

In equations (4) and (5), we introduced a dimensionless size parameter x = 2πn_m r/λ. Also, Ψ_n and ξ_n are the Ricatti-Bessel functions given by

(6)

ψ n x = x π 2 1 2 J n + 1 / 2 x

and

(7)

ζ n x = x π 2 1 2 H j + 1 / 2 1 x

where J_n+1/2(x) and H j + 1 / 2 1 x are the Bessel j and primary Hankel functions, respectively.¹⁷ m is the relative refraction coefficient defines as m = n_p/n_m where n_m and n_p the real refractive index of the solvent and complex refractive index of the NP, respectively.

We took the solvent to be water, and it was assumed that the refractive index of the solvent did not depend on the wavelength of light, therefore using n_m = 1.333. The refractive index of gold, taken to be dependent on the frequency of an incident light, is taken from the data reported by Johnson and Christy.¹⁸ We used a cubic spline interpolation¹⁹ to calculate the values of refractive indices intermediate between the discrete raw data reported by Johnson and Christy. We generated total of 1,752 spectra by using an in-house Python code.

Artificial Neural Network Model

We divided the dataset of extinction spectra into the training, validation, and testing sets by randomly taking 60, 20, and 20% of 1,752 spectra in total. We used the validation set to determine the presence of an overfitting. The present ANN model, illustrated in Fig. 1, is made of three hidden layers. The input layer consists of 301 nodes each of which contains the discretized spectrum of the solution of gold NPs where the wavelength of an incident light ranges from 400 to 700 nm with a 1 nm interval. Each hidden layer consists of 256 nodes. The output is discretized PSD of nanoparticles radii, with each node corresponding to the fraction of particles falling into a specific size bin. We used the ReLu function for an activation function, except for the output layer where the identity function was used.

Figure1.

Scheme for the present ANN model to extract the size distribution of NPs from an input extinction spectrum. Each of 301 nodes in the input layer contains the extinction cross-section value for the wavelength of an incident light. Each of 3 hidden layers consists of 256 nodes. Each of 40 nodes in the output layer gives discretized PSD of gold NPs.

We trained the present ANN model by using the back-propagation algorithm and found the optimized values of weights. We employed an Adam optimizer with a learning-rate of 10^-6 to minimize cost function defined as the mean-squared error.²⁰ In order to prevent overfitting, we applied a dropout method where the nodes in the hidden layers were dropped out with a rate of 0.7. The present ANN model was trained in 200,000 steps, and the cost values were saved for every 10,000 steps.

We calculated the R² value, root-mean-square error (RMSE), and mean-absolute error (MAE) as the performance indicators of the ANN model. R² was defined as R 2 = ∑ i = 1 n d ^ l − d ¯ / ∑ i = 1 n d i − d ¯ 2 , where i and n represent the index and the total number of data points, respectively. d ^ l and d_i are the ith value of PSD, P(rⁱ), from the ANN model and the Gaussian distribution with a random noise, respectively. d is the mean value of d i = 1 n ∑ i = 1 n d i . The MAE and RMSE were defined as MAE = 1 n ∑ i = 1 n d i − d ^ l and RMSE = 1 n ∑ i = 1 n d i − d ^ l 2 . All the ANN methods were implemented by using TensorFlow package.²¹

Experimental Data

We further tested the present ANN model against 41 experimental data. The experimental UV-visible spectra were obtained from the nanoComposix²² and Cytodiagnostics²³ web pages, and from the paper reported by López-Muñoz, G.A., et al..²⁴ After that, the graphic images of the experimental spectra were converted to digital numbers by using Engauge Digitizer.²⁵ Some experimental data are given size distributions as mean and standard deviation of diameters, mean diameters were calculated from the predicted size distributions to compare the ANN model with experimental value.

RESULTS AND DISCUSSION▼

Fig. 2 shows the variation in the cost function as the training progresses in 200,000 steps. If the error of prediction using the training set is larger than the error of the prediction using the validation set, the learning is stopped. In this study, no learning interruption occurred. The blue line and orange line show the training and validation cost respectively. Training and validation costs decreased from 75.9811 and 75.3383 at 10,000th step to 13.2096 and 8.3396 at 200,000th step, respectively. Since the training cost decreased smoothly during training, it can be seen training of ANN finished well. In addition, the validation cost decreased as smoothly as training cost, and showed lower values during training, so it can be considered that there was no overfitting.

Figure2.

Change in the cost function with the progression of training. Drawn as the broken and dotted lines are the variations in cost for training and validation cost, respectively.

As illustrated in Fig. 3, the present ANN model trained well against the theoretical calculation. The ANN model gave the R2 values of 0.7949 and 0.7945 for the training and test sets, respectively (Table 1). The nearly identical values of R2 for training and test sets show the absence of overfitting. The RMSEs of the ANN model were 2.810 and 2.7639 for the training and test sets, respectively (Table 1). The MAEs were 0.5708 and 0.5740 for the training and test sets, respectively. However, discrepancies were observed at high fraction values. These higher fractions typically occurred in sharp PSDs and corresponded to smaller particle sizes (~30 nm), since the standard deviation σ was expressed as a ratio to the mean diameter during PSD generation. The discrepancies in these regions indicate lower prediction accuracy for smaller particles, which may be attributed to minimal variation in the extinction spectra within this small size range. This problem can be solved by making the intervals of size distribution smaller than 5 nm in a small size section, but in very small cases the prediction may not be as good.

Figure3.

Performance of the present ANN model vs. the original data for the training (a) and test (b) datasets. Each data point indicates the fraction of particles falling into a specific size bin. The training and test datasets consisted of 1050 and 351 data, respectively. As a reference, the dotted lines represent the exact match between the ANN and theoretical values.

Table1.

Training results of the present artificial neural net-work model

	RMSE	MAE	R2 Value
Train	2.801	0.5708	0.7949
Test	2.7639	0.5740	0.7945
\|Train-Test\|	0.0371	0.0032	0.0004

We investigated whether the ANN model predicts well in the experiment. The RMSE and MAE were 7.944 nm and 5.280 nm, respectively. Table 2 shows error and correlation value between the mean diameter from experiment and the ANN model, and Fig. 4 shows the correlation between them. In Fig. 4, x and y axis represent the mean diameters from experiment and the ANN model, respectively. Overall, they show high correlation, and especially when the average size is 45 nm or more it can be seen that the prediction is very good. However, in the case below size 45 nm it has relatively high RMSE and MAE values although high correlation coefficient.

Figure4.

Mean diameters of experimental NPs plotted vs. those predicted from the ANN model, drawn as diamonds. As a reference, the dotted line represents the exact match between the ANN and theoretical values. The R² value for the mean diameter is 0.932.

Table2.

RMSE, MAE, R2 values between the calculated mean diameters based on predicted size distribution and the mean diameters from experiment

	RMSE (nm)	MAE (nm)	R2 Value
Total	7.944	5.280	0.932
Size < 45 nm	11.277	8.366	0.393
Size > 45 nm	3.602	2.865	0.960

Table 2 shows the predicted average size by ANN and absolute error and relative error between predicted and experimental values. It can be considered as follows that the prediction results in the section below the size of 45 nm are not good. Note the present ANN model showed lower prediction accuracy for smaller particle sizes. Additionally, in the previous reported study by Haiss et al., when the size was small (below 25 nm), the peak position of the experimental surface plasmon resonance (SPR) was smaller than the theoretical calculation.⁴ Since the ANN model used in this study predicts the size distribution of gold nanoparticles using only theoretical UV-visible extinction spectra, the difference in the shape between experimental and theoretical spectra can be considered as the reason. It seems that this can be solved by training the experimental spectrum in the case of small sizes.

Figure5.

Examples of predicted PSD using ANN model which matches well with experimental data. The experimentally measured mean diameter and standard deviation for each sample were 52 nm and 5 nm(a), 60 nm and 6 nm(b), 71 nm and 8 nm(c), 80 nm and 5 nm(d), and 98 nm and 11 nm(e).

CONCLUSION▼

In this study, Artificial neural network (ANN) that predicts size distributions of colloidal gold nanoparticles using UV-visible extinction spectrum is introduced. Extinction spectra of colloidal gold nanoparticles which considering size distribution were calculated by using Mie theory. ANN was trained using calculated extinction spectra as input factors and size distribution of gold nanoparticles as out-put factors. For comparison with the experiment, prediction size distributions using 41 experimental data were attempted. The average sizes were calculated from the predicted size distribution and compared with experimentally measured values. Overall, it showed high correlation, especially when the average size was more than 45 nm. However, in the case of size smaller than 45 nm, they showed a high error value although the high correlation coefficient. This can be seen since small particles shows differences in the shape of UV-visible extinction spectrum of theory and experiment, and the size distributions were gathered at 5 nm intervals. If the prediction in small sized section is successfully done, it will lead to the development of a non-destructive measurement method that effectively predicts the particle size using UV-visible extinction spectrum.

Acknowledgements▼

This work was supported by a 2-Year Research Grant of Pusan National University

References▼

A. De Vyt R. Gijbels H. Davock C. Van Roost I. Geuens Journal of Analytical Atomic Spectrometry199914499 [CrossRef]

J. Doak R. K. Gupta K. Manivannan K. Ghosh P. K. Kahol Physica E: Low-dimensional Systems and Nanostructures2010421605 [CrossRef]

H. Kim J. Jang S. Leton Chandra Journal of the Korean Chemical Society200751407 [CrossRef]

W. Haiss N.T.K. Thanh J. Aveyard D. G. Fernig Anal. Chem2007794215 [CrossRef]

L. Reimer Transmission Electron Microscopy: Physics of Image Formation and MicroanalysisSpringer2013

L. Reimer Measurement Science and Technology2000111826

B. J. Berne R. Pecora Dynamic Light Scattering: With Applications to Chemistry, Biology, and PhysicsCourier Corporation2000

B. G. Zanetti-Ramos M. B. Fritzen-Garcia C. S. de Oliveira A. A. Pasa V. Soldi R. Borsali T. B. Creczynski-Pasa Materials Science and Engineering: C200929638 [CrossRef]

H. Jans X. Liu L. Austin G. Maes Q. Huo Analytical Chemistry2009819425 [CrossRef]

10.

B. N. Khlebtsov N. G. Khlebtsov Colloid Journal201173118 [CrossRef]

11.

E. Tomaszewska K. Soliwoda K. Kadziola B. Tkacz-Szczesna G. Celichowski M. Cichomski W. Szmaja J. Grobelny Journal of Nanomaterials2013113081

12.

J. C. Martínez N. A. Chequer J. L. González T. Cordova Nanosci. Nanotechnol20122184

13.

B. Karlik M. F. Yilmaz M. Ozdemir C. T. Yavuz Y. Danisman Plasmonics202116147 [CrossRef]

14.

V. Amendola M. Meneghetti The Journal of Physical Chemistry C20091134277 [CrossRef]

15.

S. J. ldenburg R. D. Averitt S. L. Westcott N. J. Halas Chemical Physics Letters1998288243 [CrossRef]

16.

C. F. Bohren D. R. Huffman Absorption and Scattering of Light by Small ParticlesJohn Wiley & Sons2008

17.

M. Abramowitz I. A. Stegun Handbook of Mathematical Functions With Formulas, Graphs, and Mathematical TablesUS Government printing office1964

18.

P. B. Johnson R.-W. Christy Physical Review B197264370 [CrossRef]

19.

S. McKinley M. Levine College of the Redwoods1998451049

20.

D. E. Rumelhart G. E. Hinton R. J. Williams Nature1986232533

21.

M. Abadi A. Agarwal P. Barham E. Brevdo Z. Chen C. Citro G.S. Corrado A. Davis J. Dean M. Devin S. Ghemawat I. Goodfellow A. Harp G. Irving M. Isard Y. Jia R. Jozefowicz L. Kaiser M. Kudlur J. Levenberg D. Mane R. Monga S. Moore D. Murray C. Olah M. Schuster J. Shlens B. Steiner I. Sutskever K. Talwar P. Tucker V. Vanhoucke V. Vasudevan F. Viegas O. Vinyals P. Warden M. Wattenberg M. Wicke Y. Yu X. Zheng 2016arXiv preprint arXiv:1603.04467

22.

nanoComposixhttps://nanocomposix.com/

23.

Cytodiagnosticshttps://cytodiagnostics.com

24.

G. A. López-Muñoz J. A. Pescador-Rojas J. Ortega-Lopez J. S. Salazar J. A. Balderas-López J. Nanoscale Research Letters20127667 [CrossRef]

25.

M. Mitchell B. Muftakhidinov T. Winchen J. Jędrzejewski-Szmek Badger G. Zenodo Engauge Digitizer Software2016

Journal Information

Article Information

Extracting the Size Distribution of Gold Nanoparticles from UV-Visible Spectrum:an Artificial Neural Network Model

Abstract

INTRODUCTION▼

COMPUTATIONAL METHODS▼