A global Jacobian smoothing algorithm for nonlinear complementarity problems

SÁNCHEZ, WILMER; PÉREZ, ROSANA; MARTÍNEZ, HÉCTOR J.; SÁNCHEZ, WILMER; PÉREZ, ROSANA; MARTÍNEZ, HÉCTOR J.

doi:10.18273/revint.v39n2-20210004

Services on Demand

Journal

Article

Indicators

Cited by SciELO
Access statistics

Revista Integración

Print version ISSN 0120-419XOn-line version ISSN 2145-8472

Integración - UIS vol.39 no.2 Bucaramanga July/Dec. 2021 Epub Apr 18, 2022

https://doi.org/10.18273/revint.v39n2-20210004

Artículos originales

A global Jacobian smoothing algorithm for nonlinear complementarity problems

Un algoritmo global con jacobiano suavizado para problemas de complementariedad no lineal

WILMER SÁNCHEZ^a

ROSANA PÉREZ^b

HÉCTOR J. MARTÍNEZ^c

^{^a} Universidad del Cauca, Departamento de Matemáticas, Popayán, Colombia. wiLnersanchez@unicauca.edu.co

^{^b} Universidad del Cauca, Departamento de Matemáticas, Popayán, Colombia. rosana@unicauca.edu.co

^{^c} Universidad del Valle, Departamento de Matemáticas, Cali, Colombia. hector.martinez@correounivalle.edu.co

Abstract.

In this paper, we use the smoothing Jacobian strategy to propose a new algorithm for solving complementarity problems based on its reformulation as a nonsmooth system of equations. This algorithm can be seen as a generalization of the one proposed in [¹⁸]. We develop its global convergence theory and under certain assumptions, we demonstrate that the proposed algorithm converges locally and, q-superlinearly or q-quadratically to a solution of the problem. Some numerical experiments show a good performance of this algorithm.

MSC2010: 49M15, 90C06, 90C30.

Keywords: Nonlinear complementarity problems; complementarity function; generalized Newton methods; Jacobian smoothing method; global convergence; superlinear convergence; quadratic convergence

Resumen.

En este artículo, usamos la estrategia del jacobiano suavizado para proponer un nuevo algoritmo para resolver problemas de complementariedad no lineal basado en su reformulación como un sistema de ecuaciones no lineales. Este algoritmo puede verse como una generalización del propuesto en [¹⁸]. Desarrollamos su teoría de convergencia global y bajo ciertas hipótesis, demostramos que el algoritmo converge local y q superlineal o q cuadráticamente a la solución del problema. Pruebas numéricas muestran un buen desempeño del algoritmo propuesto.

Palabras clave: Complementariedad no lineal; función de complementariedad; método de Newton generalizado; Jacobiano suavizado; convergencia global; convergencia superlineal; convergencia cuadrática

1. Introduction

The Nonlinear Complementarity Problem, (NCP), which in some contexts is synonymous with system in equilibrium, arises among others, in applications to Physics, Engineering, and Economics [³], [⁸], [¹⁴], [¹⁹]. The problem is to find a vector x Є ℝⁿ such that x ≥ 0, F(x) ≥ 0 and x ^T F(x) = 0, with F: ℝⁿ → ℝⁿ continuously differentiable. Here, a vector is nonnegative if all its components are nonnegative. A widely used technique for solving the NCP is to reformulate it as a system of nonlinear equations using a complementarity function φ: ℝ² → R such that

Then, we consider Ф: ℝⁿ - ℝⁿ and define the nonlinear system of equations

which is a nondifferentiable system due to the lack of smoothness of φ. From (1), a vector x* is a solution of (2), if, and only if, x* it is a solution of the NCP. To solve (2) and thus, to solve the NCP, a nonsmooth algorithms type Newton [²⁷], [²⁹] and quasi-Newton [²⁰], [²¹], among others [¹], [⁶], [²²], [²⁶], [³¹] have been proposed. The natural merit function [²⁴] Ψ: ℝⁿ → ℝ, defined by , is used in the globalization of these methods. Thus, Ψ(x) is minimized in ℝⁿ. These methods use the concept of generalized Jacobian [¹⁰] defined by the set

for a Lipschitz continuous function G: ℝⁿ → ℝⁿ , in x, where D _G denotes the set of all points where is G is differentiable and hull {A} is the convex envelope of A. This set is nonempty, convex, and compact [¹¹]. Usually, the set (3) is difficult to compute, for this reason, we use the overestimation ∂G(x) ^T ⊆ ∂G ₁ (x) x… x ∂G _n (x) given in [¹¹], where the right side, for short BeG(x) ^T [¹⁸], called C-sub differential of G at x, is the set of matrices ℝ^nxn , whose i-th column is the generalized gradient of the i-th component of the function G.

Another strategy to solve (2) is to smooth the Jacobian proposed in [⁹] and called Jacobian smoothing in [¹⁸]. The general idea of methods using this strategy is to approximate Ф by a smooth function Ф_μ : ℝⁿ → ℝⁿ , where μ > 0 is the smoothing parameter, and then solving a sequence of smooth nonlinear equation systems,

with μ going to zero and φ _μ a smoothing function of φ used in (2). The system (4) is solved at each iteration by solving the mixed Newton equation) Ф’_μ(x_k)s_k = - Ф (x_k).

In [¹⁸], the authors present a new algorithm with good numerical performance based on the Jacobian smoothing strategy to solve the NCP, by reformulating it as a system of nonlinear equations using the Fischer-Burmeister complementary function defined by . Motivated by the results obtained with this function, and since it is a particular case of the following family of complementary functions [¹⁷]

corresponding to λ = 2, in this paper we use this strategy with the family (5) to propose a new algorithm that solves complementarity problems by reformulating it as a nonlinear, nondifferentiable system of equations. This algorithm can be seen as a generalization of the one proposed in [¹⁸] to any member of family φλ, with λ in (0, 4). Under certain hypotheses, we demonstrate that the proposed algorithm converges local and, q-superlinear or q-quadratically to a solution of the complementarity problem. In addition, we analyze the numerical performance of the proposed algorithm.

The organization of this paper is as follows. In Section 2, we present the Jacobian smoothing strategy applied to a function Фλ, we described the Jacobian matrix of its smoothing and find λ Section 3, we present some preliminary results that we use to develop the convergence theory of the algorithm proposed. In Section 4, we present a new Jacobian smoothing algorithm to solve nonlinear complementarity problems that generalize the one presented in [¹⁸] to all members of family (5). Moreover, we prove that our algorithm is well defined. In Section 5, we develop its global convergence theory. In Section 6, under some hypotheses, we prove that the algorithm converges local and q-superlinear or q-quadratically to the solution of the complementarity problem. In Section 8, we analyze the numerical performance of the proposed algorithm. Finally, In Section 9, we present our concluding remarks.

2. Smoothing Jacobian strategy for Фλ(x) = 0

We consider the reformulation (2) of the NCP as a nonsmooth nonlinear system of equations. If φ = φλ, the family (5), we obtain the system Фλ(x) = 0. The basic iteration of a generalized Newton method to solve this system has the form,

where H _k Є ∂Фλ(x ^k ) or H _k Є ∂_c Фλ(x ^k ). Here, we use H _k Є ∂_c Фλ (x ^k ). In order to define a smoothing Jacobian method for Фλ (x) = 0, we follow the basic idea given in [¹⁸] and we consider smoothing φλ as proposed in [⁴]: for all λ Є (0,4) and μ > 0,

As expected, the distance between φλ and its smoothing function, φλμ, is upper bounded by a constant that depends on the parameters λ and μ. This is a particular case of the following proposition that will be useful in the convergence theory.

Proposition 2.1. Function φλμ satisfies the inequality

for all (a,b) Є ℝ² and μ ₁ ,μ ₂ μ 0. In particular, , for all (a,b) Є ℝ², λ Є (0, 4) and μ ≥ 0.

Proof. Let (a, 6) Є ℝ ², μ ₁ and μ ₂ nonnegatives, such that μ ₁# μ ₂.

Observe that . Then

In particular, if μ ₁ = μ and μ ₂ =0 then .

From (7), we define the function φλμ: ℝⁿ → ℝⁿ by,

The next proposition gives an upper bound for the distance between φλ and its approximation φλμ.

Proposition 2.2. The function φλμ satisfies the following inequalities

for all x Є ℝ ⁿ, and μ, μ ₁ , μ ₂ ≥ 0, where .

Proof. Using the Euclidean norm and Proposition 2.1,

The second part of the proposition is obtained by choosing μ ₁ = μ and μ ₂ = 0.

Now, the basic iteration of a smoothing Jacobian method for solving φλ (x) = 0 is as follows

where Ф’ _λμ (x ^k ) is the Jacobian matrix of Ф’_λμ at x ^k . From (6) and (9), we have that this method solves the reformulation Ф _λ (x) = 0, replacing H _k Є ∂_C Ф _λ (x), with an approximation Ф_’λμK (x ^k ). Thus, these methods can be seen as quasi-Newton. The Jacobian matrix Ф_’λμ (x) is given by

where , with

where .

The next proposition guarantees that, if j tends to zero, the distance between Ф_’λμ (x) and ∂_C Ф _λ (x) also tends to zero. Thus, it makes sense to replace the Newton iteration (6) with (9).

Proposition 2.3 ([4]). Let x Є ℝⁿ be arbitrary but fixed. Then we .

From this proposition, for every δ > 0 there exists such that dist , for all . In our algorithmic proposal is very important to obtain an expression of J since it gives an upper bound of μ. For this, we proceed as in [¹⁸] and we obtain the following proposition.

Proposition 2.4 ([³⁰]). Let x Є ℝ n be arbitrary but fixed. Assume that x is not a solution of the NCP and define

with β(x) = {i: xi = 0 = Fi(x)} . Let δ > 0 and define

Then distp , for all μ such that .

Since our algorithmic proposal is a global algorithm for solving the NCP, indirectly through its reformulation Фλ(x) = 0, we consider the natural merit function Ψλ : ℝ n → ℝ defined by . The idea is to solve the NCP by minimizing

Ψλ. But, there is a problem: the direction computed from (9), is no necessarily a descent direction for Ψλ at xk. Following [¹⁸], an alternative is to use this direction to reduce the related merit function

3. Preliminaries

In this section, we present some definitions, propositions, and lemmas related to the nonlinear complementarity which will be useful in the development of the convergence theory of our algorithmic proposal.

Definition 3.1. Let A Є ℝ nxn. The Frobenius norm of A is defined by

Definition 3.2. Let A Є ℝ nxn and M ⊆ ℝ nxn be a nonempty set of matrices. The distance between A and M is defined by, dist(A, M) = inf BeM {||A - B||}, where || • || is a matrix norm.

Definition 3.3. Let {tk} ⊆ ℝ, if there exists a number U such that

1. For every Є > 0 there exists an integer N such that k > N implies tk < U + Є.

2. Given Є > 0 and m > 0, there exists an integer k > m such that tk > U - Є.

Then U is called the superior limit of {tk} and we write U = limsuxpk →∞ tk.

Related with a solution x* of the NCP, we have the following index sets,

When β # 0, x* is called a degenerate solution.

Definition 3.4. Let x* be a solution of the NCP.

1. If all matrices in ∂ BФλ(x*) are nonsingular, x* is called a BD-regular solution.

2. If the submatrix ^¹ F'(x*)ââ is nonsingular and the Schur complement

is a P-matrix^², x* is called an R-regular solution.

Definition 3.5. Given the sequences {αk} and {βk} such that βk ≥ 0, for all k, and αk = O(βk), if there exists a positive constant M such that |αk| ≤ Mβk, for all k. We write αk = o(βk), if αk/βk = 0..

Proposition 3.6 ([¹⁷], [²⁸]). Assume that limk→∞ {xk} C ℝ n is a sequence converging to x*. Then,

1. The function Φλ is semismooth, so ||Φλ(xk)−Φλ(x∗)−Hk(xk−x∗) || = o(∥xk−x∗∥), for any Hk ∈ ∂CΦλ(xk).

2. If F is continuously differentiable with locally Lipchitz Jacobian then Φλ is strongly semismooth; that is, ||Φλ(xk) − Φλ(x∗) − Hk(xk − x∗)||= O(||xk − x∗||2), for any Hk ∈ ∂CΦλ(xk).

Proposition 3.7 ([¹⁸]). Given x Є ℝ n and μ> 0 arbitrary but fixed, then

Proposition 3.8 ([²³]). If x* Є ℝ n is an isolated accumulation point ^³ of a sequence {xk} ⊆ ℝ n such that q||xk+1 - xk ||} L converges to zero, for any subsequence {xk}L converging to x* . Then xk converges to x*.

Proposition 3.9 ([¹³]). Let G: ℝ n → ℝ n be locally Lipschitz and x* Є ℝ n such that G(x*) = 0, satisfary all matrices in ∂ G(x*) are nonsingular and assume that there exist two sequences {xk} ⊆ ℝ n and {dk} ⊆ ℝ n with

Then ||G(xk + dk)|| = o (||G(xk)||) .

Proposition 3.10 ([³⁰]). Let x, z Є ℝ n such that ||x - z|| ≤ α ||x||, α Є (0,1). Then xT(x - z) ≤ α ||x||2.

Proposition 3.11. [¹⁷] Let x Є ℝ n arbitrary. Then

where Dα = diag(α1(x), .. ., α n(x)), Db = diag(b1(x),.. ., bn(x)) are diagonal matrices in ℝ nxn.

. If (xi,F(xi)) = (0, 0), then

■ If (xi,F(Xi)) = (0, 0) then αi(x) = σi - 1 and bi(x) = Pi - 1, for any (σi,pi) Є ℝ 2 such that .

Proposition 3.12 ([¹⁷]). The merit function Ψλ is continuously differentiable and ∇ Ψλ(x) = V TΦλ(x), for any V ∈ ∂CΦλ(x).

Lemma 3.13 ([³⁰]). Let μ > 0 and λ Є (0,4). The function h: (0, ∞) → ℝ, defined by is strictly decreasing.

Lemma 3.14 ([³⁰]). Let [xk} ⊆ ℝ n and {μk} ⊆ ℝ two sequences such that {xk} → x* for some x* Є ℝ n and {pk} → 0. Then limk→∞ ∇ Ψλ μk (xk) = ∇ Ψλ (x*) and lim

Lemma 3.15 ([¹⁸]). Let {xk}, {dk} ⊆ ℝ n and {tk} ⊆ ℝ with xk+1 = xk + tk dk such that xk → x*, dk → d* and {tk} → 0 for some vectors x*, d* Є ℝ n. Moreover, consider {μk} ⊆ ℝ a sequence such that {μk} - 0. Then

4. New algorithm

In this section, we propose a new algorithm for solving the NCP. Basically, the proposal is a generalization of the smoothing Jacobian method proposed in [¹⁸], and its basic iteration is given in (9). To guarantee the algorithm to be well-defined for an arbitrary NCP, we use a gradient step for Ψλ when linear system (11) solution does not exist or gives a poor descent direction for Ψλ μ.

Algorithm 1. (Smoothing Jacobian method),

SO. .

S1. If ||∇ Ψλ(xk)|| ≤ Є, stop.

S2. Find dk Є ℝ n solving the linear system of equations,

If the system (11) is not solvable or if the condition

is not satisfied, set

S3. Find the smallest mk Є {0,1, 2,...} such that

if dk is given by (11), and such that

if dk is given by (13). Set tk = θmk and xk+1 = xk + tk dk.

S4. If

set (βk+1 = ||Фλ(xk+1)|| and choose μk+1 such that,

If (16) is not satisfied and dk = -∇Фλ(xk) then set βk+1 = βk, and choose μk+1 such that

If none of the above conditions is met, set βk+1 = βk and μk+1 = μk.

S5. Set k = k + 1 and return to S1.

In SO, we introduce the parameters and initialize the variables. In S1, it is natural to think that the algorithm stops when the gradient of the merit function becomes too small. However, in the implementation, we add other classic criteria like maximum number of allowed iterations and one that prevents the algorithm from no finding an adequate step size. In S2, the calculus of a search direction is perhaps the main step of the algorithm: we find dk by mixed Newton equation (11). In case of (11) is not solvable or the direction does not satisfy descent criteria (12), we use the steepest descent direction of the merit function (13), which guarantees a descent direction of Ψλ.

After finding the descent direction, the algorithm is globalized in step S3 using a line search which depends on this direction: if it is obtained by the Newton equation (11), the line search is made using (14), which is also used in [⁹] as a global strategy. On the other hand, if it is the steepest descent direction (13), the line search (15) is type Armijo [¹²]. The update of μk, in S4, starts with the condition (16) used in en [⁹]. If it is satisfied, μk is updated by (17). This guarantees that the distance between the subdifferential and smoothing Jacobian is small, and that μk tends to zero. If (16) is not satisfied, μk is updated by (18). The conditions (18) and (16) are essential to guarantee the algorithm to be well defined and to converge globally. For the convergence analysis of the algorithm, we define the following set

which is motivated by condition (16). Moreover, K = {0} U K1 U K2 (disjoint union), where

The next proposition is useful to demonstrate that the algorithm is well-defined. Its demonstration is analogous to that given in [¹⁸].

Proposition 4.1. The following inequalities hold:

The following result guarantees that Algorithm 1 is well-defined: it ends in a finite number of steps.

Proposition 4.2. Algorithm 1 is well-defined.

Proof. It is essentially the same given in [⁹]. It is sufficient to prove that mk in S3 is finite, for all k Є ℕ. In effect, if a descent direction is given by (13) then the condition Armijo-type guarantees that mk is finite [¹⁸]. On the other hand, let assume that the direction is given by (11). Since Ψλμk is continuously differentiable and its gradient is given by , by Taylor's Theorem, we have

Using the Newton's direction (11) in (20),

where the last inequality follows from Propositions 3.10 and 4.1. On the other hand,

Therefore, from (20), (21) and (22), . This allows to conclude that exists a nonnegative integer mk such that (14) is satisfied.

5. Global convergence

In this section, we present the convergence results of the algorithm proposed. Basically, we prove that any accumulation point of the sequence generated by Algorithm 1 is a stationary point of Ψλ. These results generalize those presented in [¹⁸] for an algorithm using the Fischer-Burmeister function, which in turn is based on the theory developed in [⁹] for smoothing Newton-type methods. The proofs of the theorems and lemmas in this section and in the next section use the same steps as the proofs of the corresponding theorems in [¹⁸].

We start with two lemmas whose proofs use the updating rules of Algorithm 1.

Lemma 5.1. Assume that a sequence generated by Algorithm 1 has an accumulation point x*, which is a solution of the NCP. Then the index set K defined by (19) is infinite and the sequence {μk} → 0.

Proof. Assume that K is finite. For the updating rule (16) there exists an integer k0, such that βk = βk0. Moreover, ||Фλ(xk+1) || > ηβk0, for all k ≥ k0, which contradicts that x∗ is a solution of the NCP.

Lemma 5.2. The following statements hold:

Proof. Part a) is verified using the updating rule (14), and part b) is true if (16) is satisfied. In addition, we use Proposition 2.2 and some algebraic manipulations. 0

The next corollary is an important consequence of the previous result. Corollary 5.3. If .

Proof. The proof is the same as Corollary 5.3 in [¹⁸].

The two following results are technical lemmas; they give some bounds that we use in the proof of Proposition 5.6. Details of the proofs are given in [30].

Lemma 5.4 ([³⁰]). Assume that K is ordered like this k0=0 < k1 < k2 < .... Let k Є ℕ be arbitrary but fixed and kj the largest integer in K such that kj ≤ k then

Lemma 5.5 ([³⁰]). Assume that K is ordered like this ko =0 < k1 < k2 < .... Let k Є ℕ be arbitrary but fixed and kj the largest integer in K such that kj ≤ k. Then where

Using the two previous lemmas, we prove that the iterations xk stay at the same level set. This is important because we minimize different merit functions and a decrease in one does not imply a decrease in the other. This set can be arbitrary close to the level set Ψλ(x0).

Proposition 5.6. The sequence generated by Algorithm 1 stays in the level set

Proof. We assume, without loss of generality, that the set K given by (19) is ordered like this k0 = 0 < k1 < k2 < ..., which does not indicate that K is infinite. We consider k Є ℕ, arbitrary but fixed and kj the largest integer in K such that kj < k. From Lemmas 5.4 and 5.5, we have where r is defined by (23), that is.

Then . Therefore, xk Є L0.

The following proposition is a consequence of inequality (24).

Proposition 5.7. Let {xk} be a sequence generated by Algorithm 1 and assume that the set K is infinite. Then each accumulation point of {xk} is a solution of the NCP.

Proposition 5.8. Let {xk} be a sequence generated by Algorithm 1 and let {xk}L be a subsequence converging to for all k Є L, then x* is a stationary point of Ψλ.

Proof. If K is infinite, the accumulation point, x* is a solution of the NCP by Proposition 5.7. Thus, this is a global minimum and therefore, a stationary point of Ψλ.

If K is finite, we can assume that K ∩ L = 0 since the sequence has an infinite number of elements. Therefore, the updating rule (18) is satisfied for all k Є L, and consequently, the sequence {μk} converges to zero. Since K is finite there exists the largest element that we call k. Using the updating rules defined in step S4 of Algorithm 1, we have the following inequalities

Let assume by contradiction that x* is not a stationary point of Ψλ . That is, ∇Ψλ (x*) = 0. First, we prove that liminfkЄL tk = 0. For this, let assume liminf tk = t* > 0. Since dk = - ∇Ψλ (xk) for all k Є L, using Armijo-rule (15), we have that

for all k Є L sufficiently large, where c = σt* ||∇Ψλ (x*)||2 > 0. Since {μk} converges to zero, Proposition 2.2 guarantees that, for all k G N, sufficiently large,

Using again that {pk} converges to zero, the sequence {||Ф λ(xk)||} is bounded; since K is finite, we have, by Proposition 5.6, that , for all k Є ℕ, sufficiently large. If L = {l0,l1... } then, for all lj, sufficiently large, we have, in analogous way than in Proposition 5.6, that,

From (28) and (29), , for all lj, sufficiently large. Then the sequence as j → ∞, which contradicts that Ψλ(x) ≥ 0 for all x Є ℝ n. Therefore, liminfkЄL tk = 0.

If necessary, we can assume, from a subsequence, that limkЄL tk = 0. Hence, a full stepsize never is accepted for all k k Є L, sufficiently large. From Armijo-rule (15), we obtain or, equivalently,

By taking the limit k → ∞ on L, we obtain from (30), the continuous differentiability of for all k Є L and the fact that θmk-1 → 0 for k → ∞, in L,

- This implies that σ ≥ 1, which is clearly a contradiction with the initial choice of parameter σ. Therefore,

The following lemmas are useful results for the proof of the main global convergence theorem.

Lemma 5.9. Let {xk} be a sequence generated by Algorithm 1 and let {xk}L be a subsequence converging to x* Є ℝn. If dk is a Newton direction for all k Є L, and the set K is finite, then the sequence {||dk||} is bounded, that is, there exist positive constants m and M such that

Proof. Let assume that dk is a Newton direction for all k G L, Thus, for these indices, we have that

If on a subset then, from (32), because is bounded on the convergent sequence . The continuity of Ф _λ would imply that Ф _λ (x*) = 0 and, from Lemma 5.1, K would be infinite, contradicting that K is finite. On the other hand, from (12), for all k Є L,

Since is convergent (either by Lemma 3.14 or because μ _k is constant, for k sufficiently large) and therefore bounded, there exists a positive constant C such that for all k Є L. From this and (33), we have for all k Є L. Since p> 1, {||d ^k ||}_L is bounded, which guarantees (31).

Lemma 5.10 ([³⁰]). Let {x ^k } be generated by Algorithm 1 and {x ^k } _L a subsequence converging to x* Є ℝⁿ . If d ^k is a Newton direction for k Є L, and K is finite, then liminfkeL t _k = 0.

Proof. The details of the proof are given in [³⁰].

The next theorem is the main convergence result of the proposed algorithm.

Theorem 5.11. Let {x ^k } a sequence generated by Algorithm 1. Then each accumulation point of {x ^k } is a stationary point of .

Proof. If K is infinite, Proposition 5.7 guarantees the conclusion of this theorem. If K is finite and k denote its largest index then (25) to (27) are satisfied for all k ≥ k. On the other hand, let x* be an accumulation point of {x ^k }. There exists a subsequence {x ^k } _L of {x ^k } converging to x*. If d ^k = - ∇Ψλ (x ^k ) for a finite number of index k Є L, Proposition 5.8 guarantees that x* is a stationary point of Ψ _λ . Without loss of generality, we assume that d ^k is a Newton direction for all k Є L. Since K is finite, we can assume that for all k Є L, it is satisfied that k Є K, thus the updating rules (17) and (18) do not apply to these indices.

We proceed by contradiction and assume that x* is not a stationary point of the function Ψ _λ .That is, Ψ _λ (x*)=0. By Lemma 5.10, liminf _kЄL t _k = 0. Let L ₀ be a subsequence of L such that {t _k } _L0 converges to zero. Then, m _k > 0 for all k Є L ₀ , sufficiently large, where m _k Є ℕ is the exponent in (14). Then, -2σ θ ^mk-1 Ψ _λ (x ^k ) < Ψ _{λμ k} (x ^k + θ ^mk-1 d ^k ) - Ψ _{λμ k} (x ^k ) for all k Є L ₀ , sufficiently large. Dividing both inequalities by θ ^mk-1 , we obtain

Let μ* be the limit of {μ _k } and if μ* = 0, we denote ∇ Ψ _λμ* (x*) for the gradient of the unperturbed function Ψ _λ in x . From (31), we can assume (through a subsequence) that {d ^k } _L0 - d* =0. By taking the limit k - ∞, we obtain

For μ ^* = 0, this follows from Lemma 3.15. If μ ^* > 0, then μ _k = μ ^* for k sufficiently large, then (34) follows from the Mean Value Theorem. Using (11), (27) and

Proposition 2.2, we have that, for all passing to the limit k - ∞, k Є L ₀ , from (34) we obtain (and from Lemma 3.14, if μ * = 0 ),

From Proposition 5.6, {Ψ _λ (x ^k )} is bounded. Moreover, , since {μ _k } converges. We have that Ψ _λ (x*) > 0, (in another case, K would be infinite). Therefore, from (35), σ ≥ (1 - a), which contradicts that σ < (1 - α). This completes the proof.

6. Local convergence

In this section, we prove under certain hypotheses that the algorithm proposed converges locally and q-superlinearly or q-quadratically. The first result gives a sufficient condition for the convergence of a sequence x ^k generated by Algorithm 1.

Theorem 6.1. If one of the accumulation points of {x ^k } , let us say x*, is an isolated solution of the NCP, then {x ^k } converges to x*.

Proof. From Lemma 5.1, the index set K defined by (19) is infinite and {μ _k } converges to zero. Therefore, Proposition 5.7 guarantees that each accumulation point of {x ^k } is also a solution of the NCP. Thus, x* must be an isolated point of {x ^k } . Let assume that x ^k _L is an arbitrary subsequence of x ^k converging to x . Using the updating rule of S3 in Algorithm 1, we have

From (36), it is enough to prove that {d ^k } _L → 0. We have that

because Ψ _λ is continuous differentiable and the solution x* is a stationary point of 1 _x . If {d ^k } _L has a finite number of Newton directions then it converges to zero. Because of this, let us assume that there exists a subsequence {d ^k } _L0 of {d ^k } _L such that d ^k is a solution of (11) for all k Є L ₀ . From (13), , for all k Є L ₀ , thus,

since p > 1. Since {x ^k } → x* and {μ _k } - 0, with k Є L ₀ , the Lemma 3.14 allows to conclude that,

and this implies that the right side of (38) converges to zero; therefore, {d ^k } _L0 → 0. From (38), we have that {d ^k } _L\L0 → 0, if L \ L ₀ is infinite. Tims, Crom (36), we have that {||x ^k+1 - x ^k || }_L → 0 , and therefore, from Proposition 3.8, {x ^k } converges to x*.

The two following results are technical lemmas that we will use in the proof of Theorem 6.6. The first one guarantees that, for all k Є K, the matrices Ф’ _λμk (x ^k ) are nonsingular and its inverses are bounded. The second guarantees that the Newton direction satisfies the descent condition (12), for all k G K, with k sufficiently large. The details of the proof of each lemma are given in [³⁰].

Lemma 6.2 ([³⁰]). Let {x ^k } be a sequence generated by Algorithm 1. If one of the limit points, lets us say, x , is a R-regular solution of the NCP then for all k G K ; sufficiently large, the matrices Ф’ _λμk (x ^k ) are nonsingular and its inverses satisfies that || Ф’ _λμk (x ^k ) ^-1 || ≤ 2c, for some positive constant c.

Lemma 6.3 ([³⁰]). Under the hypotheses of Lemma 6.2, the solution of linear system (11) satisfies the descent condition (12) for all k G K sufficiently large.

The following result will be useful to verify that Algorithm 1 eventually takes the full stepsize t _k = 1. Its proof is the same of Lemma 3.2 en [⁹].

Lemma 6.4 ([³⁰]). If there exists a scalar such that , for some k Є K and y Є ℝ ⁿ , then , where μk is the smoothing parameter in the k-th step of Algorithm 1.

The next lemma guarantees that the indices of the iterations x ^k remain in K. By repeating this argument, we will guarantee that k Є K and t _k = 1 for all k Є ℕ, suiciently large.

Lemma 6.5. Assume the hypotheses of Lemma 6.2. There exists an index k Є K such that for all k ≥ k, the index k + 1 also remain in K and x ^k+1 = x ^k + d ^k .

Proof. By Lemma 6.2, there is c > 0 such that , for all k Є K suiciently large. For this k, from the Algorithm 1, we have

Here, is such that dtst _F Υβk, where the inequality is obtained by the part b) of the Proposition 4.1. Moreover, using Proposition 3.6 and since β _k → 0, we have

with k → ∞ for k Є K. From this and Proposition 3.9 we have

with k → ∞. for k Є K. Let , where α, η and σ are the constants defined by Algorithm 1. From (41), there exists an index k Є K such that

for all k Є K with k ≥ k. Then, from Lemma 6.4, . Thus, full step is accepted for all k ≥ k, k Є K. In particular, x ^k+1 = x ^k + d ^k . From (42) and the definition o ω, we have which implies k + 1 Є K. By repeating this process, we can prove that, for all k ≥ k, itholds k Є K and x ^k+1 = + d ^k .

The next result gives suicient conditions for the rate of convergence of Algorithm 1.

Theorem 6.6. Let {x ^k } be generated by Algorithm 1. If one of its limit points, let us say x*, is a R-regular solution of the NCP, then {x ^k } converges to x* at least q-superlinearly to x*. If F is continuous differentiable with Jacobian matrix locally Lipschitz, the convergence is q-quadratic.

Proof. Lemma 6.5 guarantees that k Є K and t _k = 1, for all k Є N sufficiently large. Then the q-superlinear convergence is obtained from (40). If F is continuous differentiable with Jacobian matrix locally Lipschitz, then by Proposition 3.6, we have ||H _k (x ^k + d ^k ) -Ф _λ (x ^k ) + Ф _λ (x*) || = O(|| x ^k - x* || ² ). Since Ф _λ is locally Lipschitz, βk = || Ф _λ (x ^k ) || = || Ф _λ (x ^k ) - Ф _λ (x*) || = O(|| x ^k - x* ||). Using these two inequalities in (39), ||x ^k + d ^k - x* || = O(|| x ^k - x* || ² ); therefore, {x ^k } converges q-quadratically to x*.

7. Numerical results

In this section, we analyze the global numerical performance of Algorithm 1 and compare it with three global methods for solving the NCP. The first, a nonsmooth quasi-newton method proposed in [⁵] which we call Algorithm 2. The second, a smoothing Jacobian method proposed in [¹⁸] which, unlike our proposal, uses a smoothing of the Fischer function (Algorithm 1 with λ = 2), which Algorithm 3 and, the third, a smooth Newton method proposed recently in [³²], which we call Algorithm 4. We vary λ in two forms obtaining two versions of our algorithm, namely, Method 1: we use the dynamic choice of λ used in [⁵], (this strategy combines the eiciency of Fisher function far from the solution with that of the minimum function near to it), Method 2: we vary randomly λ in the interval (0, 4). We finalized the section with a local analysis of our algorithmic proposal.

For the parameters, we use the following values: p = 10^-18 , p = 2.1, θ = 0.5, σ = 10^-4,Υ= 30, α= 0.95, η = 0.9, Є₁= 10^-12 Є₂ = 10^-6 , k _max = 300, t _min = 10 ^-16 (minimum stepsize in linear search), where Є₁ , Є₂ are the tolerances for Ф _λ (x ^k ) and || ∇ Ψ _λ (x ^k )||, respectively [¹⁸].

For the numerical test, we consider nine complementarity problems associated with the functions Kojima-Shindo (Koj-Shi), Kojima-Josephy (Koj-Jo), Mathiesen modificado (Math mod), Mathiesen (Mathiesen) Billups (Billups) [⁷], [²⁵]; Nash-Cournot (Nash-Co) [¹⁶], Hock-Schittkowski (HH 66) [³²], Geiger-Kanzow (Geiger-Kanzow) [¹⁵], Ahn (Ahn) [²]. We implemented Algorithms 1 (with Methods 1 and 2) and the test functions in MATLAB and use the following starting points taking from [⁵], [³²],

To analyze the global convergence of Algorithm 1 and the variation of λ, we generate 100 random initial vectors for each problem with each of the methods described previously. We present the results in Table 1, which contains the problem and the method used to choose the parameter λ; the execution average time in seconds (t); the average number of iterations (k), and the percentage of times that the algorithm finds a solution to the problem (Success (%)).

Table 1 Algorithm 1 varying X with random starting points.

The results of Table 1 show that the number of iterations with Methods 1 and 2 are similar, except for two of the problems for which Method 2 increases them. In average time, except for one problem, Method 2 always is better (even with the problems where there are more iterations). In Succes(%), they are similar, except in the case of Billups Problem, where it decreases by 31 % with Method 2. Thus, for this set of numerical test, Method 1, generally used to choose λ in nonsmooth Newton-type methods for NCP, is not better than a random choice of this parameter (Method 2), which indicates that it would be convenient to find an alternative to choosing λ.

Now, we compare Algorithm 1 with Algorithms 2, 3, and 4. For this comparison, first, we consider Algorithm 1 with Method 1 versus the other three algorithms. Afterward, we consider Algorithm 1 with Method 2. We do numerical tests using the fixed starting points described previously. In addition, we take λ = 2 in our algorithm to obtain Algorithm 3 [¹⁸]. We measure the average of iterations and the average algorithm execution time. We present the results in Table 2.

To compare with Algorithm 2, we consider the results associate with Koj-Jo, Koj-Shi, Math mod, and Billups problems, which were also used in [⁵] to analyze the performance of its algorithm (Tabla 4.2 in [⁵]). Algorithm 1, with Methods 1 and 2, converges for some starting points for which Algorithm 2 does not converge. For example, with Method 1, the Koj-Jo problem with x ₁₀ and the Billups problem with x₁₄, and with Method 2, the Koj-Jo problem with x ₂₀. To compare with Algorithm 3, we consider the results in Table 2 for Methods 1 and 2 versus those in the same table for λ = 2. We observe that with Method 1, the average number of iterations and the execution time of Algorithms 1 and 3 are similar. Now, Algorithm 3 compared to Algorithm 1 using Method 2 presents better performance for some problems in the average number of iterations, (Mathiesen and Geiger-Kanzow problems). On the other hand, the execution time with these two methods is very similar, except for 3 problems (Mathiesen Mod, Billups, H66), where it is less using Method 2. Thus, being able to choose A brings advantages.

Table 2: Comparison of the Algorithms 1 and 3.

Taking into account the results for Koj-Shi, Mathiesen, HH 66, and Geiger-Kanzow problems, reported in Table 2 for Algorithm 1 and in Tables 1 to 4 in [³²] for Algorithm 4, we observe that in all cases, our algorithm with Methods 1 finds a solution in fewer iterations. However, with Method 2, it generally exceeds the number of iterations reported in [³²].

Finally, we analyze the local performance of Algorithm 1. In Section 6, we prove, under certain hypotheses, that Algorithm 1 converges superlinear and even quadratically, which is desirable for an iterative method. To illustrate this types of convergence we calculate the quotients

which are related to the definitions of superlinear and quadratic convergence of a vector sequence, respectively. In Table 3, we present the results for Billups problem (for more examples, see [³⁰]). In this Table, R1 _k , clearly, converges to zero, which means that Algorithm 1 converges at least superlinearly, and R2_k seems to be bounded, which means that Algorithm 1 may converge quadratically. This fast convergence of Algorithm 1 is due to (closely related to what was proved in Section 6: near the solution, the Newton step is full (t _k = 1). Moreover, this is illustrated in Table 4, where the column np indicates the full number of steps in the linear search which also correspond to the last steps.

Table 3 Convergence rate for Algorithm 1 with Billups Problem.

Table 4 Full Newton steps of Algoritmo 1 varying λ.

It is important to mention that in all cases, Algorithm 1 uses the Newton direction which is desirable because it makes its convergence fast. In the 77% of numerical tests, the number of full Newton steps in the monotone linear search (np) is greater than or equal to half the number of iterations ( k/2, ) which explains the fast convergence of our algorithm for this set of tests.

In the other part, we use three indices to collect additional information to compare Algorithm 1, varying λ (Methods 1 and 2) with Algorithm 3 (Algorithm 1 with λ = 2). These indices are the follows [⁶]:

Robustness index:

Efficiency index:

Combined robustness and efficiency index:

where r _ij is the number of iterations required to solve the problem i by the method j, r _ib = min_j , t _j is the number of successes by method j and nj is the number of problems attempted by method j.

For the calculation of the indices, we use the data from Table 4 and Table 7.1 in [³⁰], the latter for λ - 2. In Table 5, we present the results obtained.

Table 5 Indices for the Algorithm 1 varying λ.

On this set of problems, the results in Table 5 confirm robustness of Algorithm 1 with the three choices of parameter λ. Also, the algorithm is more efficient with the dynamic choice of A (Method 1) of the three, although further testing will indicate to what extent this is true of broader classes of problems.

8. Final Remarks

One way to deal with the non-differentiability of the NCP is using the smoothed Jacobian strategy [⁹], [¹⁸], which allows us to approximate the reformulation of the problem by a succession of differentiable nonlinear systems that depend on a positive parameter. In this article, we proposed a global smoothed Jacobian algorithm that generalizes the one proposed in [¹⁸] to all members of the family defined in (5). We developed their convergence theory and did numerical tests to analyze their global and local performance. Our proposal presents some advantages in terms of global convergence compared to other methods as those proposed in [⁵], [¹⁸], and [³²].

Finally, we believe that it is necessary to study a relationship between the parameter A and the nonlinear complementarity problem, which may improve the convergence of the algorithm. It would also be interesting to apply the smoothing Jacobian strategy to equation-reformulation of the nonlinear complementarity problem presented in [³²].

Acknowledgments.

The authors are grateful an anonymous referee for valuable suggestions and comments.

References

[1] Abaffy J., Broyden C. and Spedicato E., "A class of direct methods for linear systems", Numer. Math ., 45 (1984), No. 3, 361-376. doi:10.1007/BF01391414. [ Links ]

[2] Ahn B.H., "Iterative methods for linear complementarity problems with upperbounds on primary variables", Math. Program ., 26 (1983) No. 3, 295-315. doi:10.1007/BF02591868. [ Links ]

[3] Anitescu M., Cremer J.F. and Potra F.A., "On the existence of solutions to complementarity formulations of contact problems with friction", Complementary and Variational Problems: State of the Art ., 92 (1997), 12-21. [ Links ]

[4] Arenas F., Martínez H.J. and Pérez R., "A local Jacobian smoothing method for solving Nonlinear Complementarity Problems", Universitas Scientiarum, 25 (2020), No. 1, 149-174. doi: 10.11144/Javeriana.SC25-1.aljs. [ Links ]

[5] Arias C.A., "Un algoritmo cuasi Newton global para problemas de complementariedad no lineal", Thesis (MSc.), Universidad del Cauca, Popayan, 2014, 44 p. [ Links ]

[6] Buhmiler S. and Krejic N., "A new smoothing quasi-Newton method for nonlinear complementarity problems", J. Comput. Appl. Math ., 211 (2008), No. 2, 141-155. doi: 10.1016/j.cam.2006.11.007. [ Links ]

[7] Billups S.C., "Algorithms for Complementarity Problems and Generalized Equations", Thesis (Ph.D.), University of Wisconsin, Madison, 1994. 159 p. [ Links ]

[8] Chen A., Oh J.S., Park D. and Recker W., "Solving the bicriteria traffic equilibrium problem with variable demand and nonlinear path costs", Appl. Math. Comput., 217 (2010), No. 7, 3020-3031. doi: 10.1016/j.amc.2010.08.035. [ Links ]

[9] Chen X., Qi L. and Sun D., "Global and superlinear convergence of the smoothing Newton method and its application to general box constrained variational inequalities", Math. Comp ., 67 (1998), No. 222, 519-540. doi: 10.1090/S0025-5718-98-00932-6. [ Links ]

[10] Clarke F.H., "Generalized gradients and applications", Trans. Amer. Math. Soc ., 205 (1975), 247-262. doi:10.1090/S0002-9947-1975-0367131-6. [ Links ]

[11] Clarke F.H., Optimization and Nonsmooth Analysis, SIAM, 1990. [ Links ]

[12] Dennis J.E. and Schnabel R.B., Numerical Methods for Unconstrained Optimization and Nonlinear Equations, SIAM, 1st ed., Philadelphia, 1996. [ Links ]

[13] Facchinei F. and Soares J., "A New Merit Function For Nonlinear Complementarity Problems And A Related Algorithm", SIAM J. Optim ., 7 (1997), No. 1, 225-247. doi: 10.1137/S1052623494279110. [ Links ]

[14] Ferris M.C. and Pang J.S., "Engineering and Economic Applications of Complementarity Problems", SIAM Rev ., 39 (1997), No. 4, 669-713. doi: 10.1137/S0036144595285963. [ Links ]

[15] Geiger C. and Kanzow C., "On the resolution of monotone complementarity problems", Comput. Optim. Appl ., 5 (1996), No. 2, 155-173. doi: 10.1007/BF00249054. [ Links ]

[16] Harker P.T., "Accelerating the convergence of the diagonalization and projection algorithms for finite-dimensional variational inequalities", Math. Program ., 41 (1988), No. 1, 29-59. doi: 10.1007/BF01580752. [ Links ]

[17] Kanzow C. and Kleinmichel H., "A New Class of Semismooth Newton-Type Methods for Nonlinear Complementarity Problems", Comput. Optim. Appl ., 11 (1998), No. 3, 227-251. doi:10.1023/A:1026424918464. [ Links ]

[18] Kanzow C. and Pieper H., "Jacobian Smoothing Methods for Nonlinear Complementarity Problems", SIAM J. Optim ., 9 (1999), No. 2, 342-373. doi: 10.1137/S1052623497328781. [ Links ]

[19] Kostreva M.M., "Elasto-hydrodynamic lubrication: A non-linear complementarity problem", Internat. J. Numer. Methods Fluids ., 4 (1984), No. 4, 377-397. doi: 10.1002/fld.1650040407. [ Links ]

[20] Li D.H. and Fukushima M., "Globally Convergent Broyden-Like Methods for Semismooth Equations and Applications to VIP, NCP and MCP", Ann. Oper. Res ., 103 (2001), No. 1, 71-97. doi: 10.1023/A:1012996232707. [ Links ]

[21] Lopes V.L., Martínez J.M. and Pérez R., "On the local convergence of quasi-Newton methods for nonlinear complementarity problems", Appl. Numer. Math ., 30 (1999), No. 1, 3-22. doi:10.1016/S0168-9274(98)00080-4. [ Links ]

[22] Ma C., "A new smoothing quasi-Newton method for nonlinear complementarity problems", Appl. Math. Comput. , 171 (2005), No. 2, 807-823. doi: 10.1016/j.amc.2005.01.088. [ Links ]

[23] Moré J.J. and Sorensen D.C., "Computing a Trust Region Step", SIAM J. Sci. and Stat. Comput ., 4 (1983), No. 3, 553-572. doi: 10.1137/0904038. [ Links ]

[24] Nocedal J. and Wright S., Numerical optimization, Springer Science & Business Media, 2nd ed., 2006. [ Links ]

[25] Pang J.S. and Qi L., "Nonsmooth Equations: Motivation and Algorithms", SIAM J. Optim ., 3 (1993), No. 3, 443-465. doi: 10.1137/0803021. [ Links ]

[26] Pérez R. and Lopes V.L., "Recent Applications and Numerical Implementation of Quasi-Newton Methods for Solving Nonlinear Systems of Equations", Numer. Algorithms ., 35 (2004), No. 2, 261-285. doi: 10.1023/B:NUMA.0000021762.83420.40. [ Links ]

[27] Qi L., "Convergence Analysis of Some Algorithms for Solving Nonsmooth Equations", Math. Oper. Res ., 18 (1993), No. 1, 227-244. doi: 10.1287/moor.18.1.227. [ Links ]

[28] Qi L., C-differentiability, C-differential operators and generalized Newton methods, School of Mathematics University of New South Wales, Sydney, 1996. [ Links ]

[29] Sherman A.H., "On Newton-iterative methods for the solution of systems of nonlinear equations", SIAM J. Numer. Anal ., 15 (1978), No. 4, 755-771. doi: 10.1137/0715050. [ Links ]

[30] Sánchez G.W., "Un algoritmo global con jacobiano suavizado para complementariedad no lineal", Thesis (MSc), Universidad del Cauca, 2019. [ Links ]

[31] Schraudolph N.N., Yu J. and Günter S., "A stochastic quasi-Newton method for online convex optimization", Eleventh International Conference on Artificial Intelligence and Statistics, San Juan, Puerto Rico, 2, 436-443, March, 2007. [ Links ]

[32] Zhu J. and Hao B., "A new smoothing method for solving nonlinear complementarity problems", Open Math ., 17 (2019), No. 1, 104-119. doi: 10.1515/math-2019-0011. [ Links ]

¹Given A = (α _ij ) ∈ ℝ ^m×n and the index sets η and τ, A _ητ is the matrix with components α _ij , i ∈ η and j ∈ τ.

²The matrix M ∈ ℝ n×m is called a P-matrix, if for every nonzero vector and ∈ ℝ n exists an index i0 = i0(y) ∈ {1, . . . , n} such that yi0 [My]i0 ≥ 0.

³If Ω is the set of accumulation points of {x ^k }, we say that x* Є Ω is an isolated accumulation point, if exist δ > 0 such that B(x; δ) = {x*}.

To cite this article: W. Sánchez, R. Pérez & H. J. Martínez, A global Jacobian smoothing algorithm for nonlinear complementarity problems, Rev. Integr. Temas Mat., 39 (2021), No. 2, 191-215. doi: 10.18273/revint.v39n2-20210004

Received: October 11, 2020; Accepted: June 14, 2021

This is an open-access article distributed under the terms of the Creative Commons Attribution License