Table Of ContentJOURNALOFLATEXCLASSFILES,VOL.*,NO.**,*********** 1
Optimal Inventory Decisions in the Multi-period
News-vendor Problem with Partial-Observed
Markovian Supply Capacities
Haifeng Wang, and Houmin Yan
Abstract—Thispaperconsidersamulti-periodnews-vendor I. INTRODUCTION
problem with partially observed supply capacity information
Inventorycontrolisoneofthemoststudiedtopicsforthe
which evolves as a Markovian Process. The supply capacity
supply chain management. Both demand and supply infor-
is fully observed by the buyer when the capacity is smaller
thanbuyer’sorderingquantity.Withadynamicprogramming mation is fundamental for any inventory control decisions.
formulation, we prove the existence of a unique optimal Most works in the supply chain management literature
ordering policy. In a two-state Markovian capacity case, we assumethatthesupplycapacityiseitherknowntothebuyer,
furtherdemonstratethatthebuyerordersmorethanrequired
or unlimited. Few of them consider the supply capacity
torevealsupplycapacity.Wealsoprovideanumericalexample
information uncertainty and its influence to the inventory
to demonstrate the characterization of the optimal policy.
policy. Yet the supply capacity is always unknown or only
Note to Practitioners—Supplier management is a critical
partially observable in practice. In such an environment,
task for inventory managers, in particular, when there ex-
mostofwell-knownresultsmaynothold.Itistheobjective
isting supply uncertainties. In such circumstances, inventory
managers have to make inventory replenishment decisions of this paper to explore the structural properties of an
based on their best estimation of supply conditions. Their inventory system with partially observed supply capacity
past experiences are the basis for estimation. Note that their and to study if there exists an optimal policy for the above
past experiences, to the best, only partially reveals the supply
problem.
condition.Thispaperdevelopsanewmodelinwhichitmakes
useofapartialobservedsupplycapacityinformation,whichis
derived in the order fulfillment process, to forecast the future
A. The Importance of Modeling the Partially Observed
supply capacity. There are two possible observations of the
Capacity
order fulfillment: (1) if the ordering quantity is greater than
the current period supply capacity, the buyer observes the The study of inventory systems with the partially ob-
valueofthecurrentperiodsupplycapacity;(2)otherwise,the served supply capacity is important in many real-life cases.
buyerknowsthatthecurrentperiodsupplycapacityisgreater
We shall introduce some of cases in a supply chain, where
thanitsorderingquantity.Basedonthesetwoobservations,the
capacities can be partially observed.
buyer updates the future supply capacity forecasting accord-
ingly.Comparedwithtraditionalmodels,wedemonstratethat Supplier’s capacity uncertainty: Unexpected machine
making use of the partial observed information significantly breakdowns, resulting unexpected maintenance, may lead
reduces costs of inventory systems. However, this inventory to a lower production capacity. Moreover, the uncertain
model assumes that all the leftover inventories are salvaged
repairingtimemayaffecttheavailabilityofcertainfacilities:
at the end of each period. This assumption is applicable to
the repair is not planned, even planned, then its repair-
perishableproducts.Forfutureresearch,extendingthismodel
to cases where leftover inventories to be carried to the next ing time is uncertain. These stochastic events lead to an
period is the future research direction. uncertain supply capacity. Further, the product quality can
also be uncertain. When the product quality is low or the
Index Terms—Partial information, Markovian process, in-
ventory planning and control rate of defects is high, the actual available products are
unknown. Since these uncertainties can’t be predicted, the
supply capacity is not observed directly.
ThisworkissupportedinpartbyRGCCompetitiveEarmarkedResearch
Grants,CUHK4239/03E,CUHK4167/04EandCUHK4177/06E. Buyers’ competition: A supplier serves multiple cus-
Haifeng Wang is with the Center for Intelligent Networked Systems, tomers. When the capacity is tight, the supplier reserves
Department of Automation, Tsinghua University, Beijing 100084, China
its capacity to individual customer. It is known in practice
(email:[email protected]).
Houmin Yan is with the Department of Systems Engineering and as the capacity allocation, or rationing. Orders from buyers
EngineeringManagement,ChineseUniversityofHongKong,Shatin,N.T., varyfromtimeandareunknowntotheotherbuyers.Hence
Hong Kong. He is also a non-resident member of Center for Intelligent
the reserved capacity for an individual buyer is uncertain,
Networked Systems, Department of Automation, Tsinghua University,
Beijing100084,China(email:[email protected]). evenunderconditionofknowingthesupplyallocationrules.
JOURNALOFLATEXCLASSFILES,VOL.*,NO.**,*********** 2
As a result, a buyer doesn’t know the capacity volume Hence, our finding suggests inventory managers pay a par-
reserved for him. ticular attention to the partially observed supply capacity
Multiple sources of supply: The buyer also orders from informationwhenthereexistsasupplycapacityuncertainty.
multiple suppliers. Each supplier has its own reserved
capacity for the buyer. Hence the buyer can only estimate C. Literature Review and Its Relation to Our Model
the capacity reserved for him, based on observed signals, There is an extensive literature in inventory decisions
such as the received ordering quantity. As a result, the with the given capacity constraint, such as the work of
available capacity is partially observed. Federgruen and Zipkin [5], Gallego and Scheller-Wolf [8],
Gallego and Toktay [9], and Gallien and Wein [10]. In ad-
dition, there are papers dealing with the uncertain capacity
B. Summary of Paper
constraint. Khang and Fujiwara [7] address a discrete time
This paper studies the optimal inventory replenishment inventory model where the maximum amount of supply,
policyatthebeginningofeachperiodtomeetthestochastic from which instantaneous replenishment orders can be
demand with the partially observed supply capacity infor- made, is a random variable. They also characterize the
mation. We allow the buyer to fulfill the out-of-capacity optimal ordering policies. Wang and Gerchak [13] extends
orders by paying an extra cost from other sources, such themodeltoincludethevariablecapacityandrandomyield
as a spot market. The demand is realized at the end of together in periodic inventory systems.
each period. The leftover inventory is salvaged while the The research on inventory decisions with the partially
unfulfilled demand incurs a penalty cost. observed information is quite recent. Bensoussan, Cakany-
We consider a multi-period problem where the capacity ildirim and Sethi [4] study a multi-period news-vendor
observationofthecurrentperiodwillinfluencethecapacity model with a partially observed demand. By assuming
distributionofthenextperiod,aswellasthevaluefunction that the leftover inventories are salvaged and unsatisfied
of the next period. The capacity allocated by the supplier demands are lost in each period, they decouple the peri-
maychangefromperiodtoperiod,evolvingasaMarkovian ods from the view point of the Beyesian demand update,
Process. The capacity for the current period is NOT fully and prove the existence of optimal ordering policy. Aviv
observedbythebuyeratthetimewhenthebuyerplacesits and Pazgal [1] introduce the partially observed Markovian
orders. When the order quantity is greater than the supply decision process. In their setting, to maximize its revenue,
capacity, the supplier provides its maximum capacity to the a buyer faces a dynamic pricing problem of selling a
buyer. Only at this time, the capacity is fully observed by given stock during a finite selling season. Bensoussan,
thebuyer.Thepartiallyobservedcapacityinformationlimits Cakanyildirim and Sethi [2] and [3] study the inventory
buyer’s capability in forecasting its needs as well as the system with partially observed inventory levels. Lu, Song
inventory optimization. and Zhu [11] and [12] study the news-vendor model with
In this paper, we prove the existence of an optimal censored demand data.
ordering policy for a general Markovian capacity process. To the best of our knowledge, the problem of inven-
With results for the general case, we specialize our study tory decisions with a partially observed Markovian ca-
to cases where the supply capacity has two possible states. pacity remains open. We model an inventory system with
We find out that revealing the supply capacity leads to a the partially observed Markov supply capacity. Similar to
cost reduction. Hence the buyer may purchase more than Bensoussan, Cakanyildirim and Sethi [4], the unnormal-
neededinordertofigureoutthesupplycapacity.Moreover, ized probability is used to linearized the highly nonlinear
ifthepossiblecapacityislow,thebuyer’sorderingpolicyis dynamic programming equation. We solve the inventory
equivalenttoamyopicinventoryorderingpolicy.Otherwise, managementproblemwiththepartialinformationofsupply
the buyer makes the tradeoff between the current and the capacity, while Bensoussan, Cakanyildirim and Sethi [4]
future period costs. solve the inventory management problem with the partial
We also study the performance of the partial-observation information of demand. The methodology in Bensoussan,
model and the non-update model. In the partial-observation Cakanyildirim and Sethi [4] is extended in our model to
model, the buyer dynamically updates the supply capacity prove the existence of optimal inventory policy. Moreover,
forecast according to the partial supply information based an extensive model with two possible supply capacities is
on the order fulfillment in each period. On the contrary, for studied and the buyer’s tradeoff between the current and
non-update model, this partially observed supply capacity future period cost is characterized.
information is ignored. We prove the existence of the
optimal inventory replenishment for the partial-observation D. Plan of Paper
model. And we use numerical experiments to demonstrate In the next section, we formulate the problem. In our
significant cost savings of this partial-observation model. model, the evolution of partially observed Markovian ca-
JOURNALOFLATEXCLASSFILES,VOL.*,NO.**,*********** 3
pacity is characterized, and the un-normalized probability B. Dynamic forecast updates for the supply capacity
is used to linearize the dynamic equation. In Section 3, we
In each period, the buyer observes a , which is the
t
provetheexistenceofanoptimalorderingpolicy.InSection
amount of ordering charged at the normal purchasing cost
4, we specialized the optimal inventory replenishment pol-
c. Then, two scenarios are as follows.
icy for the problem with two-state Markovian capacity. We
1) When the order quantity is greater than the supply
alsocarryoutnumericalexperiments.Wefindthatordering
capacity, i.e. q ≥ Q , the supplier can’t satisfy the
uptothesmallerpossiblecapacitycansavethefuturecost. t t
request. Then the supplier exhausts all of its capacity
Finally, we conclude the paper in Section 5.
to produce products, i.e. a = Q . For this scenario,
t t
the supply capacity reveals through the observed a .
t
The density probability function of the capacity for
II. PROBLEMFORMULATIONANDDYNAMIC
the next period can be obtained as follows:
PROGRAMMINGEQUATIONS
π (Q )=p(Q |a ); (1)
t+1 t+1 t+1 t
A. Sequence of Events and Cost Structure
2) On the other hand, the order quantity is less than the
The sequence of events is as follows: at the beginning of supply capacity, and the supplier satisfies the request
each period, the buyer informs the supplier for its intended completely, i.e. q ≤ Q and a = q . For this
t t t t
order quantity qt; thus the supplier checks the available scenario, the buyer knows that the capacity is greater
capacity reserved for this buyer and tries its best to satisfy thanitsorderquantity,i.e.Q ≥a ,butdoesn’tknow
t t
therequestedpurchase.Denotethereservedsupplycapacity theexactcapacity.Wedenotethiscaseasthepartially
as Qt. Note that {Qt} is a Markovian Process with a observed supply capacity. Then
transition probability p(Q | Q ), which is exogenously
t+1 t
decided as an input. The buyer gets 1) at = qt ∧ Qt πt+1(Qt+1) = Pr{Qt+1 |Qt ≥at}
at the normal unit purchasing cost c and 2) qt − at, if = Pr{Qt+1,Qt ≥at}
the unsatisfied order is greater than 0, at a higher unit Pr{Q ≥a }
t t
(cid:82)
purchasing cost c+e, where e≥0 indicates that the extra +∞
π (ξ)p(Q |ξ)dξ
ordercanbemoreexpensive.Notethattheextraordermay = at (cid:82)t t+1 , (2)
+∞
come from the same supplier by asking for an expensive, at πt(ξ)dξ
emergentorder,orcomefromsomeothersupplyresources,
where the second equality is due to Bayesian equa-
such as a spot market. At the end of this selling season,
tions, and the third equality is due to the condi-
all the unsatisfied demand is lost. Shortage cost at the unit
tionalprobabilityequations,whichisPr{Q ,Q ≥
cost of b, and salvaged value at the unit revenue of h are (cid:82)+∞ t+1 t
a }= Pr{Q |Q =ξ}π (ξ)dξ.
recorded. t at t+1 t t
Hence the supply capacity Q is partially observed.
The following is the list of notations used in this paper: t
This approach is similar to Bensoussan, Cakanyildirim and
Sethi [4] where the partially observed demand signal is
q : order quantity from the buyer;
t characterized.
Q : available capacity of the supplier;
t
α: discount factor;
Dt : demand of period t; C. Dynamic Programming Equations
p(Q |Q ): transition probability of a Markov
t+1 t With the sequence of events described above, it is possi-
process {Q };
t ble for us to derive one-period cost function as L(D ,q )+
Pr{A}: probability of the event A; t t
e(q −Q )+, where
c: normal unit purchasing cost; t t
(cid:189)
at : amount of order charged at normal cy−h(y−x), if x≤y
L(x,y) =
purchasing cost c; cy+b(x−y), if y ≤x,
(cid:189)
e: additional unit purchasing cost for the
cy−h(y−x), if x≤y
extra unit when q >Q ; = (3)
t t bx+(c−b)y, if y ≤x.
h: unit salvage value;
b: unit shortage cost; To eliminate trivial cases, it is reasonable to assume that
F(·): cumulative demand distribution function; 0≤h<c<b and 0<e. Then it is straightforward to see
π (·): probability density function of capacity that
t
(cid:189)
Q ;
t cy, if x≤y
V (·): value function. L(x,y) ≤ if y ≥0. (4)
t bx, if y ≤x
JOURNALOFLATEXCLASSFILES,VOL.*,NO.**,*********** 4
I
Recallthatπ (·)istheprobabilitydensityfunctionofthe otherwise =0. Define recursively that
t A
capacity Qt. Then the value function can be expressed as (cid:90) +∞
I
follows: ρ (x) := p(x|ξ)ρ (ξ)dξ
t+1 at=qt t
qt
Vt(πt(cid:90)(x)qt) ρ (x) := +πI(xat)<.qtp(x|at), t≥1;
= inf { E [L(D ,q )+e(q −Q ) 1 1
qt≥0 0 Dt t t t t
Also let
+αVt+1(p(x|Qt))]πt(Qt)dQt (cid:90)
(cid:90)
+∞ λ := ρ (x)dx.
+ E [L(D ,q )+ t t
Dt t t
qt (cid:82) Then
+∞π (ξ)p(x|ξ)dξ (cid:90)
αVt+1( qt (cid:82)q+tt∞πt(ξ)d(cid:90)ξ )]πt(Qt)dQt} λt+1 =Iat=qt qt+∞ρt(ξ)dξ+Iat<qt, t≥1;λ1 =1.
qt We can check recursively (it holds when t = 1; and by
= inf {E [L(D ,q )]+ e(q−Q )π (Q )dQ
qt≥0(cid:90) Dt t t 0 t t t t supposingthecorrectnessfort,checkthatitholdsfort+1)
qt that the following equation is true:
+α V (p(x|Q ))π (Q )dQ
t+1 t t t t
0 (cid:82)+∞ (cid:90) ρt(x) = πt(x)λt.
π (ξ)p(x|ξ)dξ +∞
+αV ( qt (cid:82) t ) π (Q )dQ }, Hence, we have
t+1 +∞ t t t
qt πt(ξ)dξ qt (5) πt(x) = ρtλ(x) = (cid:82) ρρt((xx))dx.
t t
where the first term on the right hand side is In what follows, we consider the infinite horizon case. To
the cost when the supply capacity Q is smaller simplify the notation, the sub-script is omitted.
t
than the buyer’s intended ordering quantity, and We define W(ρ) as
(cid:82)
+∞ (cid:90)
the sRecond term of + qt EDt[L(Dt,qt) + W(ρ) := ρ(x)dx·V((cid:82) ρ ). (6)
tαhVet+su1p(pql+ytR∞q+ctπ∞atp(πaξtc)(piξt()y·d|ξξQ)dξ)i]sπtl(aQrgte)rdtQhatn itshe tbhueyerc’ossitntewnhdeend It is obvious that W(·) can be obtaiρn(exd)dbxy V(·). By
t
ordering quantity. definition of W(ρ), we have
The buyer chooses a set of purchasing quantities {q } to
t W(ρ(·))
minimize its total cost. Ordering too much results in a high (cid:90) (cid:90)
purchasingcost;ontheotherhand,orderingtoolessresults = ρ(x)dx· inf { L(D,q)f(D)dD
q≥0
in a high penalty cost and failures to observe the supply (cid:90)
q ρ(Q)
capacity. + e(q−Q)(cid:82) dQ
ρ(x)dx
0(cid:90)
q ρ(Q)
+α V(p(·|Q))(cid:82) dQ
D. Unnormalized Probability ρ(x)dx
0
(cid:82) (cid:41)
It is difficulty to directly solve the dynamic equation +∞ρ(ξ)p(·|ξ)dξ (cid:90) +∞ ρ(Q)
(5) owning to complex fractional parts. Both denominator +αV( q (cid:82) ) (cid:82) dQ
+∞ρ(ξ)dξ q ρ(x)dx
and numerator include the decision variable qt. In this part, (cid:189)(cid:90) q (cid:90)
we use the similar technique of a variable substitution as
= inf L(D,q)f(D)dD ρ(x)dx
in Bensoussan, Cakanyildirim and Sethi [4] so that the q≥0
(cid:90)
equation (5) can be expressed in a simpler and solvable q
+ e(q−Q)ρ(Q)dQ
form.
0(cid:90)
From equations (1) and (2), we know q
+α W(p(·|Q))ρ(Q)dQ
(cid:82)
π (x) = I q+t∞(cid:82)πt(ξ)p(x|ξ)dξ 0 (cid:90) +∞ (cid:190)
t+1 at=qt +∞π (ξ)dξ +αW( ρ(ξ)p(·|ξ)dξ)
qt t q
I
+ p(x|a ), t≥1, := inf G(q). (7)
at<qt t q≥0
I
where the indicator function =1 when event A occurs, Note that Equation (7) is a Bellman equation in ρ.
A
JOURNALOFLATEXCLASSFILES,VOL.*,NO.**,*********** 5
III. THEEXISTENCEOFANOPTIMALFEEDBACK period forecasted by p(·|·) is equal to the mean of current
SOLUTION period demand.
Inthissection,wefirstshowtheexistenceanduniqueness We need the following lemma. Its proof appears in
of the solution W for equation (7). Then we prove the Appendix.
existence of an optimal feedback control q∗. Lemma 3.1: If equation (7) has a solution W, the solu-
tion is in B.
A. Existence of a Unique Solution W Now define the map T(W) as,
In this subsection we start with a definition of a function (cid:90) (cid:90)
space B. Then we prove that, if there is a solution to T(W) := min{ L(D,q)f(D)dD ρ(x)dx
equation (7), it must exist in the above defined function q≥(cid:90)0
q
space. As a result, the existence and the uniqueness of the
+ e(q−Q)ρ(Q)dQ
solution follow. Note that similar definitions of function 0(cid:90)
spacesHandB areusedinBensoussan,Cakanyildirimand q
+α W(p(·|Q))ρ(Q)dQ+
Sethi [4].
(cid:90)0
Define function space H of function ρ: +∞
(cid:90) αW( ρ(ξ)p(·|ξ)dξ)}. (15)
∞
H:={ρ∈L1(R+): x|ρ(x)|dx<∞}, (8) q
0
Thenwecanobtainthefollowinglemma.SeeAppendixfor
where L1(R+) is the space of integrable functions whose
its proof.
domain is the set of nonnegative real numbers, and
Lemma 3.2: ||T(W)−T(W˜)|| ≤ αmax{1,c }||W −
B 0
H+ :={ρ∈H|ρ≥0}, (9) W˜|| .
B
Theorem 3.1: There exists a unique solution W of the
with the norm:
(cid:90) (cid:90) dynamic programming equation (7).
+∞ ∞
||ρ||= |ρ(x)|dx+ x|ρ(x)|dx. (10) Proof: From Assumption 3.1, we know that αc < 1,
0
0 0 such that αmax{1,c } < 1. Hence by Lemma 3.2, the
0
Also define the following space B of function φ: mappingT :B →B isacontractionmapping.Thenbythe
(cid:189) (cid:190)
Contraction Mapping Theorem [6], there exists a unique
|φ(ρ)|
B = φ(ρ):H+ →R|sup <∞ , (11) solution W such that T(W)=W. Hence we have proved
||ρ||
x>0
the desired results. (cid:164)
with the norm:
|φ(ρ)|
||φ|| = sup . (12)
B ||ρ||
ρ∈H+ B. The existence of Optimal Feedback Control
For the technical convenience, we need the following as-
By Equation (3), we can easily obtain L(x,y) ≥ (c−
sumptions:
h)y. Substitute it into Equation (7), we then can prove that
Assumption 3.1: Assume, for any ρ∈H+, (cid:82)
(cid:90) (cid:90) (cid:90) W(ρ) ≥ (c−h)q ρ(x)dx. With equations (52) and (56),
we know that
x p(x|ξ)ρ(ξ)dξdx≤c ξρ(ξ)dξ, with αc <1.
0 0
(cid:90)
(13)
(c−h)q ρ(x)dx≤W(ρ)
This assumption is necessary to complete proofs of the
(cid:181) (cid:182)
followinglemmasandthetheorem.Itissatisfiedbyspecific
bµ
probability transition function p(·|·), such that ≤ µD +αmax{1,c0}||W||B ||ρ||
(cid:90) (cid:90) (cid:90) (cid:90) (cid:181) Q (cid:182)
bµ bµ
α x p(x|ξ)ρ(ξ)dξdx≤αc0 ξρ(ξ)dξ < ξρ(ξ)dξ, ≤ D +αmax{1,c } D ||ρ||.
µ 0 µ (1−αmax{1,c })
Q Q 0
i.e.
(cid:82) R R
The discounted expected demand of next period Note that ρ(x)dx = R xρ(x)dx = xρ(x)dx, then
R R xπ(x)dx µQ
forecasted by p(·|·) R ||ρ|| = ρ(x)Rdx+ xρ(x)dx = 1+µ . Hence we obtain
ρ(x)dx ρ(x)dx Q
that
< The mean of current period demand. (14)
Since the discount factor is always strictly smaller than 1, bµ (1+µ )
0≤q ≤ D Q :=B. (16)
this assumption holds when the expected demand of next µ (c−h)(1−αmax{1,c })
Q 0
JOURNALOFLATEXCLASSFILES,VOL.*,NO.**,*********** 6
Now we define W (ρ) as values of Q and Q with Q < Q . Note that three or
B l h l h
more finite state problem can be analyzed by the similar
W (ρ)
B (cid:90) (cid:90) approach, with much more tedious forms and equations.
= min { L(D,q)f(D)dD ρ(x)dx We assume that the un-normalized capacities are Ql and
q∈[0,B] Q , with probability of θ , and θ , respectively. Hence the
(cid:90) h l h
q
distribution function can be written as
+ e(q−Q)ρ(Q)dQ
0(cid:90) ρ(x)=θ I +θ I . (21)
q l x=Ql h x=Qh
+α W (p(·|Q))ρ(Q)dQ
B
Suppose that the transition probability functions are
0 (cid:90) (cid:190)
+∞
I I
+αW ( ρ(ξ)p(·|ξ)dξ) . (17) p(x|Q )=β +(1−β ) ,
B l l x=Ql l x=Qh
q I I
p(x|Q )=(1−β ) +β , (22)
h h x=Ql h x=Qh
Note that W (ρ) can be shown to be a unique solution to
B
equation(17)sinceW(ρ)isshowntobeuniquebyLemma that is to say, the transition matrix is
(cid:181) (cid:182)
3.2.IfW (ρ)iscontinuousinρ,weobtainthecontinuityof
B β 1−β
G(q)(definedinequation(7))inq.Afterthat,theexistence B:= l l .
1−β β
h h
oftheminimizerq∗ isestablished.SeeAppendixforproofs
of the following lemma. ThenthedynamicprogrammingEquation(7)canbefurther
Lemma 3.3: For any ρ,ρ˜∈H+, we have reduced to
(cid:90)
q
|W (ρ)−W (ρ˜)| ≤ H ||ρ−ρ˜||, (18) W(ρ) = min{[cq−h (q−D)f(D)dD
B B B
q≥0(cid:90) 0
where H is a constant which is independent of ρ. +∞
B
+b (D−q)f(D)dD](θ +θ )
Theorem 3.2: There exists an optimal feedback control, l h
q
i.e, the optimal order quantity.
I
+ e(q−Q )θ
Proof: Recall the definition of G(q) in Equation (7), and Ql≤q<Qh l l
I
we can write + Qh≤q[e(q−Ql)θl+e(q−Qh)θh]
WB(ρ)=q∈m[0i,nB]G(q), (19) +αIQl<q≤QhθlW(p(·|Ql))
I
+α [W(p(·|Q ))θ +W(p(·|Q ))θ ]
where Qh<q l l h h
(cid:90) (cid:90) +αI W(θ p(·|Q )+θ p(·|Q ))
q≤Ql l l h h
G(q) = L(D,q)f(D)dD ρ(x)dx I
+α W(θ p(·|Q ))}. (23)
(cid:90) Ql<q≤Qh h h
q
+ e(q−Q)ρ(Q)dQ With the definition of W(ρ) in Equation (6), we obtain
0(cid:90)
q W(aρ(.))=aW(ρ(.)), if a is a constant. (24)
+α W (p(·|Q))ρ(Q)dQ
B
0 (cid:90) By collecting and simplifying terms, we have
+∞ (cid:90)
+αW ( ρ(ξ)p(·|ξ)dξ). (20) q
B W(ρ) = min{[cq−h (q−D)f(D)dD
q
q≥0(cid:90) 0
The first, second, and third terms of equation (20) are +∞
continuous in q. By Lemma 3.3, we know that W(ρ) is +b (D−q)f(D)dD](θl+θh)
continuous in ρ, which leads to the continuity of the fourth q
I
+ e(q−Q )θ
term of equation (20) in q. Hence G(q) is continuous in Ql≤q<Qh l l
q. Moreover, q belongs to a bounded and closed set [0,B]. +I [e(q−Q )θ +e(q−Q )θ ]
Qh≤q l l h h
Hence,byWeierstrass’Theorem[6],thereexistsaminimum I
+α [W(p(·|Q ))θ +W(p(·|Q ))θ ]
intheboundedandclosedsettominimizesuchacontinuous Ql<q l l h(cid:111) h
I
function. The minimum q∗ can be obtained. (cid:164) +α q≤QlW[θlp(·|Ql)+θhp(·|Qh)] . (25)
Note that the aim of observing current period supply
IV. OPTIMALPOLICYANALYSISOFTWO-STATE capacity is to predict supply capacity for future periods.
MARKOVCHAIN And with this prediction, it is possible for us to adjust our
Inordertofurtheranalyzetheoptimalpurchasingpolicy, purchasing quantity to minimize the cost function. In order
and reveal the supply capacity, we specialize our study to to characterize the purchasing policy which influences the
cases where the supply capacity can only take two possible costs for the current (myopic cost) and the future period
JOURNALOFLATEXCLASSFILES,VOL.*,NO.**,*********** 7
(cost-to-go from the next period), we divide the total cost Then P (q) = g (q)+g (q). The minimizer of function
c 1 2
function into two parts: P (q) and P (q). Define g (q) is
c f 1
(cid:90) b−c
q F−1( ).
P (q) := [cq−h (q−D)f(D)dD b−h
c
(cid:90) 0 g (q) is a piece-wise linear function with minimizers less
+∞ 2
+b (D−q)f(D)dD](θl+θh) than or equal to Ql.
q For the notation convenience, we further define
I
+ Ql≤q<Qhe(q−Ql)θl q¯ := F−1(b−c),
+I [e(q−Q )θ +e(q−Q )θ ], c b−h
Qh≤q l l h h
b−c−e θl
and q¯ := F−1( θl+θh),
e1 b−h
I
P (q) := α [W(p(·|Q ))θ +W(p(·|Q ))θ ] b−c−e
f Ql<q l l h h q¯ := F−1( ). (29)
+αI W [θ p(·|Q )+θ p(·|Q )]. e2 b−h
q≤Ql l l h h
Obviously, we have q¯ >q¯ >q¯ .
c e1 e2
Then the total cost can be expressed as follows:
1) Case 1: q¯ <Q : Since q¯ minimizes both g (q) and
c l c 1
W(ρ) = min{Pc(q)+Pf(q)}. (26) g2(q), the minimizer
q≥0
b−c
q∗ =q¯ =F−1( ). (30)
c b−h
A. The case of the Independent Capacity
2) Case 2: Q ≤q¯ <Q : In this case, since g (q) and
When the prevailing future capacity is independent to l c h 1
g (q)arenon-increasingforq ∈[0,Q ]andnon-decreasing
the current capacity, we denote this case as the case 2 l
for q ∈ [Q ,+∞), we can narrow down our search space
of the independent capacity. Mathematically, the transi- h
to [Q ,Q ]. That is
tion probabilities are chosen as β = 1 − β , which l h
l h (cid:90)
q
leads to p(·|Q ) = p(·|Q ). With Expression (24),
l h min {[cq−h (q−D)f(D)dD
W [θlp(·|Ql)+θhp(·|Ql)]=(θl+θh)W(p(·|Ql)). Hence (cid:90) q∈[Ql,Qh] 0
+∞
I
Pf(q) = α Ql<q[W(p(·|Ql))θl+W(p(·|Ql))θh] +b (D−q)f(D)dD](θl+θh)+e(q−Ql)θl}. (31)
+αI W [θ p(·|Q )+θ p(·|Q )] q
q≤Ql l l h l
Let us solve the first order condition, and we have
= α(θ +θ )W(p(·|Q )), (27)
l h l
b−c−e θl
which is independent of q. No matter how the purchasing F−1( θl+θh)=q¯ <q¯ <Q .
b−h e1 c h
policyis,thevaluefunctiondoesn’tchangeatall.Basedon
this, W(ρ) can be reduced as follows: Hence the optimality is obtained at the point q¯e1 or the
lower bound of Q , i.e.
l
W(ρ) = min{P (q)}+P . (28)
q≥0 c f b−c−e θl
q∗ = max{q¯ ,Q }=max{F−1( θl+θh),Q }.
Hence the myopic solution is optimal for the case of the e1 l b−h l
independent capacity. In other words, the buyer only needs (32)
tofocusonreducingthecurrentperiod’scost.Theresulting
3) Case 3: Q ≤ q¯: In this case, since g (q) and
myopic policy is also optimal in the view of multi-period. h c 1
g (q) are non-increasing for q ∈ [0,Q ], similarly, we
This result is intuitively right. In what follows, we figure 2 l
narrow down our search space to [Q ,+∞). Note that
out what exactly q∗ is, and how other factors influence the l
g (q)+g (q) is equal to equation (31) for Q ∈ [Q ,Q ].
optimal purchasing quantity. 1 2 l h
For q ∈[Q ,+∞), g (q)+g (q) is
Define h 1 (cid:90) 2
(cid:90) q
q
min {[cq−h (q−D)f(D)dD
g (q) := cq−h (q−D)f(D)dD
1 (cid:90) 0 q∈[Qh,+(cid:90)∞+)∞ 0
+∞
+b (D−q)f(D)dD](θ +θ )
+b (D−q)f(D)dD](θ +θ ); l h
l h
q
q
+e(q−Q )θ +e(q−Q )θ }, (33)
l l h h
its convexity in q is straitforward, and
which is convex in q and the solution of the first order
I
g (q) := e(q−Q )θ
2 Ql≤q<Qh l l condition is q¯e2 (defined in equation (29)). Hence the
I
+ [e(q−Q )θ +e(q−Q )θ . optimal purchasing in this case is as follows:
Qh≤q l l h h
JOURNALOFLATEXCLASSFILES,VOL.*,NO.**,*********** 8
Ql Qh Legend
priceisaswellgreaterthanthebiggerpossiblecapacityQ ,
h
scenario(1) q¯c we order up to the larger one between q¯e2 and Qh.
q¯e1 Moreover, when θ =0, i.e. the only possible capacity is
l
scenario(2) Qh, then it is easy to obtain from Theorem 4.1 that
(cid:189)
q¯, if q¯ ≤Q ;
scenario(3) q∗ = c c h
max{q¯ ,Q }, otherwise.
e2 h
Fig.1. IllustrationofpossiblescenariosinTheorem4.1and4.3 Further, if the additional unit purchasing cost e = +∞,
which indicates that it is impossible to make the additional
order, the optimal policy is
1) q¯ > Q : since q¯ > Q , we have q¯ >
e2 h e2 h e1 q∗ =min{q¯,Q }.
Q , which indicates that g (q) + g (q) is non- c h
h 1 2
increasing for q ∈ [Q ,Q ]. Then the optimal Theorem 4.1 characterizes a myopic policy and estab-
l h
purchasing quantity is order-up-to q¯ , i.e., q∗ = lishes a foundation for the subsequent discussions.
e2
q¯ .
e2
2) q¯ > Q and q¯ ≤ Q : it indicates that
e1 h e2 h B. General Case: Purchase More to Observe Supply Ca-
g (q)+g (q)isnon-decreasingforq ∈[Q ,+∞).
1 2 h pacity
q¯ > Q indicates that g (q) + g (q) is non-
e1 h 1 2 Inthissubsection,weconsiderthecaseswherethesupply
increasing for q ∈ [Q ,Q ]. Then the optimal
l h capacity of the next period is correlated to the capacity
purchasing quantity is order-up-to Q , i.e., q∗ =
h of the current period. If capacities are correlated, then the
Q .
h future cost function P has to be taken into consideration.
3) Otherwise q¯ ≤ Q : this is similar to Case 2 f
e1 h First, we need the following lemma.
andtheoptimalpurchasingquantityisorder-up-to
Lemma 4.1: The solution W(ρ) of equation (7) is con-
max{q¯ ,Q }, i.e., q∗ =max{q¯ ,Q }.
e1 l e1 l cave in ρ, i.e.
We conclude the optimal purchasing quantity as follows:
W(βρ+(1−β)ρ˜)≥βW(ρ)+(1−β)W(ρ˜).
(cid:189)
max{q¯ ,Q }, if q¯ >Q ; Proof: Note that the uniqueness of W(ρ) is proved in
q∗ = e2 h e1 h (34)
max{q¯ ,Q }, if q¯ ≤Q . Theorem 3.1. We use the mapping T defined in equation
e1 l e1 h
(15) and prove the lemma by value iteration according
To conclude this subsection, combining 3 cases together, to Wn+1 = TWn. Let W0(ρ) = 0, which is obviously
we have the following theorem: concave in ρ. Suppose Wn(ρ) is concave in ρ, then
Theorem 4.1: Theoptimalpurchasingquantityq∗ forthe
independent capacity case is as follows, Wn+(cid:189)1(cid:90)(βρ+(1−β)ρ˜) (cid:90)
scenarios Conditions Optimal purchasing q∗
= min L(D,q)f(D)dD (βρ(x)+(1−β)ρ˜(x))dx
(1) q¯c <Ql q¯c q≥(cid:90)0
Q ≥q¯ q
(2) andh q¯ >e1Q max{q¯e1,Ql} + e(q−Q)(βρ(Q)+(1−β)ρ˜(Q))dQ
c l 0(cid:90)
(3) Q <q¯ max{q¯ ,Q } q
h e1 e2 h
+α Wn(p(·|Q))(βρ(Q)+(1−β)ρ˜(Q))dQ
where q¯,q¯ ,q¯ are defined by equation (29) and Q ,Q
c e1 e2 l h 0 (cid:90) (cid:190)
are two possible supply capacities with Q > Q . See +∞
h l +αWn( (βρ(ξ)+(1−β)ρ˜(ξ))p(·|ξ)dξ)
Figure 1 for an illustration of the possible scenarios.
q
Comparing with possible supply capacities, when the ≥ min{βT (Wn)(ρ)+(1−β)T (Wn)(ρ˜)}
y y
demand is low (q¯ < Q and q¯ < Q ), we simply q≥0
c l e1 h
purchaseuptotheminimizerq¯ atthenormalpricewithout = βWn+1(ρ)+(1−β)Wn+1(ρ˜). (35)
c
considering the capacity constraint.
Since Wn(ρ) converges to the solution W(ρ), we have the
When the demand is moderate, i.e. the minimizer q¯ of
c desired result. (cid:164)
thenormalpriceisgreaterthanthesmallerpossiblecapacity
Then based on this fact, define
Q , and the minimizer q¯ of the high price is smaller than
l e1
the larger possible capacity Q , we order up to the larger K := αW(θ p(·|Q )+θ p(·|Q ))
h l l l h h
one between q¯ and Q .
e1 l −α[θ W(p(·|Q ))+θ W(p(·|Q ))]
l l h h
When the demand is forecasted to be high, i.e. the cost
> 0, (36)
minimizerq¯ ofthenormalpriceisgreaterthanthesmaller
c
possiblecapacityQ ,andthecostminimizerq¯ ofthehigh where the inequality is due to Lemma 4.1.
l e1
JOURNALOFLATEXCLASSFILES,VOL.*,NO.**,*********** 9
Recall the definition of P , and it is straightforward to Remark 4.4: Whenthedemandis low(Scenario (1)),in
f
conclude the following theorem. order to save the cost-to-go from the next period, the buyer
Theorem 4.2: P (q ) = P (q )−K , for all q ≥ Q , purchases products only up to the smaller possible supply
f m f l l m l
and 0≤q <Q . capacity Q .
l l l
This theorem demonstrates that the buyer may try to pur- Remark 4.5: Whenthedemandislow(Scenario(1)),the
chase more to reduce the future cost at the amount of K . largerthegapisbetweenq¯ andQ ,thehigherthepossible
l c l
ThisisbecausepurchasingmorethanQ leadstothebuyer istoorderuptoq¯,whichisthesameasthemyopicpolicy.
l c
to figure out whether the current period capacity is Q . If it Remark 4.6: When the smaller possible supply capacity
l
isn’tQ ,theonlypossiblevalueofQisQ .Henceordering Q is very low (Scenario (2) and (3)), the optimal policy
l h l
up to Q can fully discover the exact value of the current is the same as the myopic one.
l
period’s capacity, which leads to a reduction of the cost-to-
go function.
C. Numerical Analysis and Results
Sincethecost-to-gofunctionremainsthesamewhenq ∈
[Q ,+∞), we know that the purchasing policy does not Itispossibletore-writeequation(25)inanexplicitform
l
change in scenarios (2) and (3) described in Theorem 4.1. in order to carry out numerical studies. By substituting
Equation (21) and using Equation (24), we have,
When scenario (1) occurs, we need to balance the tradeoff
betweenmyopicandcost-to-gofunction.Definethemyopic W(θI +θ I )=θ W(θlI +I )
cost reduction as l x=Ql h x=Qh h θ x=Ql x=Qh
Z h
q
K := g (Q )+g (Q )−g (q¯)+g (q¯) = min{[cq−h (q−D)f(D)dD
m 1 l 2 l 1 c 2 c q≥0Z 0
= g1(Ql)−g1(q¯c)>0. (37) +∞
+b (D−q)f(D)dD](θ +θ )
l h
The buyer can reduce this myopic cost by ordering up to q
I
thecostminimizerq¯c insteadofthesmallerpossiblesupply + Ql≤q<Qhe(q−Ql)θl
capacity Ql. +IQh≤q[e(q−Ql)θl+e(q−Qh)θh]
Remark 4.1: The larger the gap between Ql and q¯c is, +αI (θ(1−β)W( βl I +I )
the greater the myopic cost reduction Km is. Ql<q l l 1−βl x=Ql x=Qh
Remark 4.2: The greater the second derivative of the +θ β W(1−βhI +I ))
cost function g1(·), i.e. d2dgq12(q), the greater the myopic cost h h βh x=Ql x=Qh
saving Km is. +α(θl(1−„βl)+θhβh) «ff
Then the optimal ordering quantity is ·I W βlβl+θh(1−βh)I +I .
(cid:189) q≤Ql θ(1−β)+θ β x=Ql x=Qh
l l h h
q¯, if K >K ;
q∗ = c m l (38) (39)
Q , if K ≤K .
l m l
Further define
Theorem 4.3: The optimal purchasing quantity q∗ of the
multi-period news-vendor problem with partially observed θ = θl, Φ(x)=W(xI +I ), (40)
two-state Markovian capacity is as follows. θ x=Ql x=Qh
h
Scenarios Conditions Optimal purchasing q∗
(cid:189) and then we obtain
q¯, if K >K ;
(1) q¯ <Q c m l Z
c l Q , if K ≤K . q
l m l Φ(θ) = min [cq−h (q−D)f(D)dD
(2) Qh ≥q¯e1 max{q¯ ,Q } q≥0Z 0
and q¯ >Q e1 l +∞
c l +b (D−q)f(D)dD](θ+1)
(3) Q <q¯ max{q¯ ,Q }
h e1 e2 h q
I
where q¯c,q¯e1,q¯e2 are defined in equation (29), Kl and Km + Ql≤q<Qhe(q−Ql)θ
are defined in equations (36) and (37), respectively. Ql +IQh≤q[e„(q−Ql)θ+e(q−Qh)] «
and Q are two possible supply capacities with Q > Q .
See Fighure 1 for the illustration of the above three phossiblle +αIQl<q θ(1−βl)Φ(1−βlβ )+βhΦ(1−ββh)
„l h «ff
scenarios.
Remark 4.3: When θl = 0, the only possible capacity +α(θ(1−βl)+βh)Iq≤QlΦ θθβ(1l+−(β1)−+ββh) .
l h
is Q . Then K = 0 and the policy is the same as the
h l (41)
one of the basic model, the case of the independent supply
capacity. The myopic policy is also optimal for the multi- Assumption 4.1: Assume a symmetric Markov chain of
period model. capacity: β =β =β.
l h
JOURNALOFLATEXCLASSFILES,VOL.*,NO.**,*********** 10
I
Then we can rewrite (41) as: + [e(q−Q )·2+e(q−Q )]
(cid:189) (cid:90) Qh≤q (cid:181) l h (cid:182)
q 1
Φ(θ) = min [cq−h (q−D)f(D)dD +αIQl<q 2(1−β)Φn(2)+βΦn(2)
q≥0(cid:90) 0 (cid:181) (cid:182)(cid:190)
+∞ +α(2(1−β)+β)I Φn 5 .(44)
+b (D−q)f(D)dD](θ+1) q≤Ql 4
q
+I e(q−Q )θ And let θ = 1, then
+IQl≤q<[eQ(hq−Q )θl+e(q−Q )] 1 2 (cid:189) (cid:90) q
Qh≤q (cid:181) l h (cid:182) Φn+1( ) = min [cq−h (q−D)f(D)dD
+αIQl<q θ(1−β)Φ(1−ββ)+βΦ(1−ββ) 2 q≥0(cid:90) +∞ 0 3
(cid:181) (cid:182)(cid:190) +b (D−q)f(D)dD]·
I θβ+(1−β) 2
+α(θ(1−β)+β) Φ . q
q≤Ql θ(1−β)+β 1
I
+ e(q−Q )·
(42) Ql≤q<Qh l 2
1
Let β = 2,h = 1,c = 2,b = 4,e = 1,Q = 12,Q = +I [e(q−Q )· +e(q−Q )]
3 l h Qh≤q l 2 h
15,α = 0.95, demand D subjects to a normal distribution (cid:181) (cid:182)
1 1
with mean µ = 10 and variance σ = 1. Then we find +αI (1−β)Φn(2)+βΦn( )
Ql<q 2 2
that q¯c = 10.43 < Ql (Here we only do analysis of this (cid:181) (cid:182)(cid:190)
1 4
case because other cases’ policy is the same as the myopic +α( (1−β)+β)I Φn .
inventory decision according to Remark 4.6). 2 q≤Ql 5
We use value-iteration method to solve this equation. (45)
Start with Φ0(θ)=0 and
Also, let θ =1 and we have
Φn+1(θ)
Z
q (cid:189) (cid:90)
= min [cq−h (q−D)f(D)dD q
q≥0Z +∞ 0 Φn+1(1) = mq≥in0(cid:90) [cq−h 0 (q−D)f(D)dD
+b (D−q)f(D)dD](θ+1) +∞
q +b (D−q)f(D)dD]·2
I
+ e(q−Q)θ q
Ql≤q<Qh l I
I + e(q−Q )
+ Qh≤q[e„(q−Ql)θ+e(q−Qh)] « IQl≤q<Qh l
+αI θ(1−β)Φn( β )+βΦn(1−β) + Qh≤q[e(cid:181)(q−Ql)+e(q−Qh)] (cid:182)
Ql<q 1−β β 1
„ «ff +αI (1−β)Φn(2)+βΦn( )
+α(θ(1−β)Z+β)Iq≤QlΦn θθβ(1+−(β1)−+ββ) , +α((Q1l<−qβ)+β)I Φn(1)(cid:111). 2 (46)
q q≤Ql
= min [cq−h (q−D)f(D)dD
q≥0Z +∞ 0 Note that Φ(45),Φ(45) and Φ(1) appear in equations
(44), (45), and (46), respectively. In order to carry out the
+b (D−q)f(D)dD](θ+1)
q iteration, we make the following approximations:
I I
+ Ql≤q<Qhe(q−Ql)θ+ Qh≤q[e(q−Ql)θ Φ(1)≈Φ(4)≈Φ(5).
+e(q−Qh„)] « 5 4
+αI θ(1−β)Φn(2)+βΦn(1) Then combining equations (44), (45), and (46) together,
Ql<q „ 2 «ff starting with Φ0(1) = 0,Φ0(2) = 0 and Φ0(1) = 0, we
2
+α(θ(1−β)+β)I Φn 2θ+1 . (43) solve the equation numerically and obtain
q≤Ql θ+2
1
Φ( )=416.8<Φ(1)=547.6<Φ(2)=756.8, (47)
2
Inordertocarryouttheiterationprocess,letθ =2,then
(cid:189) (cid:90) q where we find out that Φ(1) ≥ 2Φ(1)+ 1Φ(2) (demon-
3 2 3
Φn+1(2) = min [cq−h (q−D)f(D)dD strating concavity), and the optimal purchasing
q≥0(cid:90) 0
+∞ q∗| =12, q∗| =12, q∗| =10.43. (48)
θ=1 θ=1 θ=2
+b (D−q)f(D)dD]·3 2
q With the above calculation, the optimal purchasing is Q =
l
+IQl≤q<Qhe(q−Ql)·2 12orq¯c =10.43whenq¯c <Ql.Andalsothelargerθ = θθhl
Description:May 23, 2013 http://publicaa.ansi.org/sites/apdl/NESCCDocs/Forms/welding.aspx . Since AWS D1.1 allows the use of other standards for D1.1 to allow the engineer to accept WPS qualifications or performance qualifications done in.