By T. N. E. GREVILLE
Bureau of the Census
(Annals of mathematical statistics,
1941, 12, 350-354)
This paper considers the matching of two decks of cards of arbitrary
composition, and the complete frequency distribution of correct matchings
is obtained, thus solving a problem proposed by Stevens. It is also shown
that the results can be interpreted in terms of a contingency table.
Generalizing a problem considered by Greenwood (1938) let us consider the
matching of two decks of cards consisting of t distinct kinds, all the cards of
each kind being identical.
|
The first or "call" deck will be composed of i1 cards of the first kind, i2 of the second, etc., such that
and the second or "target" deck will contain j1 cards of the first
kind, j2 of the second, etc., such that
Any of the i's or j's may be zero. It is
desired to calculate, for a given arrangement of the "call" deck,
the number of possible arrangements of the "target" deck which will
produce exactly r matchings between them (r = 0, 1, 2, .. . , n). It is clear that these frequencies are independent
of the arrangement of the call deck. For convenience the call deck may be
thought of as arranged so that all the cards of the first kind come first,
followed by all those of the second kind, and so on.
Let us consider the number of arrangements of the target deck which will match the cards in the k1th, k2th, … , ksth positions in the call deck, regardless of whether or not matchings occur elsewhere. Let the cards in these s positions in the call deck consist of c1 of the first kind, c2 of the second, etc. Then:
The number of such arrangements of the target deck is
For fixed values of the c's, the s specified positions may be selected in
ways.
Consider now the expression
obtained by summing the product of
(1) and (2) over all sets of values of the numbers satisfying
the conditions
.
Let W, denote the number of arrangements
of the target deck which result in exactly s matchings. Then it is evident that V, exceeds W, since
the former includes those arrangements which give more than s matchings, and these, moreover, are counted more than once. Consider
an arrangement which produces it matchings, where u > s. Such an arrangement will be counted once in V for every set
of s
matchings which can be selected from the total of u- that is times.
In other words,
It has been shown (Geiringer, 1938) that the solution of these equations is
Equations (3) and (4) apparently give the solution of the problem, but in practice the
labor of carrying out the summation indicated in (3) would often be very great. However,(3) may be rewritten in the form
where
It will be seen that Hs is the coefficient of in
the product
where denotes
the smaller of ih
and jh . The factor
was
included in Hs in order to make the coefficient in the polynomials of (6)
always integers. Equation (4) may now be written in the form
or
(7)
a form which lends itself to actual computation.
The factorial moments of the frequency
distribution of the number of matchings are easy to compute. Let ms. denote the sth factorial moment, so that
Substituting from (4)
Reversing the order of summation and simplifying,
Hence,
(9)
and from (5) and (8),
(10)
From (6)
(11)
and
(12)
Hence the mean number of matchings is
The variance m2
is
or
In the special case these
formulae become
These formulae have previously
been given by Stevens, and those for the special case also by Greenwood. The
maximal conditions for the variance, given by Greenwood for this particular
case, apparently can not be put in a simple form for the general case.
6. Unequal decks.
Suppose the call deck contains m cards, m < n, and is to be matched with m cards selected from the target deck. It can be assumed without loss of generality that the first m cards in any arrangement of the target deck are the ones to be used. The formulae of this paper can be applied to this more general problem by the expedient of imagining n - m blank cards to be added at the end of the call deck and regarding these as an additional kind. It is thus apparent that formulae (13) and (14) apply without modification to this altered situation.
Stevens has considered the distribution
of entries in a contingency table with fixed marginal totals, and has pointed
out that the problem of matching two decks of cards may be dealt with from
that standpoint. A contingency table classifies data into n columns and m rows, and we may consider the row as indicating
the kind of card which occupies a given position in the call deck, the columns
having the same function with respect to the target deck. Stevens defines
a quantity c as the sum of entries in a prescribed set of cells, subject to the
condition that no two cells of the set are in the same row or column, and
mentions as unsolved the problem of the exact sampling distribution of c.
We now have at our disposal the machinery for solving this problem. Following
Stevens's notation, let denote
the fixed row totals and
. the
fixed column totals, while
denotes
the frequency of the cell in the rth row and the sth
column. Then, let
where
does
not exceed either m
or n. Imagine two decks of N cards
, the
first containing a1 cards of one kind, a2 of another, etc., and the
second containing b1 cards of one kind, b2 of another, etc. Moreover,
let the rhth kind in the first deck and the shth kind in the second deck be the same kind (h = 1, 2, - … , l), the other kinds being all different. Evidently
c is the number of matchings between
the two decks. Hence, the methods of this paper can be used to obtain the
distribution of c. The formulae we have obtained agree with those for the expected value
and variance of c given by Stevens.
H. GEIRINGER, Annals of Math. Stat., Vol. 9 (1938), p. 262.
J. A. GREENWOOD, Annals of Math. Stat., Vol. 9 (1938), pp. 56-59.
W. L. STEVENS, Annals of Eugenics, Vol. 8 (1937), pp. 238-244.
W. L. STEVENS, Annals of Eugenics, loc. cit., Psychol. Review, Vol. 46 (1939), pp. 142-150.