tohokuaikiのチラシの裏

技術的ネタとか。

HTML4のエンティティの数字参照と文字実体参照の一覧

DTDが欲しかったのだけど、見つけられ無かったので書いた。

http://www.w3.org/TR/html4/sgml/entities.html
の記事を

<?php
$regexp = '@<!ENTITY\s+(\w+)\s+CDATA\s+"&#(\d+);"[^>]+>@smU';
preg_match_all($regexp, $ここにさっきのURLの記事, $m);

で抜き出して整形しただけ

数字参照(10進数) 数字参照(16進数) 文字実体参照
160 a0 nbsp
161 a1 iexcl
162 a2 cent
163 a3 pound
164 a4 curren
165 a5 yen
166 a6 brvbar
167 a7 sect
168 a8 uml
169 a9 copy
170 aa ordf
171 ab laquo
172 ac not
173 ad shy
174 ae reg
175 af macr
176 b0 deg
177 b1 plusmn
178 b2 sup2
179 b3 sup3
180 b4 acute
181 b5 micro
182 b6 para
183 b7 middot
184 b8 cedil
185 b9 sup1
186 ba ordm
187 bb raquo
188 bc frac14
189 bd frac12
190 be frac34
191 bf iquest
192 c0 Agrave
193 c1 Aacute
194 c2 Acirc
195 c3 Atilde
196 c4 Auml
197 c5 Aring
198 c6 AElig
199 c7 Ccedil
200 c8 Egrave
201 c9 Eacute
202 ca Ecirc
203 cb Euml
204 cc Igrave
205 cd Iacute
206 ce Icirc
207 cf Iuml
208 d0 ETH
209 d1 Ntilde
210 d2 Ograve
211 d3 Oacute
212 d4 Ocirc
213 d5 Otilde
214 d6 Ouml
215 d7 times
216 d8 Oslash
217 d9 Ugrave
218 da Uacute
219 db Ucirc
220 dc Uuml
221 dd Yacute
222 de THORN
223 df szlig
224 e0 agrave
225 e1 aacute
226 e2 acirc
227 e3 atilde
228 e4 auml
229 e5 aring
230 e6 aelig
231 e7 ccedil
232 e8 egrave
233 e9 eacute
234 ea ecirc
235 eb euml
236 ec igrave
237 ed iacute
238 ee icirc
239 ef iuml
240 f0 eth
241 f1 ntilde
242 f2 ograve
243 f3 oacute
244 f4 ocirc
245 f5 otilde
246 f6 ouml
247 f7 divide
248 f8 oslash
249 f9 ugrave
250 fa uacute
251 fb ucirc
252 fc uuml
253 fd yacute
254 fe thorn
255 ff yuml
402 192 fnof
913 391 Alpha
914 392 Beta
915 393 Gamma
916 394 Delta
917 395 Epsilon
918 396 Zeta
919 397 Eta
920 398 Theta
921 399 Iota
922 39a Kappa
923 39b Lambda
924 39c Mu
925 39d Nu
926 39e Xi
927 39f Omicron
928 3a0 Pi
929 3a1 Rho
931 3a3 Sigma
932 3a4 Tau
933 3a5 Upsilon
934 3a6 Phi
935 3a7 Chi
936 3a8 Psi
937 3a9 Omega
945 3b1 alpha
946 3b2 beta
947 3b3 gamma
948 3b4 delta
949 3b5 epsilon
950 3b6 zeta
951 3b7 eta
952 3b8 theta
953 3b9 iota
954 3ba kappa
955 3bb lambda
956 3bc mu
957 3bd nu
958 3be xi
959 3bf omicron
960 3c0 pi
961 3c1 rho
962 3c2 sigmaf
963 3c3 sigma
964 3c4 tau
965 3c5 upsilon
966 3c6 phi
967 3c7 chi
968 3c8 psi
969 3c9 omega
977 3d1 thetasym
978 3d2 upsih
982 3d6 piv
8226 2022 bull
8230 2026 hellip
8242 2032 prime
8243 2033 Prime
8254 203e oline
8260 2044 frasl
8472 2118 weierp
8465 2111 image
8476 211c real
8482 2122 trade
8501 2135 alefsym
8592 2190 larr
8593 2191 uarr
8594 2192 rarr
8595 2193 darr
8596 2194 harr
8629 21b5 crarr
8656 21d0 lArr
8657 21d1 uArr
8658 21d2 rArr
8659 21d3 dArr
8660 21d4 hArr
8704 2200 forall
8706 2202 part
8707 2203 exist
8709 2205 empty
8711 2207 nabla
8712 2208 isin
8713 2209 notin
8715 220b ni
8719 220f prod
8721 2211 sum
8722 2212 minus
8727 2217 lowast
8730 221a radic
8733 221d prop
8734 221e infin
8736 2220 ang
8743 2227 and
8744 2228 or
8745 2229 cap
8746 222a cup
8747 222b int
8756 2234 there4
8764 223c sim
8773 2245 cong
8776 2248 asymp
8800 2260 ne
8801 2261 equiv
8804 2264 le
8805 2265 ge
8834 2282 sub
8835 2283 sup
8836 2284 nsub
8838 2286 sube
8839 2287 supe
8853 2295 oplus
8855 2297 otimes
8869 22a5 perp
8901 22c5 sdot
8968 2308 lceil
8969 2309 rceil
8970 230a lfloor
8971 230b rfloor
9001 2329 lang
9002 232a rang
9674 25ca loz
9824 2660 spades
9827 2663 clubs
9829 2665 hearts
9830 2666 diams
34 22 quot
38 26 amp
60 3c lt
62 3e gt
338 152 OElig
339 153 oelig
352 160 Scaron
353 161 scaron
376 178 Yuml
710 2c6 circ
732 2dc tilde
8194 2002 ensp
8195 2003 emsp
8201 2009 thinsp
8204 200c zwnj
8205 200d zwj
8206 200e lrm
8207 200f rlm
8211 2013 ndash
8212 2014 mdash
8216 2018 lsquo
8217 2019 rsquo
8218 201a sbquo
8220 201c ldquo
8221 201d rdquo
8222 201e bdquo
8224 2020 dagger
8225 2021 Dagger
8240 2030 permil
8249 2039 lsaquo
8250 203a rsaquo
8364 20ac euro

ついでにDTD

<!ENTITY nbsp "&#160;">
<!ENTITY iexcl "&#161;">
<!ENTITY cent "&#162;">
<!ENTITY pound "&#163;">
<!ENTITY curren "&#164;">
<!ENTITY yen "&#165;">
<!ENTITY brvbar "&#166;">
<!ENTITY sect "&#167;">
<!ENTITY uml "&#168;">
<!ENTITY copy "&#169;">
<!ENTITY ordf "&#170;">
<!ENTITY laquo "&#171;">
<!ENTITY not "&#172;">
<!ENTITY shy "&#173;">
<!ENTITY reg "&#174;">
<!ENTITY macr "&#175;">
<!ENTITY deg "&#176;">
<!ENTITY plusmn "&#177;">
<!ENTITY sup2 "&#178;">
<!ENTITY sup3 "&#179;">
<!ENTITY acute "&#180;">
<!ENTITY micro "&#181;">
<!ENTITY para "&#182;">
<!ENTITY middot "&#183;">
<!ENTITY cedil "&#184;">
<!ENTITY sup1 "&#185;">
<!ENTITY ordm "&#186;">
<!ENTITY raquo "&#187;">
<!ENTITY frac14 "&#188;">
<!ENTITY frac12 "&#189;">
<!ENTITY frac34 "&#190;">
<!ENTITY iquest "&#191;">
<!ENTITY Agrave "&#192;">
<!ENTITY Aacute "&#193;">
<!ENTITY Acirc "&#194;">
<!ENTITY Atilde "&#195;">
<!ENTITY Auml "&#196;">
<!ENTITY Aring "&#197;">
<!ENTITY AElig "&#198;">
<!ENTITY Ccedil "&#199;">
<!ENTITY Egrave "&#200;">
<!ENTITY Eacute "&#201;">
<!ENTITY Ecirc "&#202;">
<!ENTITY Euml "&#203;">
<!ENTITY Igrave "&#204;">
<!ENTITY Iacute "&#205;">
<!ENTITY Icirc "&#206;">
<!ENTITY Iuml "&#207;">
<!ENTITY ETH "&#208;">
<!ENTITY Ntilde "&#209;">
<!ENTITY Ograve "&#210;">
<!ENTITY Oacute "&#211;">
<!ENTITY Ocirc "&#212;">
<!ENTITY Otilde "&#213;">
<!ENTITY Ouml "&#214;">
<!ENTITY times "&#215;">
<!ENTITY Oslash "&#216;">
<!ENTITY Ugrave "&#217;">
<!ENTITY Uacute "&#218;">
<!ENTITY Ucirc "&#219;">
<!ENTITY Uuml "&#220;">
<!ENTITY Yacute "&#221;">
<!ENTITY THORN "&#222;">
<!ENTITY szlig "&#223;">
<!ENTITY agrave "&#224;">
<!ENTITY aacute "&#225;">
<!ENTITY acirc "&#226;">
<!ENTITY atilde "&#227;">
<!ENTITY auml "&#228;">
<!ENTITY aring "&#229;">
<!ENTITY aelig "&#230;">
<!ENTITY ccedil "&#231;">
<!ENTITY egrave "&#232;">
<!ENTITY eacute "&#233;">
<!ENTITY ecirc "&#234;">
<!ENTITY euml "&#235;">
<!ENTITY igrave "&#236;">
<!ENTITY iacute "&#237;">
<!ENTITY icirc "&#238;">
<!ENTITY iuml "&#239;">
<!ENTITY eth "&#240;">
<!ENTITY ntilde "&#241;">
<!ENTITY ograve "&#242;">
<!ENTITY oacute "&#243;">
<!ENTITY ocirc "&#244;">
<!ENTITY otilde "&#245;">
<!ENTITY ouml "&#246;">
<!ENTITY divide "&#247;">
<!ENTITY oslash "&#248;">
<!ENTITY ugrave "&#249;">
<!ENTITY uacute "&#250;">
<!ENTITY ucirc "&#251;">
<!ENTITY uuml "&#252;">
<!ENTITY yacute "&#253;">
<!ENTITY thorn "&#254;">
<!ENTITY yuml "&#255;">
<!ENTITY fnof "&#402;">
<!ENTITY Alpha "&#913;">
<!ENTITY Beta "&#914;">
<!ENTITY Gamma "&#915;">
<!ENTITY Delta "&#916;">
<!ENTITY Epsilon "&#917;">
<!ENTITY Zeta "&#918;">
<!ENTITY Eta "&#919;">
<!ENTITY Theta "&#920;">
<!ENTITY Iota "&#921;">
<!ENTITY Kappa "&#922;">
<!ENTITY Lambda "&#923;">
<!ENTITY Mu "&#924;">
<!ENTITY Nu "&#925;">
<!ENTITY Xi "&#926;">
<!ENTITY Omicron "&#927;">
<!ENTITY Pi "&#928;">
<!ENTITY Rho "&#929;">
<!ENTITY Sigma "&#931;">
<!ENTITY Tau "&#932;">
<!ENTITY Upsilon "&#933;">
<!ENTITY Phi "&#934;">
<!ENTITY Chi "&#935;">
<!ENTITY Psi "&#936;">
<!ENTITY Omega "&#937;">
<!ENTITY alpha "&#945;">
<!ENTITY beta "&#946;">
<!ENTITY gamma "&#947;">
<!ENTITY delta "&#948;">
<!ENTITY epsilon "&#949;">
<!ENTITY zeta "&#950;">
<!ENTITY eta "&#951;">
<!ENTITY theta "&#952;">
<!ENTITY iota "&#953;">
<!ENTITY kappa "&#954;">
<!ENTITY lambda "&#955;">
<!ENTITY mu "&#956;">
<!ENTITY nu "&#957;">
<!ENTITY xi "&#958;">
<!ENTITY omicron "&#959;">
<!ENTITY pi "&#960;">
<!ENTITY rho "&#961;">
<!ENTITY sigmaf "&#962;">
<!ENTITY sigma "&#963;">
<!ENTITY tau "&#964;">
<!ENTITY upsilon "&#965;">
<!ENTITY phi "&#966;">
<!ENTITY chi "&#967;">
<!ENTITY psi "&#968;">
<!ENTITY omega "&#969;">
<!ENTITY thetasym "&#977;">
<!ENTITY upsih "&#978;">
<!ENTITY piv "&#982;">
<!ENTITY bull "&#8226;">
<!ENTITY hellip "&#8230;">
<!ENTITY prime "&#8242;">
<!ENTITY Prime "&#8243;">
<!ENTITY oline "&#8254;">
<!ENTITY frasl "&#8260;">
<!ENTITY weierp "&#8472;">
<!ENTITY image "&#8465;">
<!ENTITY real "&#8476;">
<!ENTITY trade "&#8482;">
<!ENTITY alefsym "&#8501;">
<!ENTITY larr "&#8592;">
<!ENTITY uarr "&#8593;">
<!ENTITY rarr "&#8594;">
<!ENTITY darr "&#8595;">
<!ENTITY harr "&#8596;">
<!ENTITY crarr "&#8629;">
<!ENTITY lArr "&#8656;">
<!ENTITY uArr "&#8657;">
<!ENTITY rArr "&#8658;">
<!ENTITY dArr "&#8659;">
<!ENTITY hArr "&#8660;">
<!ENTITY forall "&#8704;">
<!ENTITY part "&#8706;">
<!ENTITY exist "&#8707;">
<!ENTITY empty "&#8709;">
<!ENTITY nabla "&#8711;">
<!ENTITY isin "&#8712;">
<!ENTITY notin "&#8713;">
<!ENTITY ni "&#8715;">
<!ENTITY prod "&#8719;">
<!ENTITY sum "&#8721;">
<!ENTITY minus "&#8722;">
<!ENTITY lowast "&#8727;">
<!ENTITY radic "&#8730;">
<!ENTITY prop "&#8733;">
<!ENTITY infin "&#8734;">
<!ENTITY ang "&#8736;">
<!ENTITY and "&#8743;">
<!ENTITY or "&#8744;">
<!ENTITY cap "&#8745;">
<!ENTITY cup "&#8746;">
<!ENTITY int "&#8747;">
<!ENTITY there4 "&#8756;">
<!ENTITY sim "&#8764;">
<!ENTITY cong "&#8773;">
<!ENTITY asymp "&#8776;">
<!ENTITY ne "&#8800;">
<!ENTITY equiv "&#8801;">
<!ENTITY le "&#8804;">
<!ENTITY ge "&#8805;">
<!ENTITY sub "&#8834;">
<!ENTITY sup "&#8835;">
<!ENTITY nsub "&#8836;">
<!ENTITY sube "&#8838;">
<!ENTITY supe "&#8839;">
<!ENTITY oplus "&#8853;">
<!ENTITY otimes "&#8855;">
<!ENTITY perp "&#8869;">
<!ENTITY sdot "&#8901;">
<!ENTITY lceil "&#8968;">
<!ENTITY rceil "&#8969;">
<!ENTITY lfloor "&#8970;">
<!ENTITY rfloor "&#8971;">
<!ENTITY lang "&#9001;">
<!ENTITY rang "&#9002;">
<!ENTITY loz "&#9674;">
<!ENTITY spades "&#9824;">
<!ENTITY clubs "&#9827;">
<!ENTITY hearts "&#9829;">
<!ENTITY diams "&#9830;">
<!ENTITY quot "&#34;">
<!ENTITY amp "&#38;">
<!ENTITY lt "&#60;">
<!ENTITY gt "&#62;">
<!ENTITY OElig "&#338;">
<!ENTITY oelig "&#339;">
<!ENTITY Scaron "&#352;">
<!ENTITY scaron "&#353;">
<!ENTITY Yuml "&#376;">
<!ENTITY circ "&#710;">
<!ENTITY tilde "&#732;">
<!ENTITY ensp "&#8194;">
<!ENTITY emsp "&#8195;">
<!ENTITY thinsp "&#8201;">
<!ENTITY zwnj "&#8204;">
<!ENTITY zwj "&#8205;">
<!ENTITY lrm "&#8206;">
<!ENTITY rlm "&#8207;">
<!ENTITY ndash "&#8211;">
<!ENTITY mdash "&#8212;">
<!ENTITY lsquo "&#8216;">
<!ENTITY rsquo "&#8217;">
<!ENTITY sbquo "&#8218;">
<!ENTITY ldquo "&#8220;">
<!ENTITY rdquo "&#8221;">
<!ENTITY bdquo "&#8222;">
<!ENTITY dagger "&#8224;">
<!ENTITY Dagger "&#8225;">
<!ENTITY permil "&#8240;">
<!ENTITY lsaquo "&#8249;">
<!ENTITY rsaquo "&#8250;">
<!ENTITY euro "&#8364;">