|
U.S. Patent and Trademark Office Information Products Division Data Dissemination Trademark Daily XML Migration |
July 25, 2003
Trademark Daily XML Files - Weekly Status Report
~~~
Please Note: The Trademark Weekly Text File (TWTF) and Trademark Daily XML File (TDXF) Transition Plan Milestones and Dates were provided in the Weekly Status Report of July 18, 2003. Since then inquiries have been received concerning information presented in that plan.
These inquiries are being addressed.Answers will be provided in next weeks Trademark Weekly Status Report dated August 1, 2003.
~~~
There were 3 new inquiries since the last Weekly Status Report of July 18, 2003.Reference New Inquiries below.
Inquiries can be made to: Ed Johnson at Ed.Johnson@uspto.gov - (703) 306-2621 or Marva Dubar at Marva.Dubar@uspto.gov - (703) 305-1669 or sent to OEIP@uspto.gov.
The following is a status update on outstanding items and new inquiries.
Inquiries that were previously considered outstanding and have been resolved will have the resolution in black and bold.
Any inquiries that require additional research and/or response are considered outstanding items and will appear in red and italicized.
~~~Outstanding Items:
Inquiry - 5/23/2003
Problems exist over the use of the 'Section Sign' character inside the TTAB xml dated May 15, 2003. I did an UNIX command "od -hc" to dump the contents of the file so I could see what you are sending it as (247) which is causing the SAX parser to error. I think the character should be §.
In XML, there are only five predefined character entities, as follows:
Character Entity Reference Decimal Hexadecimal
< < < <
> > > >
& & & &
" " " "
' ' ' '
Substituting a character entity reference for a character is REQUIRED by W3C for < and & in all cases where these characters are not markup. It's good practice to do it for the other three as well. That means, wherever these five characters are found in content or in comments, they should be replaced with the corresponding character entity.
Entity declarations will be made for other characters that are included in the trademark xml data according to the W3C Entity Reference recommendation.
Please Note: The above characters entities are awaiting official notification to be implemented.
~~~
Inquiry 6/04/2003
Documentation
Our review of the "Trademark Trial and Appeal Board XML DTD Element Mapping Documentation [v .8]" (the most recent version that we are aware of) has found the following problems:
XML Elements - At least four elements that are included in the DTD and in the files that we have received -- <code-type>, <data-available-code>, <day-in-location>, and <int-attorney-number> are not listed as XML Elements and are completely excluded from this document.
TWTF Elements - Despite the use of the word "mapping" in the title, there is none. Most of the items/terminology in the "TWTF Elements" bear no relation to the TWTF document whatsoever. The TWTF includes none of the following terms or equivalent data elements or many others -- Proceeding Information, Proceeding Entry, Proceeding Type, Role, or Country. What appears in the second column of that document is merely an explanation of the XML tag.
Descriptions -- This column contains the only explanatory information in the document. Unhappily much of it is incorrect or incomplete, making it generally unreliable.
· <status-text> is described as the literal that corresponds to the <status-code>. It is, in fact, nothing of the sort, but rather a variety of notes and notations like the following:
A IS 124817
ADDRESS NOT CHANGED; NEED REV & P/A
cl 9, 45, 41, 35
CONSOL W/91125739 & 91152596. NOT PARENT.
EXHIBIT IS ATTACHED TO THIS PAPER AS EXHIBIT A.
EXHIBITS
GRANTED UNTIL 01/26/03
HOLD 30
KL
PARENT
PLEASE PREPARE COMMISSIONER'S ORDER
READY FOR FINAL DECISION
SERVICE BY PUBThe stated maximum length of data bears no resemblance to the data that actually appears. For example <name> data that is described as "250 position" field is truncated at 40 characters.
The <country> codes in use are not the 2 position WIPO Standard ST.3 codes that were previously furnished us, but the 3 position FIPS codes that have always been used in the TWTF.
The definition for <orgname> includes two elements (appearing below in bold type) that simply do not exist.
Organization and orgname are overflow fields for the Party Name. Instead of saying Party Name Line 1, Party Name Line 2 and Party Name Line 3, we use Name, Company, and Organization to help our users divide/organize party names the (sic) are extremely long.The documentation for TTAB is being properly updated and will also include the proper mapping back to the TWTF process.
~~~
Inquiry 6/05/2003
The Trademark Daily XML Process Documentation for Trademark Assignments XML DTD points to the old version (0.1) and not the current (0.2) version.
The Trademark Assignments XML Documentation to be updated with the correct Assignments DTD version number.
~~~Inquiry – 6/06/2003
In reviewing the country codes for each of the 3 XML files and discovered the following
*Trademark-Applications XML
Uses 3 digit code from TWTF file
*Trademark-Assignments XML
Uses no codes at all, they expand all codes (Spelling out countries)
*Trademark-Proceedings XML
Uses officially designated country as prescribed by the World Intellectual Property Organization (WIPO) Standard ST.3
This difference has been presented to management and a decision to adhere to the WIPO Standard ST. 3 should be implemented with the Madrid Protocol.
Also, if the decision is made to use the WIPO Standard ST. 3 changes would be required to have a separate field for the country code and a separate field for the state code.
~~~
Inquiry - 6/09/2003
After analyzing the most recent Trademark Daily XML TTAB DTD related data files, we found the following issues:
1. The <filing-date> field (which is part of the <proceeding-entry> tag) does not always have the correct value. Here are some examples of this issue:
a. For the Proceeding Number 92042024, which can be found in last Tuesday's (June 3) TWTF file, the value of the DT-FIL field (which is located in the TWTF TTAB record) is "20030310". In the TTAB data file called TT030529.xml, which contains the most up-to-date version of this TTAB Proceeding, the value of the <filing-date> field is "20030529". This is incorrect since this Proceeding was filed on March 10th, 2003.
b. For the Proceeding Number 92042025, which can be found in last Tuesday's (June 3) TWTF file, the value of the DT-FIL field (which is located in the TWTF TTAB record) is "20030430". In the TTAB data file called TT030529.xml, which contains the most up-to-date version of this TTAB Proceeding, the value of the <filing-date> field is "20030529". This is incorrect since this Proceeding was filed on April 30th, 2003.
c. For the Proceeding Number 92042026, which can be found in last Tuesday's (June 3) TWTF file, the value of the DT-FIL field (which is located in the TWTF TTAB record) is "20030424". In the TTAB data file called TT030529.xml, which contains the most up-to-date version of this TTAB Proceeding, the value of the <filing-date> field is "20030529". This is incorrect since this Proceeding was filed on April 24th, 2003.
This was reported in the 6/27/2003 Status Report as being corrected. A correction is planned to take place at the end of September 2003.
2. The <status-update-date> field (which is part of the <proceeding-entry> tag) does not always have the most up-to-date value after the value of the <status-code> field (which is also part of the <proceeding-entry> tag) changes. Here are some examples of this issue:
a. For the Proceeding Number 91154190, which can be found in last Tuesday's (June 3) TWTF file, the value of the DT-STAT field (which is located in the TWTF TTAB record) is "20030529" and the value of the STAT field (which is also located in the TWTF TTAB record) is "9" (Terminated). In the TTAB data file called TT030528.xml, the value of the <status-update-date> field is "20030103" and the value of the <status-code> field is "2" (Pending) for this TTAB Proceeding. In the TTAB data file called TT030529.xml, which contains the most up-to-date version of this TTAB Proceeding, the value of the <status-code> field is changed to "9" (Terminated), but the value of the <status-update-date> field remains the same ("20030103") for some reason. Instead, this field should have the value "20030529" just like in last Tuesday's (June 3) TWTF file.
b. For the Proceeding Number 91154593, which can be found in last Tuesday's (June 3) TWTF file, the value of the DT-STAT field (which is located in the TWTF TTAB record) is "20030529" and the value of the STAT field (which is also located in the TWTF TTAB record) is "9" (Terminated). In the TTAB data file called TT030528.xml, the value of the <status-update-date> field is "20030122" and the value of the <status-code> field is "2" (Pending) for this TTAB Proceeding. In the TTAB data file called TT030529.xml, which contains the most up-to-date version of this TTAB Proceeding, the value of the <status-code> field is changed to "9" (Terminated), but the value of the <status-update-date> field remains the same ("20030122") for some reason. Instead, this field should have the value "20030529" just like in last Tuesday's (June 3) TWTF file.
The value of the <status-update-date> is in error and a correction is planned to take place at the end of September 2003.
~~~
Inquiry – 6/20/2003
The following analysis was conducted by our Database Maintenance staff. It's conclusion, that updates have been made that are reflected on TARR but were never included as updates on the weekly tapes is disturbing. Please respond as to how this discrepancy came to pass, and if possible, some estimate of how frequently this may occur:
We recently ran across a case of a Federal trademark record that contained affidavit information (viewable on TARR) that was not present in our record. Upon analysis, it was determined that the information on filing of said affidavit was never supplied to us on our weekly tape. The record in question was registration 2139349. When viewed in TARR, the record indicates that a Section 8 and 15 has been filed. Our record does not contain such an indication.
Normally, these filings are indicated via flags in the TWTF GENX record . The flags in question (FS8F and FS15F) should indicate T when a filing has occurred. Indications are that although the Sec. 8 & 15 were filed in March, we have not received a record update from the PTO for this record since February of 1998, when the status was changed to Renewed. In the prosecution history in TARR, the filing is indicated (2003-03-20) We have inferred from this discrepancy that not every entry in the PTO's record 'prosecution history' is recorded in our data.
For purposes of equivalency between TARR/PTO data and our current data, it is important for us to ascertain why this filing didn't trigger a record update in TWTF. Is there a reason why we didn't receive this filing?
After an investigation it has been determined that the Trademark Weekly Text File and also the Trademark Daily XML file extracts data according to: 1. What is present in the Trademark weekly OG. 2. New Applications and 3. Modifications to existing records.
TARR contains everything that pertains to all Trademark transactions. The criteria for excluding data from the Trademark Daily XML file must be further investigated.
~~~
Inquiry - 7/2/2003
In an email dated May 21, 2003 'TRADEMARK DAILY XML MAY 20, 2003 ANNOUNCEMENT' an issue was raised and answered about TTAB proceeding-address fields. The answer supplied by the PTO states that the address-information contained currently in the TWTF/PARC Correspondent Information would populate the address-information (proceeding-address).
So to clarify what this means is we will no longer receive address information for particular owners within the TTAB action. This seems like we may be taking a step backwards with the XML feed. Using Proceeding Number 92042153 which is included in the weekly TWTF of July 1 2003 and the daily file of tt030624 as an example:
In the TWTF full address information including the Correspondent for both parties included in this action.
In the daily XML file we only receive the (party) name and orgname for role-code = D (where the orgname is overflow of the name) and the address-information of the type-code = C. Also for (party) name for role-code = P and the address-information for type-code = C.
With XML we are loosing the address information of the specific parties involved.
This is indeed how it was stated that it would be, but we just wanted to point out that the TWTF contains more data elements which are being lost with the XML feeds.
This inquiry will be further investigated.
~~~
July 03, 2003
Poorly formatted addresses in XML
You are trying to fit unstructured data into a structured format, I propose you add an address-2 tag to hold the data in cases like this.
-< proceeding-address >
< identifier >357358</ identifier >
< type-code >C </ type-code >
< name >DOUGLAS W SPRINKLE </ name >
< orgname > GIFFORD KRASS GROH SPRINKLE ANDERSON & C </orgname > THIS WAS CUT OFF SHOULD BE<orgname>GIFFORD KRASS GROH SPRINKLE ANDERSON & CITKOWSKI, P.C.</orgname>
<address-1> 280 N OLD WOODWARD SUITE 400</address-1>
<city> BIRMINGHAM MICHIG </ city >
< state > AN</ state > THIS IS THE TAIL END OF THE ABOVE TAG
<postcode >48009</>
</proceeding-address >-<proceeding-address>
< identifier>384315</identifier>
<type-code>C</type-code>
<name>EDGAR A. ZARINS</name>
<orgname>MASCO CORPORATION</orgname>
<address-1>21001 VAN BORN ROAD< address-1>
<city>TAYLOR MICHIG</city >
<state>AN></state> THIS IS THE TAIL END OF THE ABOVE TAG
<postcode>48180< /postcode>
</proceeding-address>-<proceeding-address> >
<identifier>387621</identifier>
<type-code>C</type-code>
<name>JOHN R GARBER</name>
<orgname>COOPER & DUNHAM LLP</orgname>
<address-1>1185 AVENUE OF THE AMERICAS</address-1>
<city>NEW YORK NEW YO</city>
<state>RK</state> THIS IS THE TAIL END OF THE ABOVE TAG
<postcode>10036</postcode>
</proceeding-address>-<proceeding-address><identifier>367755</identifier>
<type-code>C</type-code>
<name>STEVEN A. GIBSON</name>
<orgname>SANTORO DRIGGS WALCH KEARNEY ET AL</orgname>
<address-1>400 S FOURTH ST 3RD FL</address-1>
<city>LAS VEGAS NEVA</city>
<state>DA</state> THIS IS THE TAIL END OF THE ABOVE TAG
<postcode>89101</postcode>
</proceeding-address>You aren't validating the state code field
-<proceeding-address>
<identifier>292989</identifier>
<type-code>C</type-code>
< name>SALLY M. ABEL</ name>
<orgname><FENWICK & WEST LLP</orgname>
<address-1>TWO PALO ALTO SQUARE</address-1>
<city><PALTO ALTO</city>
<state>C</state> INVALID STATE CODE
<postcode>94306</postcode>
</proceeding-address><proceeding-address>
What code list are these from?
</proceeding-address>
<proceeding-address>
<identifier>298457</identifier>
<type-code>C</type-code>
<name>KRISTI A. ZENTNER</name>
<orgname>FAFINSKI AND WALLRICH, P.A.</orgname>
<address-1>STE. 100 DUNNE MANSION 337 OAK GROVE STREET</address-1>
<city>MINNEAPOLIS</city>
<state>M</state> INVALID STATE CODE
<postcode>55403</postcode>
</proceeding-address><proceeding-address>
<proceeding-address>
<identifier>391698</identifier>
<type-code>C</type-code>
<name>SUSAN UPTON DOUGLASS</name>
<orgname>FROSS ZELNICK LEHRMAN & ZISSU, P.C.</orgname>
<address-1>866 UNITED NATIONS PLAZA AT FIRST AVENUE & 48TH STREET</address-1>
<city>NEW YORK</city>
<state>N7</state> INVALID STATE CODE
<postcode>10017</postcode>
</proceeding-address>
<identifier>369899</identifier>
<type-code>C</type-code>
<name>PETER L. COSTAS</name>
<orgname>PEPE & HAZARD LLP</orgname>
<address-1>225 ASYLUM STREET</address-1>
<city>HARTFORD</city>
<state>CN</state> INVALID STATE CODE
<postcode>06103</postcode>
</proceeding-address><proceeding-address>
<identifier>386393</identifier>
<type-code>C</type-code>
<name>ROLAND W. BAGGOTT III</name>
<orgname>THE BAGGOTT LAW OFFICES, L.L.C.</orgname>
<address-1>1316 CHRISTOPHER COURT</address-1>
<city>METATRIE</city>
<state>LO</state> INVALID STATE CODE
<postcode>70001-3804</postcode>~~~
The above inquiry was presented to the appropriate area for investigation.
~~~
July 09, 2003
The response that was included in the June 13, 2003 document relating to the TWTF entry numbers in the 400-599 range is not understood. You've provided us with no information as to the entity numbers of the parties that were given "assignee" and "assignor" status. The single example also ignores the fact that a significant proportion of the records that have a 400-599 entry number also have two additional parties. Often A and B merged and the resulting entity is re-named. Unless a more detailed answer can be provided, it would seem that the elimination of an intermediate type causes loss of data.
This inquiry will be addressed again with the appropriate area.
~~~
July 09, 2003
We are seeing increasing issues in the timeliness of Trademark image delivery. For example, we have trademarks that have filing dates as of July 18 & July 19 and did not receive images on the image tape shipments on June 25 or July 2. Additionally, these images are available on TARR. Please review this issue.
Another example of this behavior are the following marks:
78267734
78266855They have filing dates of June 26 and we did not receive the images in the July 2 or July 9 image tapes.These images are also available on TARR.
This inquiry is being investigated into why the images are not synchronized.
~~~
July 11, 2003
On April 29,2003 the Trademark 24 Hour Box (An electronic file containing applications filed for a 24 hour period) had a format change resulting in not all of the information being captured.This change was subsequently brought to our attention by an external customer.
Effective with the 7/16/2003 Trademark 24 Hour Box the file names have been restored to their original naming convention.
The software that creates the Trademark 24 Hour Box has been corrected to restore the serial number/barcode.Serial Number 78263478, present in the Trademark 24 Hour Box of July 24, 2003, reflexes this correction. A style sheet change is now required at the front end of the capture procedure.This style sheet change is to be implemented during the week of July 28, 2003.
~~~July 11, 2003
The following documentation still does not exist.
Trademark Assignments XML DTD Element Documentation (Trademark-Assignments-v0.2-2003-05-19)
As also reported 6/5/2003 the Trademark Assignments XML Documentation to be updated with the correct Assignments DTD version number.
~~~
July 16, 2003
Here is some more information/errors....
Processing XML File ==> xml\030620\tt030620.xml
start:: Wed Jul 16 12:05:31 EDT 2003
[Fatal Error] tt030620.xml:145:67181: An invalid XML character (Unicode: 0x12) was found in the element content of the document.
error: Parse error occurred - An invalid XML character (Unicode: 0x12) was found in the element content of the document.Processing XML File ==> xml\030621\tt030621.xml
start:: Wed Jul 16 11:46:32 EDT 2003
[Fatal Error] tt030621.xml:144:390195: An invalid XML character (Unicode: 0x12) was found in the element content of the document.
error: Parse error occurred - An invalid XML character (Unicode: 0x12) was found in the element content of the document.Processing XML File ==> xml\030625\tt030625.xml
start:: Wed Jul 16 12:14:58 EDT 2003
[Fatal Error] tt030625.xml:141:728318: An invalid XML character (Unicode: 0x12) was found in the element content of the document.
error: Parse error occurred - An invalid XML character (Unicode: 0x12) was found in the element content of the document.Processing XML File ==> xml\030626\tt030626.xml
start:: Wed Jul 16 13:05:46 EDT 2003
[Fatal Error] tt030626.xml:143:292294: An invalid XML character (Unicode: 0x12) was found in the element content of the document.
error: Parse error occurred - An invalid XML character (Unicode: 0x12) was found in the element content of the document. All of the characters 0 through 31 and character 127 are nonprinting control characters. With the exception of characters 09, 10, and 13, (Ox09, Ox0A, and Ox0D) the others may NOT appear anywhere in an XML document.The above inquiry was presented to the appropriate area for investigation.
~~~
New Inquiries:
July 14, 2003
Is it possible to put in more line breaks into this file. The file is unable to be loaded into a normal text editor due to the line lengths (this is not true for the other xml files). Here is an example:
tt030701.xml, length of the longest line: 1309721, new line count: 252
The above inquiry was presented to the appropriate area for investigation. A correction will be made.
~~~
July 17, 2003
During the month of May there were 3 different totals of applications filed.
Trademark Weekly Text File (TWTF) had 19,421
TESS had 19,404
The Daily XML had 19,276
This is being investigated.
~~~
July 23, 2003
As per your document "Transition Plan Milestones and Dates" the TWTF is being discontinued October 1st, 2003. Will there be XML masters for each of the 3 file types (Trademark Applications, Assignments and Proceedings)? Will there be a XML monthly status file as we have now?
1. Retrospective XML masters files for each of the 3 file types will be available with the creation of the annual files at the completion of calendar year 2003. 2. A Monthly Status File will be available after the TWTF file is discontinued.
2. The Monthly Status File will remain in its current ASCII format until after the Madrid Protocol is completed in November 2003.
~~~
If you have any questions or need additional information please contact one of the following individuals:
Ed Johnson Marva Dubar Information Products Division Information Dissemination Data Dissemination Branch Systems Division (703) 306-2621 (703) 305-1669 (703) 306-2737 Fax (703) 308-5164 Fax Ed.Johnson@uspto.gov Marva.Dubar@uspto.gov
|
HOME | INDEX | SEARCH | SYSTEM STATUS | BUSINESS CENTER | NEWS&NOTICES | CONTACT US | PRIVACY STATEMENT |