Question Software to find common list entries ?

diliptimblo

Reputable
Oct 29, 2018
51
2
4,535
Given below are samples of trade list Report and a page of a contract note. The contract note is available only in PDF format and Trade list is available in exel and pdf format. What we want to know is which of the entries is common in both the trade list and the contract note. There will be 40 to 50 pages of contract notes and 20 to 30 pages of trade list. Please direct us to a software which will do this
Contract note sample


Order No.
Order Time
Trade No.
Trade Time
Security Description
Quantity Bought for You
Quantity Sold for You
Gross Rate Per Security (Rs.)
Nse F&O
130000000014313809:15:283975484909:15:31HINDALCO OPT 30JUL 20 PE @ 155
0​
8600​
2.80​
130000000014313809:15:283983092809:30:16HINDALCO OPT 30JUL 20 PE @ 155
0​
4300​
2.80​
130000000014313809:15:283983207509:30:32HINDALCO OPT 30JUL 20 PE @ 155
0​
4300​
2.80​
130000000014313809:15:283982456109:28:45HINDALCO OPT 30JUL 20 PE @ 155
0​
4300​
2.80​


Date​
Series​
Type​
Trade No​
Order No​
B/S​
Qty​
Rate​
Brokerage Per Share​
Stk Price​
Srv Tax​
STT​
CTT​
Stmp​
To Chrg​
CESS​
IGST​
SGST​
CGST​
UTGST​
SEBI FEES​
Net Amount​
10/03/2022BANKNIFTY FUT 31MAR 22IFB50034327.841.710.000.000.000.00343.28343.280.000.00108.14108.140.0017.16-17165696.95
10/03/2022BANKNIFTY OPT 17MAR 22 PE @ 34000IO/PES500659.700.8034000.000.00164.930.000.00174.820.000.0051.7351.730.000.33329006.46
10/03/2022CUB FUT 31MAR 22SFB68000123.400.000.000.000.000.00167.82167.820.000.0053.0453.040.008.39-8392071.71
10/03/2022CUB OPT 31MAR 22 PE @ 120SO/PES680003.610.00120.000.00122.660.000.00130.010.000.0047.2047.200.000.25244568.28
10/03/2022NIFTY FUT 31MAR 22IFB100016508.880.820.000.000.000.00330.18330.180.000.00104.01104.010.0016.51-16510585.29
10/03/2022NIFTY OPT 17MAR 22 PE @ 15700IO/PEB50027.300.4015700.000.000.000.000.417.230.000.0018.6518.650.000.01-13894.95
10/03/2022NIFTY OPT 17MAR 22 PE @ 16400IO/PES1000232.950.4016400.000.00116.470.000.00123.460.000.0047.1147.110.000.23232215.62
14/03/2022IBULHSGFIN FUT 28APR 22SFB31000148.040.000.000.000.000.0091.7991.790.000.0028.9128.910.004.59-4589870.39
14/03/2022IBULHSGFIN OPT 31MAR 22 PE @ 140SO/PES310006.000.00140.000.0093.000.000.0098.580.000.0026.7326.730.000.19185556.37
14/03/2022RECLTD FUT 28APR 22SFB60000125.610.000.000.000.000.00150.74150.740.000.0047.5947.590.007.54-7537682.20
14/03/2022RECLTD OPT 31MAR 22 PE @ 120SO/PES600001.600.00120.000.0048.000.000.0050.880.000.0022.4022.400.000.1095658.22
15/03/2022JKCEMENT OPT 31MAR 22 PE @ 2200SO/PES175066.000.112200.000.0057.750.000.0061.220.000.0023.5023.500.000.12115134.06
17/03/2022BANKNIFTY OPT 17MAR 22 PE @ 34000IO/PEB5000.000.0034000.000.000.000.000.000.000.000.000.000.000.000.000.00
17/03/2022NIFTY OPT 17MAR 22 PE @ 16400IO/PEB10000.000.0016400.000.000.000.000.000.000.000.000.000.000.000.000.00
-53007662.48
 

Ralston18

Titan
Moderator
What is a "common entry"? What specifically is being searched for occurences in each list?

= = = =

The upper table being, as labeled, the Contract Note list and the lower table being the Trade List (unlabled) - correct?

Overall it appears to me that you are working with many-to-many match ups between different columns/fields in the two lists (tables).

I do not see a common column/field in the tables that could be used to link the tables. That would be the first requirement before any searches for common entries.

Trade No would be a likely candidate but the Trade List does not appear to have any values in that column.

However, not absolutely necessary for just a brute force search that starts with some value (text, date, numeric) in the Contract List and then looks for the a matching value(s) in the Trade List.

Such a search would require identifying the fields (columns) to parse and then do the necessary string comparisons that indicate a match. And possibly some necessary conversions to compare "123.45" as text (.pdf) to "123.45" as a numeric value (Excel .xls)

Any number of ways that that could be done.

There are any number of apps that can be used to compare documents and flag differences. There may be simpler ways to meet the requirements: use a database or perhaps Python.

The Contract Note List appears structured enough that it could be imported into a database table or even Excel. The Trade List (Excel) should easily import into another table in the same date base and linked to Contract List via Trade No..

Given a choice I would use a database. Even if some of the data is from a linked Excel spreadsheet(s).

Especially if and as the number of pages grow in each list. Or the data requires updates and corrections.

What columns/fields are being compared to find matches and when matches are found how are the results to be presented? What fields to show and what is the desired output format?

Provide some examples of the search process and the expected results.

What, if any, software apps have been tried and tested? What worked and what did not work?

Consider doing it yourself and savings the costs of an app and/or licensing.

Overall more information needed.

That all said how much control, if any, do you have in the creation and maintenance of the two lists? A few changes could make the overall comparison process very straightforward and simple.

Just something to think about.