修改后的 baugh-wooley 算法乘法 verilog 代码不能正确乘法

2024-01-04

以下 verilog 源代码和/或测试平台可以很好地工作商业模拟器，iverilog https://www.edaplayground.com/x/3TuQ也形式化验证工具（yosys-smtbmc） https://gist.github.com/promach/5f2d9a9494704ed93cf65687c982198c

请将有关 `ifdef FORMAL 的投诉保留到稍后。我需要它们与 yosys-smtbmc 一起使用，它还不支持绑定命令。

我现在正在调试生成编码，因为乘法（使用修改的 baugh-wooley 算法）尚未起作用。

当 o_valid 置位时，乘法代码应给出 o_p = i_a * i_b = 3*2 = 6，但波形清楚地显示代码给出 o_p = 0x20 = 32

test_multiply.v

// Testbench
module test_multiply;

  parameter A_WIDTH=4, B_WIDTH=4;

  reg i_clk;
  reg i_reset;
  reg i_ce;
  reg signed[(A_WIDTH-1):0] i_a;
  reg signed[(B_WIDTH-1):0] i_b;
  wire signed[(A_WIDTH+B_WIDTH-1):0] o_p;
  wire o_valid;

  // Instantiate design under test
  multiply #(A_WIDTH, B_WIDTH) mul(.clk(i_clk), .reset(i_reset), .in_valid(i_ce), .in_A(i_a), .in_B(i_b), .out_valid(o_valid), .out_C(o_p));


  initial begin
    // Dump waves
    $dumpfile("test_multiply.vcd");
    $dumpvars(0, test_multiply);

    i_clk = 0;
    i_reset = 0;
    i_ce = 0;
    i_a = 0;
    i_b = 0;

  end

  localparam SMALLER_WIDTH = (A_WIDTH <= B_WIDTH) ? A_WIDTH : B_WIDTH;
  localparam NUM_OF_INTERMEDIATE_LAYERS = $clog2(SMALLER_WIDTH);

  genvar i, j; // array index

  generate
    for(i = 0; i < NUM_OF_INTERMEDIATE_LAYERS; i = i + 1) begin
        for(j = 0; j < SMALLER_WIDTH; j = j + 1) begin
            initial $dumpvars(0, test_multiply.mul.middle_layers[i][j]);
        end
    end
  endgenerate

  always #5 i_clk = !i_clk;

  initial begin

    @(posedge i_clk);
    @(posedge i_clk);

    $display("Reset flop.");

    i_reset = 1;

    @(posedge i_clk);
    @(posedge i_clk);

    i_reset = 0;

    @(posedge i_clk);
    @(posedge i_clk);

    i_ce = 1;
    i_a = 3;
    i_b = 2;

    #50 $finish;

  end

endmodule

乘法.v

module multiply #(parameter A_WIDTH=16, B_WIDTH=16)
(clk, reset, in_valid, out_valid, in_A, in_B, out_C); // C=A*B

`ifdef FORMAL
parameter A_WIDTH = 4;
parameter B_WIDTH = 4;
`endif

input clk, reset;
input in_valid; // to signify that in_A, in_B are valid, multiplication process can start
input signed [(A_WIDTH-1):0] in_A;
input signed [(B_WIDTH-1):0] in_B;
output signed [(A_WIDTH+B_WIDTH-1):0] out_C;
output reg out_valid; // to signify that out_C is valid, multiplication finished

/* 
   This signed multiplier code architecture is a combination of row adder tree and 
   modified baugh-wooley algorithm, thus requires an area of O(N*M*logN) and time O(logN)
   with M, N being the length(bitwidth) of the multiplicand and multiplier respectively

   see [url]https://i.imgur.com/NaqjC6G.png[/url] or 
   Row Adder Tree Multipliers in [url]http://www.andraka.com/multipli.php[/url] or
   [url]https://pdfs.semanticscholar.org/415c/d98dafb5c9cb358c94189927e1f3216b7494.pdf#page=10[/url]
   regarding the mechanisms within all layers

   In terms of fmax consideration: In the case of an adder tree, the adders making up the levels
   closer to the input take up real estate (remember the structure of row adder tree).  As the 
   size of the input multiplicand bitwidth grows, it becomes more and more difficult to find a
   placement that does not use long routes involving multiple switch nodes for FPGA.  The result
   is the maximum clocking speed degrades quickly as the size of the bitwidth grows.

   For signed multiplication, see also modified baugh-wooley algorithm for trick in skipping 
   sign extension (implemented as verilog example in [url]https://www.dsprelated.com/showarticle/555.php[/url]),
   thus smaller final routed silicon area.

   [url]https://stackoverflow.com/questions/54268192/understanding-modified-baugh-wooley-multiplication-algorithm/[/url]

   All layers are pipelined, so throughput = one result for each clock cycle 
   but each multiplication result still have latency = NUM_OF_INTERMEDIATE_LAYERS 
*/


// The multiplication of two numbers is equivalent to adding as many copies of one 
// of them, the multiplicand, as the value of the other one, the multiplier.
// Therefore, multiplicand always have the larger width compared to multipliers

localparam SMALLER_WIDTH = (A_WIDTH <= B_WIDTH) ? A_WIDTH : B_WIDTH;
localparam LARGER_WIDTH = (A_WIDTH > B_WIDTH) ? A_WIDTH : B_WIDTH;

wire [(LARGER_WIDTH-1):0] MULTIPLICAND = (A_WIDTH > B_WIDTH) ? in_A : in_B ;
wire [(SMALLER_WIDTH-1):0] MULTIPLIPLIER = (A_WIDTH <= B_WIDTH) ? in_A : in_B ;

`ifdef FORMAL
// to keep the values of multiplicand and multiplier before the multiplication finishes 
reg [(LARGER_WIDTH-1):0] MULTIPLICAND_reg;
reg [(SMALLER_WIDTH-1):0] MULTIPLIPLIER_reg;

always @(posedge clk)
begin
    if(reset) begin
        MULTIPLICAND_reg <= 0;
        MULTIPLIPLIER_reg <= 0;
    end

    else if(in_valid) begin
        MULTIPLICAND_reg <= MULTIPLICAND;
        MULTIPLIPLIER_reg <= MULTIPLIPLIER;
    end
end
`endif

localparam NUM_OF_INTERMEDIATE_LAYERS = $clog2(SMALLER_WIDTH);


/*Binary multiplications and additions for partial products rows*/

// first layer has "SMALLER_WIDTH" entries of data of width "LARGER_WIDTH"
// This resulted in a binary tree with faster vertical addition processes as we have 
// lesser (NUM_OF_INTERMEDIATE_LAYERS) rows to add

// intermediate partial product rows additions
// Imagine a rhombus of height of "SMALLER_WIDTH" and width of "LARGER_WIDTH"
// being re-arranged into binary row adder tree
// such that additions can be done in O(logN) time

//reg [(NUM_OF_INTERMEDIATE_LAYERS-1):0][(SMALLER_WIDTH-1):0][(A_WIDTH+B_WIDTH-1):0] middle_layers;
reg [(A_WIDTH+B_WIDTH-1):0] middle_layers[(NUM_OF_INTERMEDIATE_LAYERS-1):0][0:(SMALLER_WIDTH-1)];
//reg [(NUM_OF_INTERMEDIATE_LAYERS-1):0] middle_layers [0:(SMALLER_WIDTH-1)] [(A_WIDTH+B_WIDTH-1):0];
//reg middle_layers [(NUM_OF_INTERMEDIATE_LAYERS-1):0][0:(SMALLER_WIDTH-1)][(A_WIDTH+B_WIDTH-1):0];

generate // duplicates the leafs of the binary tree

    genvar layer; // layer 0 means the youngest leaf, layer N means the tree trunk

    for(layer=0; layer<NUM_OF_INTERMEDIATE_LAYERS; layer=layer+1) begin: intermediate_layers

        integer pp_index; // leaf index within each layer of the tree
        integer bit_index; // index of binary string within each leaf

        always @(posedge clk)
        begin
            if(reset) 
            begin
                for(pp_index=0; pp_index<SMALLER_WIDTH ; pp_index=pp_index+1)
                    middle_layers[layer][pp_index] <= 0;
            end

            else begin

                if(layer == 0)  // all partial products rows are in first layer
                begin

                    // generation of partial products rows
                    for(pp_index=0; pp_index<SMALLER_WIDTH ; pp_index=pp_index+1)
                        middle_layers[layer][pp_index] <= 
                        (MULTIPLICAND & MULTIPLIPLIER[pp_index]);

                    // see modified baugh-wooley algorithm: [url]https://i.imgur.com/VcgbY4g.png[/url] from
                                        // page 122 of book "Ultra-Low-Voltage Design of Energy-Efficient Digital Circuits"
                    for(pp_index=0; pp_index<SMALLER_WIDTH ; pp_index=pp_index+1)
                        middle_layers[layer][pp_index][LARGER_WIDTH-1] <= 
                       !middle_layers[layer][pp_index][LARGER_WIDTH-1];

                    middle_layers[layer][SMALLER_WIDTH-1] <= !middle_layers[layer][SMALLER_WIDTH-1];
                    middle_layers[layer][0][LARGER_WIDTH] <= 1;
                    middle_layers[layer][SMALLER_WIDTH-1][LARGER_WIDTH] <= 1;
                end

                // adding the partial product rows according to row adder tree architecture
                else begin
                    for(pp_index=0; pp_index<(SMALLER_WIDTH >> layer) ; pp_index=pp_index+1)
                        middle_layers[layer][pp_index] <=
                        middle_layers[layer-1][pp_index<<1] +
                      (middle_layers[layer-1][(pp_index<<1) + 1]) << 1;

                    // bit-level additions using full adders
                    /*for(pp_index=0; pp_index<SMALLER_WIDTH ; pp_index=pp_index+1)
                        for(bit_index=0; bit_index<(LARGER_WIDTH+layer); bit_index=bit_index+1)
                            full_adder fa(.clk(clk), .reset(reset), .ain(), .bin(), .cin(), .sum(), .cout());*/
                end
            end
        end
    end

endgenerate

assign out_C = (reset)? 0 : middle_layers[NUM_OF_INTERMEDIATE_LAYERS-1][0];


/*Checking if the final multiplication result is ready or not*/

reg [($clog2(NUM_OF_INTERMEDIATE_LAYERS)-1):0] out_valid_counter; // to track the multiply stages
reg multiply_had_started;

always @(posedge clk)
begin
    if(reset) 
    begin
        multiply_had_started <= 0;
        out_valid <= 0;
        out_valid_counter <= 0;
    end

    else if(out_valid_counter == NUM_OF_INTERMEDIATE_LAYERS-1) begin
        multiply_had_started <= 0;
        out_valid <= 1;
        out_valid_counter <= 0;
    end

    else if(in_valid && !multiply_had_started) begin
        multiply_had_started <= 1;
        out_valid <= 0; // for consecutive multiplication
    end

    else begin
        out_valid <= 0;
        if(multiply_had_started) out_valid_counter <= out_valid_counter + 1;
    end
end


`ifdef FORMAL

initial assume(reset);
initial assume(in_valid == 0);

wire sign_bit = MULTIPLICAND_reg[LARGER_WIDTH-1] ^ MULTIPLIPLIER_reg[SMALLER_WIDTH-1];

always @(posedge clk)
begin
    if(reset) assert(out_C == 0);

    else if(out_valid) begin
        assert(out_C == (MULTIPLICAND_reg * MULTIPLIPLIER_reg));
        assert(out_C[A_WIDTH+B_WIDTH-1] == sign_bit);
    end
end

`endif

`ifdef FORMAL

localparam user_A = 3;
localparam user_B = 2;

always @(posedge clk)
begin
    cover(in_valid && (in_A == user_A) && (in_B == user_B));
    cover(out_valid);
end

`endif

endmodule

问题已解决：以下代码现在在 vivado 模拟和形式验证中的 cover() 中给出了正确的有符号乘法结果。

See 乘法.v https://gist.github.com/promach/5f2d9a9494704ed93cf65687c982198c#file-multiply-v以及对应的正确波形

module multiply #(parameter A_WIDTH=16, B_WIDTH=16)
(clk, reset, in_valid, out_valid, in_A, in_B, out_C); // C=A*B

`ifdef FORMAL
parameter A_WIDTH = 4;
parameter B_WIDTH = 6;
`endif

input clk, reset;
input in_valid; // to signify that in_A, in_B are valid, multiplication process can start
input signed [(A_WIDTH-1):0] in_A;
input signed [(B_WIDTH-1):0] in_B;
output signed [(A_WIDTH+B_WIDTH-1):0] out_C;
output reg out_valid; // to signify that out_C is valid, multiplication finished

/* 
   This signed multiplier code architecture is a combination of row adder tree and 
   modified baugh-wooley algorithm, thus requires an area of O(N*M*logN) and time O(logN)
   with M, N being the length(bitwidth) of the multiplicand and multiplier respectively

   see https://i.imgur.com/NaqjC6G.png or 
   Row Adder Tree Multipliers in http://www.andraka.com/multipli.php or
   https://pdfs.semanticscholar.org/415c/d98dafb5c9cb358c94189927e1f3216b7494.pdf#page=10
   regarding the mechanisms within all layers

   In terms of fmax consideration: In the case of an adder tree, the adders making up the levels
   closer to the input take up real estate (remember the structure of row adder tree).  As the 
   size of the input multiplicand bitwidth grows, it becomes more and more difficult to find a
   placement that does not use long routes involving multiple switch nodes for FPGA.  The result
   is the maximum clocking speed degrades quickly as the size of the bitwidth grows.

   For signed multiplication, see also modified baugh-wooley algorithm for trick in skipping 
   sign extension (implemented as verilog example in https://www.dsprelated.com/showarticle/555.php),
   thus smaller final routed silicon area.
   https://stackoverflow.com/questions/54268192/understanding-modified-baugh-wooley-multiplication-algorithm/

   All layers are pipelined, so throughput = one result for each clock cycle 
   but each multiplication result still have latency = NUM_OF_INTERMEDIATE_LAYERS 
*/


// The multiplication of two numbers is equivalent to adding as many copies of one 
// of them, the multiplicand, as the value of the other one, the multiplier.
// Therefore, multiplicand always have the larger width compared to multipliers

localparam SMALLER_WIDTH = (A_WIDTH <= B_WIDTH) ? A_WIDTH : B_WIDTH;
localparam LARGER_WIDTH = (A_WIDTH > B_WIDTH) ? A_WIDTH : B_WIDTH;

wire [(LARGER_WIDTH-1):0] MULTIPLICAND = (A_WIDTH > B_WIDTH) ? in_A : in_B ;
wire [(SMALLER_WIDTH-1):0] MULTIPLIER = (A_WIDTH <= B_WIDTH) ? in_A : in_B ;


// to keep the values of multiplicand and multiplier before the multiplication finishes 
reg signed [(LARGER_WIDTH-1):0] MULTIPLICAND_reg;
reg signed [(SMALLER_WIDTH-1):0] MULTIPLIER_reg;

always @(posedge clk)
begin
    if(reset) begin
        MULTIPLICAND_reg <= 0;
        MULTIPLIER_reg <= 0;
    end

    else if(in_valid) begin
        MULTIPLICAND_reg <= MULTIPLICAND;
        MULTIPLIER_reg <= MULTIPLIER;
    end
end


localparam NUM_OF_INTERMEDIATE_LAYERS = $clog2(SMALLER_WIDTH);


/*Binary multiplications and additions for partial products rows*/

// first layer has "SMALLER_WIDTH" entries of data of width "LARGER_WIDTH"
// This resulted in a binary tree with faster vertical addition processes as we have 
// lesser (NUM_OF_INTERMEDIATE_LAYERS) rows to add

// intermediate partial product rows additions
// Imagine a rhombus of height of "SMALLER_WIDTH" and width of "LARGER_WIDTH"
// being re-arranged into binary row adder tree
// such that additions can be done in O(logN) time

//reg [(NUM_OF_INTERMEDIATE_LAYERS-1):0][(SMALLER_WIDTH-1):0][(A_WIDTH+B_WIDTH-1):0] middle_layers;
reg signed [(A_WIDTH+B_WIDTH-1):0] middle_layers[NUM_OF_INTERMEDIATE_LAYERS:0][0:(SMALLER_WIDTH-1)];
//reg [(NUM_OF_INTERMEDIATE_LAYERS-1):0] middle_layers [0:(SMALLER_WIDTH-1)] [(A_WIDTH+B_WIDTH-1):0];
//reg middle_layers [(NUM_OF_INTERMEDIATE_LAYERS-1):0][0:(SMALLER_WIDTH-1)][(A_WIDTH+B_WIDTH-1):0];

generate // duplicates the leafs of the binary tree

    genvar layer; // layer 0 means the youngest leaf, layer N means the tree trunk

    for(layer=0; layer<=NUM_OF_INTERMEDIATE_LAYERS; layer=layer+1) begin: intermediate_layers

        integer pp_index; // leaf index within each layer of the tree

        always @(posedge clk)
        begin
            if(reset) 
            begin
                for(pp_index=0; pp_index<SMALLER_WIDTH ; pp_index=pp_index+1)
                    middle_layers[layer][pp_index] <= 0;
            end

            else begin      

                if(layer == 0)  // all partial products rows are in first layer
                begin               
                    // generation of partial products rows
                    for(pp_index=0; pp_index<SMALLER_WIDTH ; pp_index=pp_index+1)
                        middle_layers[layer][pp_index] <= MULTIPLIER[pp_index] ? MULTIPLICAND:0;    

                    // see modified baugh-wooley algorithm: https://i.imgur.com/VcgbY4g.png from
                    // page 122 of book: Ultra-Low-Voltage Design of Energy-Efficient Digital Circuits
                    for(pp_index=0; pp_index<(SMALLER_WIDTH-1) ; pp_index=pp_index+1) // MSB inversion
                        middle_layers[layer][pp_index][LARGER_WIDTH-1] <= 
                        (MULTIPLICAND[LARGER_WIDTH-1] & MULTIPLIER[pp_index]) ? 0:1;

                    for(pp_index=(LARGER_WIDTH-SMALLER_WIDTH); pp_index<(LARGER_WIDTH-1) ; pp_index=pp_index+1) // last partial product row inversion
                        //the starting index is to consider the condition where A_WIDTH != B_WIDTH
                        middle_layers[layer][SMALLER_WIDTH-1][pp_index] <= 
                        (MULTIPLICAND[pp_index] & MULTIPLIER[SMALLER_WIDTH-1]) ? 0:1;

                    middle_layers[layer][0][LARGER_WIDTH] <= 1;
                    middle_layers[layer][SMALLER_WIDTH-1][LARGER_WIDTH] <= 1;
                end

                // adding the partial product rows according to row adder tree architecture
                else begin
                    for(pp_index=0; pp_index<(SMALLER_WIDTH >> layer) ; pp_index=pp_index+1)
                    begin
                        if(pp_index==0)
                            middle_layers[layer][pp_index] <=
                            middle_layers[layer-1][0] +
                            (middle_layers[layer-1][1] << layer);

                        else middle_layers[layer][pp_index] <=
                            middle_layers[layer-1][pp_index<<1] +
                            (middle_layers[layer-1][(pp_index<<1) + 1] << layer);
                    end
                end
            end
        end
    end

endgenerate


assign out_C = (reset)? 0 : mul_result;


// both A and B are of negative numbers
wire both_negative = MULTIPLICAND_reg[LARGER_WIDTH-1] & MULTIPLIER_reg[SMALLER_WIDTH-1]; 

/*
the following is to deal with the shortcomings of the published modified baugh-wooley algorithm
which does not handle the case where A_WIDTH != B_WIDTH

The countermeasure does not do "To build a 6x4 multplier you can build a 6x6 multiplier, but replicate 
the sign bit of the short word 3 times, and ignore the top 2 bits of the result." , instead it uses 
some smart tricks/logic described by the signal 'modify_result'. The signal 'modify_result' is not 
asserted when one number is positive, and another is negative.

Please use pencil and paper method (and signals waveform) to verify or understand this.
I did not do a rigorous math proof on this countermeasure.

Instead I modify the "modified baugh-wooley algorithm" by debugging wrong multiplication results from
formal verification cover(in_valid && (in_A == A_value) && (in_B == B_value)); waveforms 
together with manual handwritten multiplication on paper. 

The countermeasure is considered successful when assert(out_C == (MULTIPLICAND_reg * MULTIPLIER_reg)); 
passed during cover() verification

Besides, the last partial product row inversion mechanism is also modified to handle this shortcoming
*/

wire modify_result = (A_WIDTH == B_WIDTH) || ((A_WIDTH != B_WIDTH) && both_negative);

wire signed [(A_WIDTH+B_WIDTH-1):0] mul_result;

assign mul_result = (modify_result) ?
        middle_layers[NUM_OF_INTERMEDIATE_LAYERS][0] : 
       {{(LARGER_WIDTH-SMALLER_WIDTH){sign_bit}} ,
        middle_layers[NUM_OF_INTERMEDIATE_LAYERS][0][LARGER_WIDTH +: SMALLER_WIDTH] , 
        middle_layers[NUM_OF_INTERMEDIATE_LAYERS][0][0 +: SMALLER_WIDTH]} ;



/*Checking if the final multiplication result is ready or not*/

reg [($clog2(NUM_OF_INTERMEDIATE_LAYERS)-1):0] out_valid_counter; // to track the multiply stages
reg multiply_had_started;

always @(posedge clk)
begin
    if(reset) 
    begin
        multiply_had_started <= 0;
        out_valid <= 0;
        out_valid_counter <= 0;
    end

    else if(out_valid_counter == NUM_OF_INTERMEDIATE_LAYERS-1) begin
        multiply_had_started <= 0;
        out_valid <= 1;
        out_valid_counter <= 0;
    end

    else if(in_valid && !multiply_had_started) begin
        multiply_had_started <= 1;
        out_valid <= 0; // for consecutive multiplication
    end

    else begin
        out_valid <= 0;
        if(multiply_had_started) out_valid_counter <= out_valid_counter + 1;
    end
end


wire sign_bit = MULTIPLICAND_reg[LARGER_WIDTH-1] ^ MULTIPLIER_reg[SMALLER_WIDTH-1];

`ifdef FORMAL

initial assume(reset);
initial assume(in_valid == 0);


always @(posedge clk)
begin
    if(reset) assert(out_C == 0);

    else if(out_valid) begin
        assert(out_C == (MULTIPLICAND_reg * MULTIPLIER_reg));
        assert(out_C[A_WIDTH+B_WIDTH-1] == sign_bit);
    end
end

`endif

`ifdef FORMAL

wire signed [(A_WIDTH-1):0] A_value = $anyconst;
wire signed [(B_WIDTH-1):0] B_value = $anyconst;

always @(posedge clk)
begin
    assume(A_value != 0);
    assume(B_value != 0);
    cover(in_valid && (in_A == A_value) && (in_B == B_value));
    cover(out_valid);
end

`endif

endmodule

本文内容由网友自发贡献，版权归原作者所有，本站不承担相应法律责任。如您发现有涉嫌抄袭侵权的内容，请联系:hwhale#tublm.com(使用前将#替换为@)

Algorithm

verilog

FPGA

Systemverilog

multiplication

修改后的 baugh-wooley 算法乘法 verilog 代码不能正确乘法的相关文章

Google 文档如何处理编辑冲突？

我一直在尝试编写自己的 Javascript 编辑器其功能类似于 Google Docs 允许多人同时使用我不明白一件事假设用户 A 和用户 B 直接相互连接网络延迟为 10 毫秒我假设编辑器使用 diff 系统据我了解 Doc
神经网络的层和神经元

我想更多地了解神经网络我正在开发一个 C 程序来制作神经网络但我坚持使用反向传播算法很抱歉没有提供一些工作代码我知道有很多库可以用多种语言创建神经网络但我更喜欢自己制作一个关键是我不知道要实现特定目标例如模式识别或函数近似或其
填充体积算法

我有一个具有一定尺寸长度宽度高度的盒子我有不同长度宽度高度的物品是否有现有的算法可以确定放入盒子中的最佳物品这称为装箱切割库存背包问题并且是 NP 难问题一般来说您只能通过使用启发式方法获得近似解请参见示例 htt
计算字符串的所有子串中子序列的出现次数

我想编写一个算法来计算字符串的所有子字符串中字符子序列不相交出现的总数下面是一个例子字符串 jabcohnnyjohnny 后续约翰尼包含子序列的子字符串 jabcohnny jabcohnnyj jabcohnnyjo jab
重写修改后的 goto 语义的算法

我有一大堆使用旧的自行设计的脚本语言编写的遗留代码我们将它们编译翻译成 javascript 该语言有条件跳转跳转到标签与普通 goto 语句的区别在于不可能向后跳转该语言中没有嵌套的 if 语句或循环由于 javascrip
将每列的值乘以 R 中另一个 data.frame 中的权重

我有两个data frames df and weights 代码如下 df看起来像这样 id a b d EE f 1 this 0 23421153 0 02324956 0 5457353 0 73068586 0 5642554 2
如何光栅化旋转矩形（通过 setpixel 在 2d 中）

我有四个 2d 顶点 A B C D 的旋转矩形我需要在像素缓冲区中有效地光栅化绘制它使用 setpixel x y 颜色怎么做我正在尝试使用一些代码例如 convertilg a b c d do up down left
求先递增后递减列表的最大值和最小值

我尝试用谷歌搜索这个问题但没有取得太大成功我确信这个问题或类似问题有一个技术名称但我似乎找不到答案给定一个列表L整数即严格递增然后严格递减找到该列表的最大值和最小值例如 L可能 1 2 3 4 5 4 3 2 or 2 4
颜色逻辑算法

我们正在构建一个体育应用程序并希望将团队颜色融入到应用程序的各个部分现在每个团队都可以使用几种不同的颜色来表示我想做的是执行检查以验证两个团队颜色是否在彼此一定的范围内这样我就不会显示两个相似的颜色因此如果团队 1 的主要团队
用于查找最近邻居的空间划分算法如何工作？

为了找到最近的邻居空间分区 http en wikipedia org wiki Nearest neighbor search Space partitioning是算法之一它是如何工作的假设我有一组 2D 点 x 和 y 坐标并
当满足动态条件时退出递归函数

使用来自的函数生成汉明距离 t 内的所有比特序列 https stackoverflow com questions 40813022 generate all sequences of bits within hamming distan
数组所有可能的组合

我有一个字符串数组 ted williams golden voice radio 我希望这些关键字的所有可能组合采用以下形式 ted williams golden voice radio ted williams ted golden
查找字符串中最常见的子字符串的算法

是否有任何算法可用于查找字符串中最常见的短语或子字符串例如以下字符串将 hello world 作为其最常见的两个单词短语 hello world this is hello world hello world repeats thr
Java 中查看 ArrayList 是否包含对象的最有效方法

我有一个 Java 对象的 ArrayList 这些对象有四个字段我用其中两个字段来将对象视为与另一个对象相等我正在寻找最有效的方法给定这两个字段以查看数组是否包含该对象问题在于这些类是基于 XSD 对象生成的因此我无法修改类本
对 Java 中 *any* 类的所有实例进行全排序

我不确定以下代码是否能确保 Comparator 的 Javadoc 中给出的所有条件 class TotalOrder
从二叉堆中查找第 k 个最小元素的 O(klogk) 时间算法

我们有一个 n 节点二叉堆其中包含n不同的项目根部的最小项目为一个k lt n 发现O klogk 时间算法选择kth堆中的最小元素 O klogn 很明显但无法找出O klogk 一也许我们可以使用第二个堆但不确定好吧你的
while循环的时间复杂度是多少？

我正在尝试找出 while 循环的时间复杂度但我不知道从哪里开始我了解如何找到 for 循环的复杂性类别但是当涉及到 while 循环时我完全迷失了关于从哪里开始有什么建议提示吗这是一个问题的示例 x 0 A n some a
最小化代表性整数的误差之和

Given n integers between 0 10000 as D1 D2 Dn where there may be duplicates and n can be huge I want to find k distinct r
如何实现n个元素的查找和插入操作的动态二分查找

这个想法是使用多个数组每个长度为 2 k 根据 n 的二进制表示来存储 n 个元素每个数组都是排序的不同的数组没有以任何方式排序在上述数据结构中 SEARCH是通过对每个数组进行一系列二分查找来进行的 INSERT 是通过一系列相同
找到一个数字所属的一组范围

我有一个 200k 行的数字范围列表例如开始位置停止位置该列表包括除了非重叠的重叠之外的所有类型的重叠列表看起来像这样 3 5 10 30 15 25 5 15 25 35 我需要找到给定数字所属的范围并对 100k 个数字重复该

随机推荐

Docker-compose 在已安装的卷上设置用户和组

我正在尝试将 docker compose 中的卷安装到 apache 映像问题是我的 docker 中的 apache 运行在www data www data但挂载的目录是在下面创建的root root 如何指定挂载目录的用户我尝
如何在svn中以vimdiff风格查看svn diff

我开始在 Linux 上使用 Subversion svn diff给人一种非常神秘的感觉对眼睛来说非常非常不友好我如何解释它的输出更重要的是有没有一种方法可以查看 vimdiff 那种简洁风格的差异两个文件将并排打开编辑文件
如何从mysql表中删除重复行

我需要从mysql表中删除重复记录所以我有一个表名 employee 字段是 empid empname empssn 为了获取重复记录我编写了一个查询 SELECT COUNT empssn empssn FROM employee
clang：错误：-Z-reserved-lib-stdc++：更新到 xCode 4.4 后存在“-c”时，“链接器”输入未使用[关闭]

Closed 这个问题是无法重现或由拼写错误引起 help closed questions 目前不接受答案我刚刚将 xcode 更新到 4 4 版本我当前正在开发的项目正在抱怨 clang error Z reserved lib s
通过视图模型将滚动查看器滚动到顶部

我将 ScrollViewer 与 MVVM 模式一起使用并且 ScrollViewer 包装了一个项目列表例如
.net 如何将字符串生成器与字符串进行比较

在 vb net 或 C 中我无法弄清楚如何将字符串生成器与字符串进行比较我已经搜索了很多但找不到答案我必须编写自己的例程难道就没有更好的办法吗这不起作用 Dim s As String abc Dim sb As New St
使用 SNOW 在 R 2.11 Windows 64 位中进行并行处理不太有效

我在具有 8 个处理器的 WinXP64 计算机上运行 R 2 11 64 位在 R 2 10 1 中以下代码生成了 6 个 R 进程以进行并行处理 require foreach require doSNOW cl makeClust
为什么32位机器上的mongodb数据大小限制为2GB，而不是4GB？

as here http docs mongodb org manual faq fundamentals faq 32 bit limitations值得一提的是在具有单个 mongod 实例的 32 位机器中 mongodb 的数据大
为什么 std::function 的初始化器必须是可复制构造的？

根据http en cppreference com w cpp utility function function function http en cppreference com w cpp utility functional fu
任何 Javascript DateTime 函数都会返回 MySQL 日期时间格式吗？

我有一个函数可以将 ajax 请求发送到带有参数的服务器文件问题是我想发送当前的日期时间值以与数据库进行比较作为 MySQL 日期时间格式所有日期时间均显示为 2012 02 03 19 50 28 格式我如何在 Javascri
Rpy2导入外部R文件

我想执行 R 文件中的 R 脚本我怎样才能加载它我想source path to file R 在 python 代码中但是使用这个名称只需导入 Rsource功能和其他一样 import rpy2 robjects as robj
将 Angular 2 应用程序部署到 AWS Elastic Beanstalk 中

我现在正在使用 Angular 2 开发一个前端应用程序该 id 用于与部署在云中的另一个微服务进行通信在此 Angular 2 应用程序中我没有使用数据库交互我需要在 elastic beanstalk AWS PaaS 中部署这
遍历继承层次结构的 static_cast 与dynamic_cast

我看到一本关于 C 的书提到使用静态强制转换来导航继承层次结构比使用动态强制转换更有效 Example include
使用 ctree 显示每个节点的体积，在 R 中绘图

任何人都可以告诉我如何在每个节点中添加体积而不是最终的节点体积 t lt ctree is return a b c plot t type simple 我的树看起来像我如何修改该图使其在每个圆形节点上显示 N 而不仅仅是黑色或最终
为什么 LEFT JOIN 与 WHERE 子句一起使用时不返回 NULL 记录？ [复制]

这个问题在这里已经有答案了今天我尝试了一些更复杂的 MySQL 查询我注意到 MySQL 的 LEFT JOIN 不适用于 WHERE 子句我的意思是它确实返回一些记录但不会返回右侧为空的记录例如假设我们有表 albums a
了解：从 PHP 数组到 Python？

这是 PHP 和其他编程语言中的常见任务我从 PHP 开发人员转过来我想通过这个收藏来确定一下任何精通Python的人请帮助我清楚地理解这是我从 PHP 代码中收集的 gt Python 里有什么 gt Python 里有什么
Angular 2 测试 - process.env

我尝试在应用程序中模拟请求但流程变量存在问题我将后端 API 的 URL 存储在 process env backend Url 中然后在休息服务中我有 constructor private http Http this rest
将 char* 复制到 char*

这是我的代码的一部分 extern C REGISTRATION API int extreme char lKey string s1 char p NULL try ifstream myfile extreme txt int i 0
Android在没有edittext视图的情况下捕获硬件键盘事件

如何在不使用 EditText 字段的情况下捕获硬件键盘事件例如在一个简单的活动中在屏幕上显示一个正方形当在滑动键盘上按下 B 时我想将其变为蓝色当按下 G 时将其变为绿色等等我不需要颜色代码的帮助只需要如何拦截按键这
修改后的 baugh-wooley 算法乘法 verilog 代码不能正确乘法

以下 verilog 源代码和或测试平台可以很好地工作商业模拟器 iverilog https www edaplayground com x 3TuQ也形式化验证工具 yosys smtbmc https gist github com

修改后的 baugh-wooley 算法乘法 verilog 代码不能正确乘法

修改后的 baugh-wooley 算法乘法 verilog 代码不能正确乘法 的相关文章

随机推荐

热门标签

修改后的 baugh-wooley 算法乘法 verilog 代码不能正确乘法的相关文章